MobileLLM-R1 Collection MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21, 2025 • 27
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 211
Nemotron-Pre-Training-Datasets Collection Large-scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 9 days ago • 84
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published Aug 14, 2025 • 145
SmolDocling datasets Collection Datasets used to train SmolDocling • 6 items • Updated Jul 31, 2025 • 31
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Paper • 2507.23726 • Published Jul 31, 2025 • 114
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30, 2025 • 68
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning Paper • 2507.16812 • Published Jul 22, 2025 • 63
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, excelling at agentic intelligence • 5 items • Updated Nov 14, 2025 • 162
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16, 2025 • 273
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published Jun 13, 2025 • 73
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 123
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published Feb 14, 2025 • 55
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips Paper • 1906.03327 • Published Jun 7, 2019 • 1