Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 8 days ago • 309
Appear2Meaning: A Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images Paper • 2604.07338 • Published 8 days ago • 5
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 14 days ago • 470
Devy1/Qwen2.5-Coder-CONTROL-checkpoints_multi_language_2k-1.5B-Base-3 2B • Updated 14 days ago • 18 • 1
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 263