TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 9 days ago • 106
Synthetic Sandbox for Training Machine Learning Engineering Agents Paper • 2604.04872 • Published 9 days ago • 14
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 263