Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b Viewer • Updated 30 days ago • 306k • 11.4k • 312
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Paper • 2503.19855 • Published Mar 25, 2025 • 29