GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 20 days ago • 366
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models? Paper • 2603.22582 • Published about 1 month ago • 7