Pause or Fabricate? Training Language Models for Grounded Reasoning Paper • 2604.19656 • Published 8 days ago • 10
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification Paper • 2604.14258 • Published 14 days ago • 23
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published 16 days ago • 141
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation Paper • 2604.08455 • Published 20 days ago • 47
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 27 days ago • 99