zbx's picture

5

zbx

27cups

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Pause or Fabricate? Training Language Models for Grounded Reasoning

upvoted a paper 8 days ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

upvoted a paper 15 days ago

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Pause or Fabricate? Training Language Models for Grounded Reasoning

Paper • 2604.19656 • Published 8 days ago • 10

upvoted a paper 8 days ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

Paper • 2604.14258 • Published 14 days ago • 23

upvoted a paper 15 days ago

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 16 days ago • 141

upvoted a paper 18 days ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published 20 days ago • 47

upvoted a paper 26 days ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published 27 days ago • 99