Yuhui Wang
Michael109
·
AI & ML interests
None yet
Organizations
RL
-
CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models
Paper • 2505.12504 • Published • 24 -
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
Paper • 2505.15277 • Published • 104 -
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
Paper • 2505.00703 • Published • 44 -
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Paper • 2505.08617 • Published • 41
3D construction
RL
-
CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models
Paper • 2505.12504 • Published • 24 -
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
Paper • 2505.15277 • Published • 104 -
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
Paper • 2505.00703 • Published • 44 -
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Paper • 2505.08617 • Published • 41
models
0
None public yet
datasets
0
None public yet