Junbo Li
MartingaleSF
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs
Organizations
None yet