Chen JingQi
KyrinChen
AI & ML interests
Agent & RL
Recent Activity
upvoted a paper 25 days ago
AI Can Learn Scientific Taste upvoted a paper about 1 month ago
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning upvoted a paper 2 months ago
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative OptimizationOrganizations
None yet