Gongxun Li
AlexGeek
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
upvoted
a
paper
about 2 hours ago
Does Your Reasoning Model Implicitly Know When to Stop Thinking?
liked
a model
10 days ago
AIDC-AI/Ovis2.6-30B-A3B