Wanwei He
Grocery
AI & ML interests
LLM
Recent Activity
liked a model 7 days ago
Qwen/Qwen3.5-35B-A3B
commented on a paper 6 months ago
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
upvoted a paper 6 months ago
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR