arxiv:2603.22446
Kexin Huang
737443h
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization authored a paper 1 day ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation authored a paper 1 day ago
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMsOrganizations
None yet