YijuGuo
AI & ML interests
LLM Alignment
Recent Activity
upvoted a paper 12 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper 16 days ago
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research authored
a paper
21 days ago
Controllable Preference Optimization: Toward Controllable
Multi-Objective Alignment