Constantin
Alexandre-Numind
AI & ML interests
Training AI models @Numind
Recent Activity
upvoted a paper about 23 hours ago
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning upvoted an article 7 days ago
From GRPO to DAPO and GSPO: What, Why, and How liked
a model 15 days ago
Qwen/Qwen3.5-9B