Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2604.12374 • Published 7 days ago • 35
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems Paper • 2604.14228 • Published 7 days ago • 18
Youssofal/MiniMax-M2.7-Abliterated-Heretic-GGUF Text Generation • 229B • Updated 6 days ago • 4.94k • 35
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data Paper • 2604.14164 • Published 29 days ago • 31
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 5 days ago • 53
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 71