Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled-GGUF • Text Generation • 2B • Updated 5 days ago • 30.3k • 92
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled • Text Generation • 28B • Updated 4 days ago • 40.7k • 450
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better • Paper • 2602.05393 • Published Feb 5 • 8
Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models • Paper • 2512.06266 • Published Dec 6, 2025 • 8
DavidAU/ERNIE-4.5-37B-A3B-Thinking-Brainstorm20x • Text Generation • 37B • Updated Sep 17, 2025 • 6 • 4
DavidAU/MN-CaptainErisNebula-Chimera-v1.1-THINKING-ClaudeOpus4.5-12B-heretic-uncensored • Text Generation • 12B • Updated Jan 7 • 238 • 9