DavidAU/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning Text Generation • 8B • Updated about 6 hours ago • 67 • 27
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 10 days ago • 59
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF Text Generation • 18B • Updated Dec 1, 2025 • 57.7k • 450
nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct Text Generation • 8B • Updated Apr 17, 2025 • 411 • 121