Running Featured 23 Chasing the Counting Manifold in Open LLMs 📚 23 Counting manifolds in open LLMs from behavior to SAEs.
Running on CPU Upgrade Featured 3.08k The Smol Training Playbook 📚 3.08k The secrets to building world-class LLMs
aimagelab/LLaVA_MORE-llama_3_1-8B-finetuning Image-Text-to-Text • 8B • Updated Aug 2, 2025 • 336 • 11
Running 3.76k The Ultra-Scale Playbook 🌌 3.76k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 1.01k Model Memory Utility 🚀 1.01k Calculate VRAM needed to train and run Hugging Face models
Runtime error Featured 161 Beam Search Visualizer ✍ 161 View how beam search decoding works, in detail!