Chasing the Counting Manifold in Open LLMs 📚 • 22 • Counting manifolds in open LLMs from behavior to SAEs.
The Smol Training Playbook 📚 • 3.07k • The secrets to building world-class LLMs
aimagelab/LLaVA_MORE-llama_3_1-8B-finetuning Image-Text-to-Text • 8B • Updated Aug 2, 2025 • 279 • 11
The Ultra-Scale Playbook 🌌 • 3.76k • The ultimate guide to training LLM on large GPU Clusters
Model Memory Utility 🚀 • 1.01k • Calculate VRAM needed to train and run Hugging Face models
Beam Search Visualizer ✍ • 161 • View how beam search decoding works, in detail!