Running Featured 39 Porting nanochat to Transformers: an AI modeling history lesson π 39 Learn about ML and Transformers through nanochat
Running 42 The Eiffel Tower Llama π 42 Explore the Eiffel Tower Llama experiment with open-source models
Qwen/Qwen3-VL-235B-A22B-Instruct Image-Text-to-Text β’ 236B β’ Updated 12 days ago β’ 69.7k β’ β’ 325
Running 3.55k The Ultra-Scale Playbook π 3.55k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 990 Model Memory Utility π 990 Calculate vRAM needed for model training and inference
Running 15 Transformers Modular Refactor π» 15 Interactive analyzer for modular models in Transformers lib