Running on CPU Upgrade Featured 2.57k The Smol Training Playbook π 2.57k The secrets to building world-class LLMs
Running 3.55k The Ultra-Scale Playbook π 3.55k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation β’ 71B β’ Updated Feb 24 β’ 361k β’ β’ 733
Running on CPU Upgrade Featured 990 Model Memory Utility π 990 Calculate vRAM needed for model training and inference
BAAI/bge-reranker-v2-minicpm-layerwise Text Classification β’ 3B β’ Updated Mar 19, 2024 β’ 1.78k β’ 63