mlx-community/DeepSeek-R1-Distill-Qwen-32B-abliterated-4bit Text Generation • 5B • Updated Feb 20 • 332 • 5
mlx-community/DeepSeek-R1-Distill-Qwen-32B-abliterated Text Generation • 33B • Updated Feb 20 • 433 • • 3
Running 3.61k The Ultra-Scale Playbook 🌌 3.61k The ultimate guide to training LLM on large GPU Clusters