view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 10 days ago • 234
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains Paper • 2511.04962 • Published Nov 7 • 52
Running 3.56k The Ultra-Scale Playbook 🌌 3.56k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 2.58k The Smol Training Playbook 📚 2.58k The secrets to building world-class LLMs
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 158
DiffGuard: Text-Based Safety Checker for Diffusion Models Paper • 2412.00064 • Published Nov 25, 2024 • 3
DiffGuard: Text-Based Safety Checker for Diffusion Models Paper • 2412.00064 • Published Nov 25, 2024 • 3