Ministral 3 - Additional Checkpoints Collection Different formats and Quantized versions of our Ministral 3 family; 14B/8B/3B Instruct/Reasoning GGUF, 3B Instruct ONNX and 14B/8B/3B Instruct BF16. • 13 items • Updated 6 days ago • 13
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 5 days ago • 54
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 8 days ago • 226
view article Article Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness Nov 5 • 10
Environment Hub Collection A collection of OpenEnv-spec Environments • 6 items • Updated 29 days ago • 15
view article Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning +2 Oct 27 • 71
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 145