Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 8 days ago • 73
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 8 days ago • 120
Real-time Vision Models Collection A collection of real-time detectors. • 19 items • Updated 17 days ago • 21
Jan-v2-VL Collection Jan-v2-VL: an 8B VLM focused on reliable, many-step task execution. • 6 items • Updated 28 days ago • 37
view article Article Transformer’ları Cebe Sığdırmak: Modelleri Optimize Edip Uç Cihazlarda Çalıştırma Nov 3 • 4
view article Article How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare Oct 28 • 19
Granite 4.0 Collection IBM's new Granite-4.0 models! Run Dynamic GGUFs or fine-tune with Unsloth. • 38 items • Updated 8 days ago • 21
Guided Decoding and Its Critical Role in Retrieval-Augmented Generation Paper • 2509.06631 • Published Sep 8 • 10
Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications Paper • 2509.17671 • Published Sep 22 • 9
mmBERT: a modern multilingual encoder Collection mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9 • 49