Article • KV Caching Explained: Optimizing Transformer Inference Efficiency • Jan 30, 2025 • 233
Article • Transformers v5: Simple model definitions powering the AI ecosystem • Dec 1, 2025 • 302
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 7 items • Updated 3 days ago • 146
baidu/ERNIE-4.5-VL-28B-A3B-Thinking Image-Text-to-Text • 30B • Updated about 14 hours ago • 771 • 522