SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks Paper • 2602.12670 • Published Feb 13 • 56
gradientai/Llama-3-8B-Instruct-Gradient-1048k Text Generation • 8B • Updated Oct 29, 2024 • 11.8k • 680
gradientai/Llama-3-70B-Instruct-Gradient-1048k Text Generation • 71B • Updated Oct 28, 2024 • 31 • 122
prometheus-eval/prometheus-8x7b-v2.0 Text Generation • 47B • Updated Nov 29, 2024 • 1.06k • 49
Mantis Collection Mantis model family optimized for multi-image reasoning with interleaved text/image format • 10 items • Updated 23 days ago • 11
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs Paper • 2404.16375 • Published Apr 25, 2024 • 18 • 2