r-g2-2024/Llama-3.1-70B-Instruct-multimodal-JP-Graph-v0.1 • Visual Question Answering • 71B • Updated Jul 30, 2025 • 95 • 19
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated 23 days ago • 248k • 1.55k