AI & ML interests
multi-modal foundation models
Recent Activity
-
mvp-lab/LLaVA-OneVision-1.5-Instruct-Data
Viewer • Updated • 21.9M • 56.5k • 65 -
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-85M
Viewer • Updated • 91.5M • 158k • 60 -
lmms-lab/LLaVA-OneVision-1.5-8B-Instruct
Image-Text-to-Text • 9B • Updated • 40.7k • 62 -
lmms-lab/LLaVA-OneVision-1.5-4B-Instruct
Image-Text-to-Text • 5B • Updated • 4.81k • 18
-
mvp-lab/LLaVA-OneVision-1.5-Instruct-Data
Viewer • Updated • 21.9M • 56.5k • 65 -
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-85M
Viewer • Updated • 91.5M • 158k • 60 -
lmms-lab/LLaVA-OneVision-1.5-8B-Instruct
Image-Text-to-Text • 9B • Updated • 40.7k • 62 -
lmms-lab/LLaVA-OneVision-1.5-4B-Instruct
Image-Text-to-Text • 5B • Updated • 4.81k • 18
spaces 23
Running on Zero
CFM SVC
🎙
Singing Voice Conversion Based on CFM
Running on Zero
MIDI-LLM Style Steering Demo
🎹
Generate styled MIDI music with playback and download options
Running on Zero
MultiSubjectVTON
📊
Multi-subject VTON model
Sleeping
Dungeon of Decisions
⚔
Play an AI‑driven D&D adventure with text, image, and voice
Sleeping
SyncAI
🎵
AI Music Ads Generator
Sleeping
Character Based AI Paper tutor
😈
Generate lecture summary and quiz from a PDF paper
datasets 6
mvp-lab/LLaVA-OneVision-1.5-RL-Data
Viewer
• Updated
• 69.2k • 346 • 6
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-85M
Viewer
• Updated
• 91.5M • 158k • 60
mvp-lab/LLaVA-OneVision-1.5-Instruct-Data
Viewer
• Updated
• 21.9M • 56.5k • 65
mvp-lab/LLaVA-558K-Webdataset
Updated
• 590 • 4
mvp-lab/LLaVA-NeXT-780k-webdataset
Updated
• 1.1k
mvp-lab/LLaVA-OneVision-1.5-Mid-Training-Webdataset-Quick-Start-3M
Updated
• 5.9k • 2