Mobile Vision Perception Lab

community

https://github.com/mvp-ai-lab

AI & ML interests

multi-modal foundation models

Recent Activity

ICGenAIShare04 published a Space about 14 hours ago

mvp-lab/cfm_svc

ICGenAIShare04 updated a Space 6 days ago

mvp-lab/cfm_svc

ICGenAIShare01 updated a Space 7 days ago

mvp-lab/midi-steering-demo

View all activity

Collections 2

spaces 23

CFM SVC

Singing Voice Conversion Based on CFM

MIDI-LLM Style Steering Demo

Generate styled MIDI music with playback and download options

MultiSubjectVTON

Multi-subject VTON model

Dungeon of Decisions

Play an AI‑driven D&D adventure with text, image, and voice

SyncAI

AI Music Ads Generator

Character Based AI Paper tutor

Generate lecture summary and quiz from a PDF paper

models 2

mvp-lab/ControlNet_Weight

Updated 12 days ago

mvp-lab/LLaVA-OneVision-1.5-8B-RL

9B • Updated Dec 3, 2025 • 26 • 2

datasets 6

mvp-lab/LLaVA-OneVision-1.5-RL-Data

Viewer • Updated Jan 6 • 69.2k • 346 • 6

mvp-lab/LLaVA-OneVision-1.5-Mid-Training-85M

Viewer • Updated Nov 24, 2025 • 91.5M • 158k • 60

mvp-lab/LLaVA-OneVision-1.5-Instruct-Data

Viewer • Updated Nov 21, 2025 • 21.9M • 56.5k • 65

mvp-lab/LLaVA-558K-Webdataset

Updated Oct 21, 2025 • 590 • 4

mvp-lab/LLaVA-NeXT-780k-webdataset

Updated Oct 11, 2025 • 1.1k

mvp-lab/LLaVA-OneVision-1.5-Mid-Training-Webdataset-Quick-Start-3M

Updated Sep 20, 2025 • 5.9k • 2