Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Junyi Li
ProvenceStar
AI & ML interests
Multimodal Model, Reinforcement Learning
Recent Activity
liked a model about 1 month ago
zsgvivo/videozoomer
upvoted a paper about 1 month ago
Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection
upvoted a paper about 2 months ago
DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
Organizations
None yet