Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Boqiang Zhang's picture
10 8

Boqiang Zhang

Cyril666
21world's profile picture violetliang's profile picture
ยท
https://cyrilsterling.github.io/
  • CyrilSterling

AI & ML interests

Multi-modal Large Language Models Vision-Language-Action Models

Recent Activity

authored a paper 3 days ago
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
authored a paper 3 days ago
What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
authored a paper 3 days ago
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
View all activity

Organizations

Language Technology Lab at Alibaba DAMO Academy's profile picture Auden's profile picture pg-team's profile picture

Cyril666 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs