Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Qin Zhou's picture
In a Training Loop 🔄
6 2

Qin Zhou

Matrix53
·
https://matrix53.github.io
  • ACMatrix53
  • Matrix53

AI & ML interests

Computer Vision, Diffusion Model, Video Generation

Recent Activity

authored a paper about 3 hours ago
ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models
authored a paper about 3 hours ago
Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
upvoted a collection about 5 hours ago
Papers
View all activity

Organizations

None yet

Collections 1

Papers
  • ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models

    Paper • 2506.09740 • Published Jun 11 • 1
  • Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter

    Paper • 2309.02773 • Published Sep 6, 2023 • 1
Papers
  • ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models

    Paper • 2506.09740 • Published Jun 11 • 1
  • Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter

    Paper • 2309.02773 • Published Sep 6, 2023 • 1

Papers 2

arxiv:2506.09740
arxiv:2309.02773

spaces 1

Running

Vietnamese Handwriting

🏃

Generate Vietnamese handwriting

May 18, 2022

models 0

None public yet

datasets 1

Matrix53/elbo-t2ialign

Updated Sep 14 • 20
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs