Siyuan Li

Lupin1998
https://lupin1998.github.io/
  • LupinLSY
  • Lupin1998
  • siyuan-li-lupin1998

AI & ML interests

Network Design, Self-supervised Learning, Computer Vision, Data-centric ML, AI for Science

Organizations

MogaNet, OpenSTL, ICML2023, odl-raiser, OpenRaiser

Collections 2

LLMS
  • Taming LLMs by Scaling Learning Rates with Gradient Grouping

    Paper • 2506.01049 • Published Jun 1, 2025 • 38
AIGC
  • MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

    Paper • 2504.00999 • Published Apr 1, 2025 • 95
  • Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

    Paper • 2409.12191 • Published Sep 18, 2024 • 78

Papers 29

arxiv:2511.14806
arxiv:2511.11134
arxiv:2510.23479
arxiv:2507.21049

Models 1

Lupin1998/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Aug 29, 2025

Datasets 0

None public yet