Shizhe Diao
shizhediao2
18 followers · 13 following
https://shizhediao.github.io/
shizhediao
AI & ML interests
LLM pre-training and reasoning
Recent Activity
Upvoted a paper 2 days ago: GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Liked a model 4 days ago: nvidia/Nemotron-Flash-1B
Updated a dataset 25 days ago: nvidia/ToolScale
Organizations
shizhediao2's datasets
None public yet