Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
s
august66
Follow
Kyleyee's profile picture
callmespring's profile picture
mamba413's profile picture
3 followers
·
2 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 3 hours ago
august66/hh_helpfulness_mc_rewards_DPO
published
a dataset
about 3 hours ago
august66/hh_helpfulness_mc_rewards_DPO
updated
a model
about 6 hours ago
august66/hh_qwen1.5_drpo_gated_fixed_beta
View all activity
Organizations
models
17
Sort: Recently updated
august66/hh_qwen1.5_drpo_gated_fixed_beta
2B
•
Updated
about 2 hours ago
•
23
august66/hh_qwen1.5_DM_2e-6
2B
•
Updated
1 day ago
•
14
august66/hh_qwen1.5_drpo_laplace_fixed_beta
2B
•
Updated
2 days ago
•
68
august66/hh_qwen1.5_IS_KL_Laplace
2B
•
Updated
3 days ago
•
13
august66/hh_qwen_1.5b_sft_dpo_generation
Text Generation
•
2B
•
Updated
4 days ago
•
44
august66/hh_qwen1.5_IS_KL
2B
•
Updated
9 days ago
•
24
august66/hh_qwen1.5_drpo_fixed_beta
2B
•
Updated
15 days ago
•
33
august66/hh_qwen1.5_IS_CLIP
2B
•
Updated
15 days ago
•
40
august66/hh_qwen1.5_drpo_adaptive_beta
Updated
15 days ago
august66/hh_qwen1.5_is_clip_1000_5e6
2B
•
Updated
16 days ago
•
22
View 17 models
datasets
38
Sort: Recently updated
august66/hh_helpfulness_mc_rewards_DPO
Updated
about 3 hours ago
august66/hh_helpfulness_qwen2.5_1.5b_generation_stochastic
Viewer
•
Updated
3 days ago
•
46.1k
•
5
august66/hh_helpfulness_mc_rewards_IS_clip
Viewer
•
Updated
3 days ago
•
46.1k
•
10
august66/hh_helpfulness_qwen2.5_1.5b_generation_dpo
Viewer
•
Updated
4 days ago
•
46.1k
•
20
august66/hh_helpfulness_qwen2.5_1.5b_generation
Viewer
•
Updated
10 days ago
•
46.1k
•
60
august66/hh_helpfulness_mc_rewards
Viewer
•
Updated
10 days ago
•
46.1k
•
17
august66/hh_helpfulness_drpo_from_sft
Viewer
•
Updated
17 days ago
•
46.1k
•
505
august66/hh_helpful_base
Viewer
•
Updated
23 days ago
•
46.1k
•
233
august66/hh_harmless_base
Viewer
•
Updated
24 days ago
•
44.8k
•
17
august66/drpo_hh_qwen2.5_1.5b_with_ref_prob_vllm_conv
Viewer
•
Updated
25 days ago
•
43.8k
•
37
View 38 datasets