AI & ML interests
None defined yet.
Recent Activity
View all activity
models 34
AIPlans/Qwen3-0.6B-ORPO-Crosscoder-MixedDataset
Updated
AIPlans/Qwen3-0.6B-GRPO-Crosscoder-MixedDataset
Updated
AIPlans/Qwen3-0.6B-KTO-Crosscoder-MixedDataset
Updated
AIPlans/Qwen3-0.6B-IPO-Crosscoder-MixedDataset
Updated
AIPlans/Crosscoder_GRPO
Updated
AIPlans/Qwen3-0.6B-ReMax
Reinforcement Learning • 0.6B • Updated
• 2 • 2
AIPlans/Qwen3-0.6B-GRPO-RM_NVIDIA
Text Generation • 0.6B • Updated
• 10
AIPlans/Qwen3-0.6B-GRPO_Epoch2
Text Generation • 0.6B • Updated
• 1
AIPlans/Qwen3-0.6B-GRPO_Epoch1
Text Generation • 0.6B • Updated
• 4
AIPlans/Qwen3-0.6B-GRPO
Updated
datasets 17
AIPlans/Helpsteer2-helpfulness-prompts
Viewer
• Updated
• 7.22k • 20
AIPlans/helpsteer2-helpfulness-preference-cleaned
Viewer
• Updated
• 6.99k • 19
AIPlans/trackio-experiments
Updated
• 7
AIPlans/ultrafeedback_binarized_chinese
Viewer
• Updated
• 14k • 20
AIPlans/ultrafeedback_binarized
Viewer
• Updated
• 14k • 23
AIPlans/FilteredPKU-SafeRLHF_chinese
Viewer
• Updated
• 12k • 9
AIPlans/FilteredPKU-SafeRLHF
Viewer
• Updated
• 12k • 12
AIPlans/SafetyBench_WithLabels_Better_chinese
Viewer
• Updated
• 546 • 87
AIPlans/SafetyBench_WithLabels
Viewer
• Updated
• 546 • 88
AIPlans/ToxiGen_chinese
Viewer
• Updated
• 1k • 95