arxiv:2506.11930
Jiang
Dongwei
AI & ML interests
None yet
Organizations
models
17
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata
Text Generation
•
8B
•
Updated
•
5
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_longer
Text Generation
•
8B
•
Updated
•
6
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr
Text Generation
•
8B
•
Updated
•
12
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Text Generation
•
8B
•
Updated
•
10
Dongwei/Qwen-2.5-7B_Base_Math_smalllr
Text Generation
•
8B
•
Updated
•
6
•
6
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
•
8B
•
Updated
•
6
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
•
2B
•
Updated
•
9
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
•
2B
•
Updated
•
12
Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
•
8B
•
Updated
•
8
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
•
8B
•
Updated
•
7