MultiRL

non-profit

Recent Activity

KimSHine updated a model about 1 hour ago

MultiRL/qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action

KimSHine published a model about 2 hours ago

MultiRL/qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action

KimSHine updated a model about 11 hours ago

MultiRL/qwen3_1.7b_webshop_atomic_action_epoch2

MultiRL's models (187)

MultiRL/qwen3_4b_easy_rl_new

4B • Updated Dec 16, 2025

MultiRL/qwen3_1.7b_easy_rl_gspo

2B • Updated Dec 16, 2025 • 1

MultiRL/qwen3_4b_sft_new

4B • Updated Dec 15, 2025 • 1

MultiRL/qwen3_1.7b_easy_rl_final_step120

2B • Updated Dec 15, 2025

MultiRL/qwen3_4b_medium_rl_final

4B • Updated Dec 15, 2025

MultiRL/qwen3_4b_sft_one_act

4B • Updated Dec 14, 2025 • 1

MultiRL/qwen3_1.7b_easy_rl_reinforce_ori

2B • Updated Dec 14, 2025 • 4

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_0.5

2B • Updated Dec 14, 2025 • 1

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_1

2B • Updated Dec 14, 2025 • 2

MultiRL/qwen3_1.7b_easy_rl_reinforce_alpha_0

2B • Updated Dec 14, 2025 • 2

MultiRL/qwen3_1.7b_sft_one_act

2B • Updated Dec 14, 2025 • 1

MultiRL/qwen3_1.7b_easy_rl_final

2B • Updated Dec 13, 2025 • 1

MultiRL/qwen3_4b_easy_rl_final

4B • Updated Dec 13, 2025

MultiRL/qwen3_1.7b_sft_final

2B • Updated Dec 11, 2025 • 2

MultiRL/qwen3_4b_sft_final

4B • Updated Dec 11, 2025 • 1

MultiRL/qwen3_1.7b_easy_rl_new

2B • Updated Dec 6, 2025

MultiRL/qwen3_4b_standard_medium_rl

4B • Updated Dec 6, 2025

MultiRL/qwen3_4b_standard_easy_rl

4B • Updated Dec 5, 2025 • 1

MultiRL/qwen3_4b_medium_rl_progress_C

4B • Updated Dec 5, 2025

MultiRL/qwen3_4b_medium_rl

4B • Updated Dec 4, 2025

MultiRL/qwen3_1.7b_easy_rl_test_task_group

2B • Updated Dec 1, 2025 • 7

MultiRL/qwen3_1.7b_easy_rl_test

2B • Updated Nov 30, 2025 • 6

MultiRL/qwen3_1.7b_sudoku_sft

2B • Updated Nov 28, 2025 • 1

MultiRL/qwen3_1.7b_easy_reinforce_batch_32_by_pass

2B • Updated Nov 26, 2025

MultiRL/qwen3_1.7b_easy_reinforce_batch_64_by_pass

2B • Updated Nov 25, 2025

MultiRL/qwen3_1.7b_easy_reinforce_test

2B • Updated Nov 23, 2025

MultiRL/qwen3_1.7b_C_easy_gspo_test

2B • Updated Nov 22, 2025

MultiRL/qwen3_1.7b_base_C_normal_short_sft_lr_1e_5_C_easy_grpo_step70

2B • Updated Nov 17, 2025

MultiRL/qwen3_1.7b_C_short_sft_lr_1e_5_C_easy_reinforce_step80

2B • Updated Nov 15, 2025

MultiRL/qwen3_1.7b_base_C_normal_concise_sft_lr_5e_6

2B • Updated Nov 15, 2025