Hamish Ivison

hamishivi

https://ivison.id.au

AI & ML interests

NLP :)

Recent Activity

updated a model about 4 hours ago

hamishivi/tmax-qwen3.5-4b-sft-20260313-mlx

published a model about 4 hours ago

hamishivi/tmax-qwen3.5-4b-sft-20260313-mlx

updated a model about 23 hours ago

hamishivi/random_rewards_8401_step2500

Organizations

Collections 8

Papers 14

arxiv:2512.13961

arxiv:2511.19399

arxiv:2511.07317

arxiv:2503.01807

Models 240

hamishivi/tmax-qwen3.5-4b-sft-20260313-mlx

Text Generation • 4B • Updated about 4 hours ago

hamishivi/random_rewards_8401_step2500

8B • Updated about 23 hours ago

hamishivi/random_rewards_step1_5k

8B • Updated 7 days ago • 33

hamishivi/random_rewards_step2k

8B • Updated 8 days ago • 33

hamishivi/step500_test

196k • Updated 10 days ago • 30

hamishivi/random_rewards_step1000

8B • Updated 19 days ago • 27

hamishivi/rl_rag_random_rewards_step500

8B • Updated 26 days ago • 28

hamishivi/1412_rl_rag_open_judge_citation_step2500

8B • Updated Feb 9 • 3

hamishivi/1412_rl_rag_open_judge_citation_step_2000

8B • Updated Feb 4 • 1

hamishivi/1412_rl_rag_open_judge_citation_1237_step1500

8B • Updated Jan 29 • 4

Datasets 199

hamishivi/rlenv-appworld-eval

Viewer • Updated 13 days ago • 57 • 29

hamishivi/rlenv-appworld-train

Viewer • Updated 13 days ago • 90 • 29

hamishivi/rlenv-appworld-eval-nothink

Viewer • Updated 13 days ago • 57 • 7

hamishivi/rlenv-appworld-train-nothink

Viewer • Updated 13 days ago • 90 • 7

hamishivi/rlenv-guess-number-nothink

Viewer • Updated 14 days ago • 100 • 25

hamishivi/rlenv-counter-nothink

Viewer • Updated 14 days ago • 100 • 21

hamishivi/agent-task-combined

Preview • Updated 15 days ago • 146

hamishivi/rlenv-guess-number

Viewer • Updated 15 days ago • 100 • 22

hamishivi/rlenv-counter

Viewer • Updated 15 days ago • 100 • 11

hamishivi/rlenv-wordle-nothink

Viewer • Updated 16 days ago • 2k • 99
