Models for "RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments" - https://arxiv.org/abs/2511.07317
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a model about 4 hours ago
hamishivi/tmax-qwen3.5-4b-sft-20260313-mlx published
a model about 4 hours ago
hamishivi/tmax-qwen3.5-4b-sft-20260313-mlx updated
a model about 23 hours ago
hamishivi/random_rewards_8401_step2500