EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper • 2601.15876 • Published 21 days ago • 89
oaimli/longtune_scitrek_grounding_reinforcement_gemma_5_alex Image-Text-to-Text • 4B • Updated Dec 22, 2025
oaimli/longtune_scitrek_grounding_reinforcement_gemma_5_alex Image-Text-to-Text • 4B • Updated Dec 22, 2025
oaimli/longtune_hotpotqa_grounding_reinforcement_qwen_5_225 Text Generation • 4B • Updated Dec 22, 2025 • 1
oaimli/longtune_hotpotqa_grounding_reinforcement_qwen_5_225 Text Generation • 4B • Updated Dec 22, 2025 • 1
oaimli/longtune_hotpotqa_reasoning_reinforcement_qwen Text Generation • 4B • Updated Dec 11, 2025 • 2
oaimli/longtune_hotpotqa_reasoning_reinforcement_qwen Text Generation • 4B • Updated Dec 11, 2025 • 2