sandbox - a MercedeSnape Collection

updated 2 days ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22

Note: RL in sandbox. Apparently they developed a general-purpose sandbox?