MercedeSnape's Collections
sandbox
Updated 2 days ago
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper • 2601.16206 • Published Jan 22 • 86
Note
RL in a sandbox; it looks like they developed a general-purpose sandbox?