yang
dearaj23
·
AI & ML interests
None yet
Recent Activity
authored a paper about 12 hours ago
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization upvoted a paper 1 day ago
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization updated a collection 2 months ago
Agent BenchmarkOrganizations
None yet
RL
multi-agent
-
MALT: Improving Reasoning with Multi-Agent LLM Training
Paper • 2412.01928 • Published • 46 -
Multi-Agent System for Comprehensive Soccer Understanding
Paper • 2505.03735 • Published • 25 -
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation
Paper • 2510.09116 • Published • 97 -
basicv8vc/SimpleQA
Viewer • Updated • 4.33k • 2.58k • 31
CoT
memory
deep research
-
Scaling Agents via Continual Pre-training
Paper • 2509.13310 • Published • 117 -
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
Paper • 2509.13305 • Published • 91 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 109 -
Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents
Paper • 2510.14438 • Published • 14
LLM
survey
Agent Benchmark
memory
RL
deep research
-
Scaling Agents via Continual Pre-training
Paper • 2509.13310 • Published • 117 -
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning
Paper • 2509.13305 • Published • 91 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 109 -
Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents
Paper • 2510.14438 • Published • 14
multi-agent
-
MALT: Improving Reasoning with Multi-Agent LLM Training
Paper • 2412.01928 • Published • 46 -
Multi-Agent System for Comprehensive Soccer Understanding
Paper • 2505.03735 • Published • 25 -
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation
Paper • 2510.09116 • Published • 97 -
basicv8vc/SimpleQA
Viewer • Updated • 4.33k • 2.58k • 31
LLM
CoT
survey