13 49 58

Tong Zhu

Spico

https://Spico197.github.io

AI & ML interests

Information Extraction, Mixture-of-Experts, LLM

Recent Activity

upvoted a paper about 19 hours ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

upvoted a paper about 19 hours ago

PlayCoder: Making LLM-Generated GUI Code Playable

upvoted a paper 22 days ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

View all activity

Organizations

upvoted 2 papers about 19 hours ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published 2 days ago • 25

PlayCoder: Making LLM-Generated GUI Code Playable

Paper • 2604.19742 • Published 2 days ago • 20

upvoted a paper 22 days ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published 24 days ago • 86

commented a paper 26 days ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 28 days ago • 131 •

upvoted an article about 2 months ago

Article

Your MoE Model Does Not Have to Select Fixed Number of Experts

Feb 26

•

published an article about 2 months ago

Article

Your MoE Model Does Not Have to Select Fixed Number of Experts

Feb 26

•

upvoted an article 2 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

310

upvoted a paper 2 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

upvoted a paper 3 months ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published Feb 5 • 28

liked a dataset 3 months ago

librarian-bots/paper-recommendations-v2

Viewer • Updated Feb 21 • 9.99k • 27 • 16

upvoted a paper 3 months ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

New activity in nvidia/Nemotron-Competitive-Programming-v1 3 months ago

User's content is empty in "competitive_coding_python"

#1 opened 3 months ago by

uwesis

upvoted 3 papers 3 months ago

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Paper • 2601.11969 • Published Jan 17 • 27

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published Jan 15 • 63

Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published Jan 20 • 57

upvoted an article 3 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20, 2024

•

113

authored 4 papers 4 months ago

LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Paper • 2411.15708 • Published Nov 24, 2024

Iterative Value Function Optimization for Guided Decoding

Paper • 2503.02368 • Published Mar 4, 2025 • 15

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Paper • 2503.05447 • Published Mar 7, 2025 • 8

Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models

Paper • 2503.16779 • Published Mar 21, 2025 • 1

Tong Zhu

AI & ML interests

Recent Activity

Organizations

Spico's activity

Your MoE Model Does Not Have to Select Fixed Number of Experts

Your MoE Model Does Not Have to Select Fixed Number of Experts

Transformers v5: Simple model definitions powering the AI ecosystem

User's content is empty in "competitive_coding_python"

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models