Bo Liu

Benjamin-eecs

https://benjamin-eecs.github.io/

AI & ML interests

Reinforcement Learning, Reasoning, Machine Learning Systems

Recent Activity

authored a paper 28 days ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Paper • 2603.18886 • Published Mar 19 • 6

upvoted a paper 29 days ago

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Paper • 2603.18886 • Published Mar 19 • 6

upvoted a paper 3 months ago

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents

Paper • 2601.18217 • Published Jan 26 • 13

liked a dataset 4 months ago

facebook/principia-bench

Viewer • Updated Dec 18, 2025 • 2.24k • 337 • 19

upvoted a paper 4 months ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 39

liked a dataset 5 months ago

facebook/principia-collection

Viewer • Updated Dec 19, 2025 • 554k • 362 • 44

authored a paper 5 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 83

upvoted a paper 5 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 83

upvoted a paper 6 months ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3, 2025 • 32

authored a paper 6 months ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28, 2025 • 18

upvoted a paper 6 months ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28, 2025 • 18

authored a paper 6 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

upvoted a paper 6 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

authored a paper 6 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 277

upvoted 2 papers 6 months ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1, 2025 • 60

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 277
