glenn ba's picture

glenn ba

glennba

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

liked a model 12 days ago

prism-ml/Bonsai-8B-gguf

upvoted a paper 12 days ago

Towards a Medical AI Scientist

View all activity

Organizations

None yet

upvoted a paper about 5 hours ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 339

liked a model 12 days ago

prism-ml/Bonsai-8B-gguf

Text Generation • 8B • Updated 4 days ago • 101k • 661

upvoted a paper 12 days ago

Towards a Medical AI Scientist

Paper • 2603.28589 • Published 24 days ago • 89

updated a collection 12 days ago

texts papers

9 items • Updated 12 days ago

upvoted a paper 12 days ago

Grad2Reward: From Sparse Judgment to Dense Rewards for Improving Open-Ended LLM Reasoning

Paper • 2602.01791 • Published Feb 2 • 1

updated a collection 12 days ago

texts papers

9 items • Updated 12 days ago

upvoted a paper 12 days ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 426

upvoted 2 articles 12 days ago

Article

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

+3

14 days ago

•

29

Article

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

16 days ago

•

59

updated a collection 12 days ago

texts papers

9 items • Updated 12 days ago

upvoted a paper 12 days ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 182

updated a collection 13 days ago

texts papers

9 items • Updated 12 days ago

upvoted a paper 13 days ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 145

updated a collection 13 days ago

texts papers

9 items • Updated 12 days ago

upvoted a paper 13 days ago

ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning

Paper • 2603.05863 • Published Mar 6 • 6

upvoted 2 papers 16 days ago

InCoder-32B-Thinking: Industrial Code World Model for Thinking

Paper • 2604.03144 • Published 20 days ago • 232

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published 20 days ago • 34