
Brendan Slevin

brend007

brendan-slevin-ab7496a7

AI & ML interests

None yet

Recent Activity

liked a model about 9 hours ago

deepseek-ai/DeepSeek-V3.2

liked a model about 9 hours ago

embedl/Cosmos-Reason2-2B-W4A16-Edge2

liked a model about 21 hours ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Organizations

None yet

liked 2 models about 9 hours ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 327k • 1.27k

embedl/Cosmos-Reason2-2B-W4A16-Edge2

Image-Text-to-Text • 1B • Updated 4 days ago • 4.22k • 9

liked a model about 21 hours ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Text Generation • 28B • Updated 1 day ago • 260 • 24

upvoted a paper 2 days ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published 4 days ago • 22

liked a dataset 2 days ago

WinkingFace/CryptoLM-Solana-SOL-USDT

Viewer • Updated Mar 19, 2025 • 32.3k • 76 • 10

liked a model 2 days ago

nvidia/Nemotron-Terminal-8B

Text Generation • 8B • Updated 1 day ago • 217 • 14

upvoted a paper 3 days ago

PyVision-RL: Forging Open Agentic Vision Models via RL

Paper • 2602.20739 • Published 5 days ago • 28

upvoted a paper 5 days ago

EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots

Paper • 2602.18071 • Published 9 days ago • 22

upvoted a paper 6 days ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 12 days ago • 99

upvoted 2 articles 9 days ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 605

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Article • 10 days ago • 469

upvoted 2 papers 10 days ago

DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories

Paper • 2602.10809 • Published 18 days ago • 52

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published 17 days ago • 98

upvoted an article 12 days ago

Forge: Scalable Agent RL Framework and Algorithm

Article • 16 days ago • 129

upvoted a collection 12 days ago

Qwen3.5

Collection

17 items • Updated about 2 hours ago • 508

upvoted an article 14 days ago

Custom Kernels for All from Codex and Claude

Article • 17 days ago

upvoted a paper 17 days ago

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Paper • 2602.10224 • Published 19 days ago • 19

upvoted an article 24 days ago

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

Article • 26 days ago

upvoted a paper 28 days ago

The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation

Paper • 2601.17737 • Published Jan 25 • 55

upvoted an article about 1 month ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Article • Jan 27
