Urro's picture

In a Training Loop 🔄

Urro PRO

urroxyz

·

https://urro.xyz/

urroxyz

AI & ML interests

computational linguistics major 🤖🔎🔠 i am autistic. if i come off rude, i probably didn't mean to. please feel free to ask me for clarification.

Recent Activity

new activity about 7 hours ago

urroxyz/Voxtral-Mini-3B-2507_timestamped:inaccurate output

liked a model about 8 hours ago

numiros/Comma-Epsilon-v0.1

updated a collection 1 day ago

HUMAN-WRITTEN & LEGALLY-SOURCED*

View all activity

Organizations

upvoted 4 papers 2 days ago

AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors

Paper • 2601.20524 • Published 3 days ago • 3

Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization

Paper • 2604.08476 • Published 3 days ago • 5

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Paper • 2604.08516 • Published 3 days ago • 33

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 3 days ago • 41

upvoted a changelog 2 days ago

Hugging Face Changelog

ZeroGPU overquota

2 days ago

• 62

upvoted 11 papers 3 days ago

Test-Time Scaling Makes Overtraining Compute-Optimal

Paper • 2604.01411 • Published 11 days ago • 26

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published 9 days ago • 32

Self-Distilled RLVR

Paper • 2604.03128 • Published 9 days ago • 154

Less Detail, Better Answers: Degradation-Driven Prompting for VQA

Paper • 2604.04838 • Published 6 days ago • 12

Mimic Intent, Not Just Trajectories

Paper • 2602.08602 • Published 15 days ago • 13

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 6 days ago • 101

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 10 days ago • 395

REAM: Merging Improves Pruning of Experts in LLMs

Paper • 2604.04356 • Published 6 days ago • 4

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Paper • 2604.04215 • Published 7 days ago • 19

In-Place Test-Time Training

Paper • 2604.06169 • Published 5 days ago • 20

MARS: Enabling Autoregressive Models Multi-Token Generation

Paper • 2604.07023 • Published 4 days ago • 30

upvoted a paper 7 days ago

IMU-1: Sample-Efficient Pre-training of Small Language Models

Paper • 2602.02522 • Published Jan 25 • 7

upvoted 3 papers 8 days ago

COSMOS: Predictable and Cost-Effective Adaptation of LLMs

Paper • 2505.01449 • Published Apr 30, 2025 • 4

SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models

Paper • 2403.07384 • Published Mar 12, 2024 • 3

Less is More: Improving LLM Alignment via Preference Data Selection

Paper • 2502.14560 • Published Feb 20, 2025 • 1