XSkill: Continual Learning from Experience and Skills in Multimodal Agents Paper • 2603.12056 • Published 4 days ago • 23
CodePercept: Code-Grounded Visual STEM Perception for MLLMs Paper • 2603.10757 • Published 5 days ago • 11
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published 5 days ago • 34
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards Paper • 2603.09117 • Published 6 days ago • 8
MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants Paper • 2603.09652 • Published 6 days ago • 14
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published 6 days ago • 61
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing Paper • 2603.09877 • Published 6 days ago • 41
view article Article Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge 7 days ago • 9
Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs Paper • 2603.09095 • Published 6 days ago • 26
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published 16 days ago • 34
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 11 days ago • 16
DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval Paper • 2603.04743 • Published 11 days ago • 47
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 18 days ago • 40