LoGoPlanner: Localization Grounded Navigation Policy with Metric-aware Visual Geometry Paper • 2512.19629 • Published Dec 22, 2025 • 26
LongVideoAgent: Multi-Agent Reasoning with Long Videos Paper • 2512.20618 • Published about 1 month ago • 54
Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations Paper • 2512.21004 • Published about 1 month ago • 13
NVIDIA Nemotron 3: Efficient and Open Intelligence Paper • 2512.20856 • Published about 1 month ago • 35
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published about 1 month ago • 50
Omni-Weather: Unified Multimodal Foundation Model for Weather Generation and Understanding Paper • 2512.21643 • Published 29 days ago • 13