Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities Paper β’ 2601.21937 β’ Published 19 days ago β’ 19
Cosmos-Predict2 Collection β οΈ This collection is archived. π https://huggingface.co/collections/nvidia/cosmos-predict25 β’ 13 items β’ Updated 13 days ago β’ 34
VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation Paper β’ 2601.10124 β’ Published Jan 15 β’ 4
WildRayZer: Self-supervised Large View Synthesis in Dynamic Environments Paper β’ 2601.10716 β’ Published Jan 15 β’ 4
Enhancing Sentiment Classification and Irony Detection in Large Language Models through Advanced Prompt Engineering Techniques Paper β’ 2601.08302 β’ Published Jan 13 β’ 5
Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale Paper β’ 2601.10338 β’ Published Jan 15 β’ 6
Deriving Character Logic from Storyline as Codified Decision Trees Paper β’ 2601.10080 β’ Published Jan 15 β’ 6
RigMo: Unifying Rig and Motion Learning for Generative Animation Paper β’ 2601.06378 β’ Published Jan 10 β’ 12
V-DPM: 4D Video Reconstruction with Dynamic Point Maps Paper β’ 2601.09499 β’ Published Jan 14 β’ 9
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning Paper β’ 2601.10129 β’ Published Jan 15 β’ 11
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following Paper β’ 2601.06431 β’ Published Jan 10 β’ 12
TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts Paper β’ 2601.08881 β’ Published Jan 12 β’ 13
Inference-time Physics Alignment of Video Generative Models with Latent World Models Paper β’ 2601.10553 β’ Published Jan 15 β’ 12
M^4olGen: Multi-Agent, Multi-Stage Molecular Generation under Precise Multi-Property Constraints Paper β’ 2601.10131 β’ Published Jan 15 β’ 17
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper β’ 2601.10103 β’ Published Jan 15 β’ 74
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper β’ 2601.10547 β’ Published Jan 15 β’ 42