SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization Paper • 2602.04811 • Published 7 days ago • 1
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper • 2601.14253 • Published 22 days ago • 10
V-DPM: 4D Video Reconstruction with Dynamic Point Maps Paper • 2601.09499 • Published 29 days ago • 9
UM-Text: A Unified Multimodal Model for Image Understanding Paper • 2601.08321 • Published 30 days ago • 9
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation Paper • 2601.03955 • Published Jan 7 • 3
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper • 2512.24724 • Published Dec 31, 2025 • 7
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper • 2512.24766 • Published Dec 31, 2025 • 9
What matters for Representation Alignment: Global Information or Spatial Structure? Paper • 2512.10794 • Published Dec 11, 2025 • 9
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper • 2512.07843 • Published Nov 24, 2025 • 22
LongCat Image Edit: Generate or edit images using text prompts • MCP • Featured • Runtime error • 143
VibeVoice-Realtime-0.5B: Generate natural speech from text with customizable voices • Featured • Running on Zero • 169