Wei Liu

lefutonku

AI & ML interests

None yet

Organizations

None yet

upvoted 4 papers about 4 hours ago

WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories

Paper • 2603.02049 • Published 4 days ago • 15

VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection

Paper • 2603.00912 • Published 5 days ago • 32

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published 3 days ago • 135

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published 2 days ago • 124

upvoted a paper 7 days ago

World Action Models are Zero-shot Policies

Paper • 2602.15922 • Published 17 days ago • 13

upvoted 2 papers 25 days ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 175

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5, 2025 • 54

upvoted 8 papers about 1 month ago

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published Apr 28, 2025 • 43

Masked Depth Modeling for Spatial Perception

Paper • 2601.17895 • Published Jan 25 • 26

MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources

Paper • 2601.22054 • Published Jan 29 • 5

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 157

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published Jan 12 • 44

Advancing Open-source World Models

Paper • 2601.20540 • Published Jan 28 • 128

UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented Generation

Paper • 2504.08761 • Published Mar 31, 2025 • 7

A Pragmatic VLA Foundation Model

Paper • 2601.18692 • Published Jan 26 • 47

liked a model about 1 month ago

robbyant/lingbot-depth

Depth Estimation • Updated Jan 28 • 65

upvoted 4 papers about 1 month ago

What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge

Paper • 2601.10922 • Published Jan 16 • 3

Uncertainty-Aware Gradient Signal-to-Noise Data Selection for Instruction Tuning

Paper • 2601.13697 • Published Jan 20 • 4

FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation

Paper • 2601.13976 • Published Jan 20 • 22

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

Paper • 2601.12993 • Published Jan 19 • 75