• A Survey on Vision-Language-Action Models: An Action Tokenization Perspective (arXiv:2507.01925, 39 upvotes)
• DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge (arXiv:2507.04447, 45 upvotes)
• A Survey on Vision-Language-Action Models for Autonomous Driving (arXiv:2506.24044, 14 upvotes)
• EmbRACE-3K: Embodied Reasoning and Action in Complex Environments (arXiv:2507.10548, 37 upvotes)
• Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos (arXiv:2507.15597, 34 upvotes)
• ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning (arXiv:2507.16815, 42 upvotes)
• villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models (arXiv:2507.23682, 24 upvotes)
• InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation (arXiv:2507.17520, 15 upvotes)
• MolmoAct: Action Reasoning Models that can Reason in Space (arXiv:2508.07917, 44 upvotes)
• Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies (arXiv:2508.20072, 32 upvotes)
• Mechanistic interpretability for steering vision-language-action models (arXiv:2509.00328, 3 upvotes)
• F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions (arXiv:2509.06951, 32 upvotes)
• VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model (arXiv:2509.09372, 246 upvotes)
• SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning (arXiv:2509.09674, 80 upvotes)
• FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies (arXiv:2509.04996, 15 upvotes)
• A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning (arXiv:2509.15937, 20 upvotes)
• RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training (arXiv:2510.06710, 42 upvotes)
• VLA-0: Building State-of-the-Art VLAs with Zero Modification (arXiv:2510.13054, 16 upvotes)
• Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning (arXiv:2510.14300, 12 upvotes)
• InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy (arXiv:2510.13778, 17 upvotes)
• GigaBrain-0: A World Model-Powered Vision-Language-Action Model (arXiv:2510.19430, 52 upvotes)
• 10 Open Challenges Steering the Future of Vision-Language-Action Models (arXiv:2511.05936, 6 upvotes)
• NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards (arXiv:2511.14659, 13 upvotes)
• Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future (arXiv:2512.16760, 15 upvotes)
• SOP: A Scalable Online Post-Training System for Vision-Language-Action Models (arXiv:2601.03044, 28 upvotes)
• A Pragmatic VLA Foundation Model (arXiv:2601.18692, 46 upvotes)
• Green-VLA: Staged Vision-Language-Action Model for Generalist Robots (arXiv:2602.00919, 286 upvotes)