arxiv:2507.16815
Fu-En Yang
FuEnYang
AI & ML interests
Computer Vision, Deep Learning, Vision-Language Models (VLMs), Vision-Language-Action Models (VLAs), Reasoning Models, Embodied AI
Recent Activity
upvoted
a
paper
about 16 hours ago
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
upvoted
a
paper
about 16 hours ago
SpatialTree: How Spatial Abilities Branch Out in MLLMs
upvoted
a
paper
about 16 hours ago
Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations