EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 26 days ago • 115
See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models Paper • 2512.02231 • Published Dec 1, 2025 • 8