Generate a video from an image with a text prompt
Generate 3D depth maps from images
Scalable and versatile 3D generation from images