Scalable and versatile 3D generation from images
Generate depth maps from images
Text-to-Video
Generate sound effects for silent videos