Generate depth map from an image
Chat with images and text using Qwen-VL-Plus
Interact with images and texts using Qwen-VL-Max