Multimodal Image Understanding Pipeline

Upload an image and ask a question about the uploaded image. The app returns an image caption, answers your question, analyzes sentiment, and reports latency.