🤖 InternVL2.5-4B Multimodal Chat

Welcome to the InternVL2.5-4B chat interface! This AI assistant can:

💬 Have conversations with text
🖼️ Analyze and describe images
🎥 Process and understand videos
📝 Extract text from images (OCR)
🎯 Answer questions about visual content

Instructions:

Type your message in the text box
Optionally upload an image or video
Click Send to get a response
Use "Clear" to reset the conversation

Chat History

Your Message

📎 Upload Media

Upload Image

Upload Video

Supported formats:

Images: JPG, PNG, WEBP, GIF
Videos: MP4, AVI, MOV, WEBM

Tips:

For images: Ask about content, extract text, or describe what you see
For videos: Ask for descriptions, analysis, or specific details
You can upload one media file at a time

💡 Example Prompts

About InternVL2.5-4B: A powerful multimodal AI model developed by Shanghai AI Lab, Tsinghua University and partners.

API Usage: This interface supports API calls. The chat endpoint accepts JSON with message, image, and video fields.