๐ค InternVL2.5-4B Multimodal Chat
Welcome to the InternVL2.5-4B chat interface! This AI assistant can:
- ๐ฌ Have conversations with text
- ๐ผ๏ธ Analyze and describe images
- ๐ฅ Process and understand videos
- ๐ Extract text from images (OCR)
- ๐ฏ Answer questions about visual content
Instructions:
- Type your message in the text box
- Optionally upload an image or video
- Click Send to get a response
- Use "Clear" to reset the conversation
๐ Upload Media
Supported formats:
- Images: JPG, PNG, WEBP, GIF
- Videos: MP4, AVI, MOV, WEBM
Tips:
- For images: Ask about content, extract text, or describe what you see
- For videos: Ask for descriptions, analysis, or specific details
- You can upload one media file at a time
๐ก Example Prompts
About InternVL2.5-4B: A powerful multimodal AI model developed by Shanghai AI Lab, Tsinghua University and partners.
API Usage: This interface supports API calls. The chat endpoint accepts JSON with message
, image
, and video
fields.