mm-ctx
mm CLI in your browser: multimodal context for agents
Chat with an AI that understands text, images, and videos
Answer questions about your images
In-browser vision-language inference with LFM2.5-VL-1.6B
Docling with a vision language model as OCR backend
Real-time video captioning in your browser
Zero-shot P&ID graph extraction with Claude Opus 4.6
Vision-language model demo by LLM-jp