multimodal-rag
Multimodal RAG (2026)
Build retrieval-augmented generation systems that handle images, text, and mixed content.
Overview
- Image + text retrieval (product search, documentation)
- Cross-modal search (text query -> image results)
- Multimodal document processing (PDFs with charts)
- Visual question answering with context
- Image similarity and deduplication
- Hybrid search pipelines
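
As a rough illustration of the cross-modal search item above, the sketch below ranks images against a text query by cosine similarity in a shared embedding space. The vectors, file names, and the `search` helper are hypothetical stand-ins; a real pipeline would obtain embeddings from a multimodal encoder and would typically store them in a vector index.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query_vec, index, top_k=2):
    """Return the ids of the top_k items most similar to query_vec."""
    scored = [(cosine(query_vec, vec), item_id) for item_id, vec in index.items()]
    scored.sort(reverse=True)
    return [item_id for _, item_id in scored[:top_k]]


# Hypothetical image embeddings (toy 3-d vectors, not real model output).
IMAGE_INDEX = {
    "red_shoe.jpg": [0.9, 0.1, 0.0],
    "blue_shirt.jpg": [0.1, 0.9, 0.1],
    "bar_chart.png": [0.0, 0.2, 0.9],
}

# Hypothetical embedding of a text query such as "red sneakers".
query = [0.8, 0.2, 0.1]

results = search(query, IMAGE_INDEX)
print(results)
```

The same `cosine` helper could also support the image similarity and deduplication case, for example by flagging image pairs whose score exceeds a chosen threshold.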