multimodal-rag
Installation
SKILL.md
Multimodal RAG ()
Build retrieval-augmented generation systems that handle images, text, and mixed content.
Overview
- Image + text retrieval (product search, documentation)
- Cross-modal search (text query -> image results)
- Multimodal document processing (PDFs with charts)
- Visual question answering with context
- Image similarity and deduplication
- Hybrid search pipelines