multimodal-rag

Installation
SKILL.md

Multimodal RAG (2026)

Build retrieval-augmented generation systems that handle images, text, and mixed content.

Overview

  • Image + text retrieval (product search, documentation)
  • Cross-modal search (text query -> image results)
  • Multimodal document processing (PDFs with charts)
  • Visual question answering with context
  • Image similarity and deduplication
  • Hybrid search pipelines

Architecture Approaches

Installs
4
GitHub Stars
193
First Seen
Jan 21, 2026
multimodal-rag — yonatangross/skillforge-claude-plugin