embedding-strategies

Summary

Comprehensive guide for selecting, implementing, and optimizing embedding models for vector search and RAG applications.

Covers 10+ embedding models with dimensions, token limits, and domain specialization (Voyage AI, OpenAI, open-source options for code, finance, legal, and multilingual content)
Provides four chunking strategies: token-based, sentence-based, semantic sections, and recursive character splitting with overlap handling
Includes three implementation templates for Voyage AI, OpenAI, and local Sentence Transformers with specialized query/document prefixes
Features domain-specific pipelines for general documents and code, plus evaluation metrics (precision, recall, MRR, NDCG) for retrieval quality assessment
Closes with a best practices section on model selection, preprocessing, batching, and caching, plus a list of common pitfalls
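As an illustration of the token-based chunking with overlap mentioned above, here is a minimal sketch. It is not the skill's own implementation: the function names `chunk_tokens` and `chunk_text` are hypothetical, and whitespace splitting stands in for a real tokenizer, so actual token counts will differ by model.

```python
from typing import List


def chunk_tokens(tokens: List[str], chunk_size: int = 512, overlap: int = 50) -> List[List[str]]:
    """Split a token list into windows of `chunk_size` that share `overlap` tokens."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks: List[List[str]] = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        # Stop once a window reaches the end, to avoid a trailing chunk
        # that only repeats the overlap region.
        if start + chunk_size >= len(tokens):
            break
    return chunks


def chunk_text(text: str, chunk_size: int = 512, overlap: int = 50) -> List[str]:
    """Assumption: whitespace-separated words approximate tokens."""
    tokens = text.split()
    return [" ".join(chunk) for chunk in chunk_tokens(tokens, chunk_size, overlap)]
```

The overlap keeps a sentence that straddles a boundary retrievable from either neighboring chunk, at the cost of embedding some tokens twice. For production use, the word split would typically be replaced with the tokenizer of the chosen embedding model so that chunks respect its token limit.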

SKILL.md

Embedding Strategies

Guide to selecting and optimizing embedding models for vector search applications.
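One way to judge whether a model or chunking change actually helps vector search is to score retrieval runs with rank-based metrics such as MRR and recall@k. The sketch below assumes each query yields a ranked list of document IDs and a set of known-relevant IDs; the function names and the data layout are illustrative assumptions, not part of the skill.

```python
from typing import List, Sequence, Set, Tuple


def reciprocal_rank(ranked_ids: Sequence[str], relevant: Set[str]) -> float:
    """Return 1/rank of the first relevant result, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def recall_at_k(ranked_ids: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of the relevant documents that appear in the top k results."""
    if not relevant:
        return 0.0
    hits = len(set(ranked_ids[:k]) & relevant)
    return hits / len(relevant)


def mean_reciprocal_rank(runs: List[Tuple[Sequence[str], Set[str]]]) -> float:
    """Average reciprocal rank over (ranked_ids, relevant_ids) pairs."""
    if not runs:
        return 0.0
    return sum(reciprocal_rank(ranked, rel) for ranked, rel in runs) / len(runs)
```

MRR rewards placing the first relevant hit near the top, while recall@k measures how much of the relevant set is surfaced at all, so the two are usually reported together when comparing embedding models.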

Model	Dimensions	Max Tokens	Best For

Installs: 6.6K

Repository

GitHub Stars: 35.3K

First Seen: Jan 20, 2026
