vector-index-tuning

Installation
Summary

Optimize vector index performance across latency, recall, and memory tradeoffs.

  • Covers HNSW parameter tuning (M, efConstruction, efSearch) with benchmarking templates and automated recommendation logic based on vector count and target recall
  • Includes quantization strategies: scalar INT8, product quantization, binary quantization, and FP16 compression with memory estimation tools
  • Provides Qdrant collection configuration templates optimized for three scenarios: recall-focused, speed-focused, balanced, and memory-constrained deployments
  • Includes search performance monitoring, latency profiling (p50/p95/p99), and recall measurement against ground truth
SKILL.md

Vector Index Tuning

Guide to optimizing vector indexes for production performance.

When to Use This Skill

  • Tuning HNSW parameters
  • Implementing quantization
  • Optimizing memory usage
  • Reducing search latency
  • Balancing recall vs speed
  • Scaling to billions of vectors

Core Concepts

1. Index Type Selection

Data Size           Recommended Index
Related skills

More from wshobson/agents

Installs
6.1K
Repository
wshobson/agents
GitHub Stars
35.2K
First Seen
Jan 20, 2026