qdrant-search-quality-diagnosis
Installation
SKILL.md
How to Diagnose Bad Search Quality
Before tuning, establish baselines. Use exact KNN as ground truth, compare against approximate HNSW. Target >95% recall@K for production.
Don't Know What's Wrong Yet
Use when: results are irrelevant or missing expected matches and you need to isolate the cause.
- For a no-code quick check, use the Web UI's ANN Recall tab to compare approximate vs exact
recall@kWeb UI ANN Recall - For the same comparison in code (CI gating, regression tests), run each query twice — once approximate, once with
exact=true— and computerecall@kfrom the overlap ANN recall in CI - Exact search bad = model or search pipeline problem. Exact good, approximate bad = tune HNSW.
- Check if quantization degrades quality (compare with and without)
- Check if filters are too restrictive (then you might need to use ACORN)
- If duplicate results from chunked documents, use Grouping API to deduplicate Grouping
Payload filtering and sparse vector search are different things. Metadata (dates, categories, tags) goes in payload for filtering. Text content goes in sparse vectors for search.