Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.
Add this skill
npx mdskills install sickn33/vector-index-tuningClear scope and workflow but lacks actionable implementation details and concrete examples
1---2name: vector-index-tuning3description: Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.4---56# Vector Index Tuning78Guide to optimizing vector indexes for production performance.910## Use this skill when1112- Tuning HNSW parameters13- Implementing quantization14- Optimizing memory usage15- Reducing search latency16- Balancing recall vs speed17- Scaling to billions of vectors1819## Do not use this skill when2021- You only need exact search on small datasets (use a flat index)22- You lack workload metrics or ground truth to validate recall23- You need end-to-end retrieval system design beyond index tuning2425## Instructions26271. Gather workload targets (latency, recall, QPS), data size, and memory budget.282. Choose an index type and establish a baseline with default parameters.293. Benchmark parameter sweeps using real queries and track recall, latency, and memory.304. Validate changes on a staging dataset before rolling out to production.3132Refer to `resources/implementation-playbook.md` for detailed patterns, checklists, and templates.3334## Safety3536- Avoid reindexing in production without a rollback plan.37- Validate changes under realistic load before applying globally.38- Track recall regressions and revert if quality drops.3940## Resources4142- `resources/implementation-playbook.md` for detailed patterns, checklists, and templates.43
Full transparency — inspect the skill content before installing.