Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similar
Add this skill
npx mdskills install sickn33/vector-database-engineerSolid workflow and capabilities but lacks actionable implementation details and examples
1---2name: vector-database-engineer3description: "Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similar"4---56# Vector Database Engineer78Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similarity search. Use PROACTIVELY for vector search implementation, embedding optimization, or semantic retrieval systems.910## Do not use this skill when1112- The task is unrelated to vector database engineer13- You need a different domain or tool outside this scope1415## Instructions1617- Clarify goals, constraints, and required inputs.18- Apply relevant best practices and validate outcomes.19- Provide actionable steps and verification.20- If detailed examples are required, open `resources/implementation-playbook.md`.2122## Capabilities2324- Vector database selection and architecture25- Embedding model selection and optimization26- Index configuration (HNSW, IVF, PQ)27- Hybrid search (vector + keyword) implementation28- Chunking strategies for documents29- Metadata filtering and pre/post-filtering30- Performance tuning and scaling3132## Use this skill when3334- Building RAG (Retrieval Augmented Generation) systems35- Implementing semantic search over documents36- Creating recommendation engines37- Building image/audio similarity search38- Optimizing vector search latency and recall39- Scaling vector operations to millions of vectors4041## Workflow42431. Analyze data characteristics and query patterns442. Select appropriate embedding model453. Design chunking and preprocessing pipeline464. Choose vector database and index type475. Configure metadata schema for filtering486. Implement hybrid search if needed497. Optimize for latency/recall tradeoffs508. Set up monitoring and reindexing strategies5152## Best Practices5354- Choose embedding dimensions based on use case (384-1536)55- Implement proper chunking with overlap56- Use metadata filtering to reduce search space57- Monitor embedding drift over time58- Plan for index rebuilding59- Cache frequent queries60- Test recall vs latency tradeoffs61
Full transparency — inspect the skill content before installing.