How do I install Prompt Caching?

Install Prompt Caching with a single command: npx mdskills install sickn33/prompt-caching. This downloads the skill files into your project and your AI agent picks them up automatically.

What platforms support Prompt Caching?

Prompt Caching works with Claude Code, Claude Desktop, Cursor, Vscode Copilot, Windsurf, Continue Dev, Codex, Gemini Cli, Amp, Roo Code, Goose, Opencode, Trae, Qodo, Command Code. Skills use the open SKILL.md format which is compatible with any AI coding agent that reads markdown instructions.

← Back to skills

Prompt Caching

Name: Prompt Caching: AI Agent Skill
Brand: sickn33
Availability: InStock
Rating: 6 (1 reviews)
Author: sickn33

ProductivityIntermediate

Caching strategies for LLM prompts including Anthropic prompt caching, response caching, and CAG (Cache Augmented Generation) Use when: prompt caching, cache prompt, response cache, cag, cache augmented.

by @sickn33 13,166Updated 2/20/2026

Add this skill

npx mdskills install sickn33/prompt-caching

Fork & Edit

Are you @sickn33? Sign in with GitHub to claim this listing.

Skill Advisor6.0

Strong caching framework with anti-patterns and edge cases, but lacks actionable implementation steps

+Identifies multiple caching levels with clear use cases
+Includes anti-patterns and sharp edges table with severity ratings
+Covers LLM-specific caching concerns like temperature and prefix structure
-Missing specific trigger conditions and step-by-step implementation instructions
-Content appears truncated with incomplete sections (principle #2 cut off)

SKILL.md

Edit in Browser

1---
2name: prompt-caching
3description: "Caching strategies for LLM prompts including Anthropic prompt caching, response caching, and CAG (Cache Augmented Generation) Use when: prompt caching, cache prompt, response cache, cag, cache augmented."
4source: vibeship-spawner-skills (Apache 2.0)
5---
6 
7# Prompt Caching
8 
9You're a caching specialist who has reduced LLM costs by 90% through strategic caching.
10You've implemented systems that cache at multiple levels: prompt prefixes, full responses,
11and semantic similarity matches.
12 
13You understand that LLM caching is different from traditional caching—prompts have
14prefixes that can be cached, responses vary with temperature, and semantic similarity
15often matters more than exact match.
16 
17Your core principles:
181. Cache at the right level—prefix, response, or both
192. K
20 
21## Capabilities
22 
23- prompt-cache
24- response-cache
25- kv-cache
26- cag-patterns
27- cache-invalidation
28 
29## Patterns
30 
31### Anthropic Prompt Caching
32 
33Use Claude's native prompt caching for repeated prefixes
34 
35### Response Caching
36 
37Cache full LLM responses for identical or similar queries
38 
39### Cache Augmented Generation (CAG)
40 
41Pre-cache documents in prompt instead of RAG retrieval
42 
43## Anti-Patterns
44 
45### ❌ Caching with High Temperature
46 
47### ❌ No Cache Invalidation
48 
49### ❌ Caching Everything
50 
51## ⚠️ Sharp Edges
52 
53| Issue | Severity | Solution |
54|-------|----------|----------|
55| Cache miss causes latency spike with additional overhead | high | // Optimize for cache misses, not just hits |
56| Cached responses become incorrect over time | high | // Implement proper cache invalidation |
57| Prompt caching doesn't work due to prefix changes | medium | // Structure prompts for optimal caching |
58 
59## Related Skills
60 
61Works well with: `context-window-management`, `rag-implementation`, `conversation-memory`
62

Full transparency — inspect the skill content before installing.

New to skill.md files?

See what a SKILL.md file is, how to install one, and how it differs from AGENTS.md or cursorrules.

Read the guide →