prompt-caching
Multiple-layer LLM caching strategies to reduce token costs and latency across prompt prefixes, responses, and semantic matches.
- Supports three caching approaches: Anthropic's native prompt caching for repeated prefixes, response caching for identical or similar queries, and Cache Augmented Generation (CAG) for pre-cached documents
- Includes cache invalidation patterns and guidance on structuring prompts for optimal caching performance
- Highlights critical anti-patterns: caching with high temperature, missing invalidation logic, and over-caching low-value data
- Addresses sharp edges like cache miss latency spikes and prompt prefix changes that break caching effectiveness
Prompt Caching
Caching strategies for LLM prompts including Anthropic prompt caching, response caching, and CAG (Cache Augmented Generation)
Capabilities
- prompt-cache
- response-cache
- kv-cache
- cag-patterns
- cache-invalidation
Prerequisites
- Knowledge: Caching fundamentals, LLM API usage, Hash functions
- Skills_recommended: context-window-management
Scope
More from sickn33/antigravity-awesome-skills
docker-expert
You are an advanced Docker containerization expert with comprehensive, practical knowledge of container optimization, security hardening, multi-stage builds, orchestration patterns, and production deployment strategies based on current industry best practices.
15.0Knodejs-best-practices
Node.js development principles and decision-making. Framework selection, async patterns, security, and architecture. Teaches thinking, not copying.
11.2Ktypescript-expert
TypeScript and JavaScript expert with deep knowledge of type-level programming, performance optimization, monorepo management, migration strategies, and modern tooling.
8.3Kapi-security-best-practices
Implement secure API design patterns including authentication, authorization, input validation, rate limiting, and protection against common API vulnerabilities
7.0Kclean-code
This skill embodies the principles of \"Clean Code\" by Robert C. Martin (Uncle Bob). Use it to transform \"code that works\" into \"code that is clean.\"
6.6Knextjs-best-practices
Next.js App Router principles. Server Components, data fetching, routing patterns.
5.2K