ClaudeAgentCacheAdapter

Cache wrapper for the Claude Agent SDK (claude_code_sdk). Caches query() async generator results — eliminating redundant agent runs for identical prompts.

Installation

pip install claude-code-sdk

Usage

import claude_code_sdk
from claude_code_sdk import ClaudeCodeOptions

from chengeta_ai import CacheManager, InMemoryBackend, CacheKeyBuilder
from chengeta_ai.adapters.claude_agent_adapter import ClaudeAgentCacheAdapter

manager = CacheManager(
    backend=InMemoryBackend(),
    key_builder=CacheKeyBuilder(namespace="myapp"),
)
adapter = ClaudeAgentCacheAdapter(manager)

options = ClaudeCodeOptions(max_turns=5)

# First run — live agent call, result buffered and cached
messages = []
async for msg in adapter.query("Fix the import error in utils.py", options=options):
    messages.append(msg)
    print(msg)

# Second run — identical prompt, replayed from cache instantly
async for msg in adapter.query("Fix the import error in utils.py", options=options):
    print(msg)  # same messages, no agent call

Invalidate all cached runs

adapter.invalidate_all()

How It Works

ClaudeAgentCacheAdapter.query() is an async generator that:

Checks the cache for the prompt + options key.
Cache hit: replays stored messages one by one — caller gets the same async generator interface.
Cache miss: streams the live agent, collects all messages, stores them, and yields each chunk in real time.

This means callers always get a streaming interface regardless of cache state.

API Reference

Method	Description
`query(prompt, options, **kwargs)`	Cached async generator wrapping `claude_code_sdk.query()`
`invalidate_all()`	Invalidate all cached Claude agent responses

Installation​

Usage​

Invalidate all cached runs​

How It Works​

API Reference​

Installation

Usage

Invalidate all cached runs

How It Works

API Reference