life-echo/api/.agents/skills/redis-development/rules/semantic-cache-best-practices.md at 8f66f94a1a89b868aa1c9cd31700113f7cdcd5a1

Files

Sully 53e0065e3e refactor(api): TOML 配置 SSOT、统一错误契约、Auth/事务加固与可观测性 (#33 )

配置 SSOT（TOML + .env）
统一错误契约
Auth 与事务边界
Redis / Celery 可靠性:业务 Redis（DB/0）与 Celery broker/backend（DB/1）显式拆分；连接池、sync client
可观测性（OpenTelemetry + LGTM）

2026-05-22 13:44:50 +08:00

1.9 KiB

Raw Blame History

title, impact, impactDescription, tags, description, alwaysApply

title	impact	impactDescription	tags	description	alwaysApply
Configure Semantic Cache Properly	MEDIUM	Correct threshold tuning balances hit rate vs accuracy	langcache, cache, threshold, ttl, semantic	Configure Semantic Cache Properly	true

Configure Semantic Cache Properly

Note: LangCache is currently in preview on Redis Cloud. Features and behavior may change.

Tune similarity threshold and cache separation for optimal LangCache results.

Correct: Tune similarity threshold for your use case.

from langcache import LangCache

lang_cache = LangCache(
    server_url=f"https://{os.getenv('HOST')}",
    cache_id=os.getenv("CACHE_ID"),
    api_key=os.getenv("API_KEY")
)

# Stricter matching - fewer false positives (0.95 = very similar)
result = lang_cache.search(
    prompt="What is Redis?",
    similarity_threshold=0.95
)

# Looser matching - higher hit rate (0.8 = somewhat similar)
result = lang_cache.search(
    prompt="What is Redis?",
    similarity_threshold=0.8
)

Correct: Use separate caches for different use cases.

# Create different cache IDs in Redis Cloud for different LLM tasks
support_cache = LangCache(
    server_url=server_url,
    cache_id="support-cache-id",
    api_key=api_key
)

code_cache = LangCache(
    server_url=server_url,
    cache_id="code-cache-id",
    api_key=api_key
)

Incorrect: Using a single cache for all LLM tasks.

# All tasks share one cache - responses may not be relevant
result = lang_cache.search(prompt="How do I reset my password?")
# Could return a code snippet if someone asked a similar coding question

Best practices:

Start with threshold 0.9, adjust based on your use case
Use custom attributes to filter results within a single cache
Monitor cache hit rates to evaluate effectiveness
Use separate cache IDs for fundamentally different LLM tasks

Reference: LangCache Best Practices

1.9 KiB Raw Blame History

Configure Semantic Cache Properly

1.9 KiB

Raw Blame History