-
v1.6.0
Stablereleased this
2026-04-22 07:55:21 +00:00 | 33 commits to main since this releasev1.6.0 — Valkey caching + async queue for LLM scaling
New features
- Valkey caching for LLM responses (
/api/askand/api/events/{id}/explain)- Ask cache TTL: 1 hour
- Explain cache TTL: 24 hours
- Async queue (
arq) for LLM requests to prevent timeouts at scalePOST /api/ask?async_mode=truereturns ajob_idimmediatelyGET /api/jobs/{job_id}polls for results
- Valkey replaces Redis as the in-memory store (BSD-licensed fork)
Infrastructure
- New
workerservice in Docker Compose (runsarq jobs.WorkerSettings) - New
redis/valkeyservice with healthchecks REDIS_URLconfiguration setting
API changes
AskRequestaddedasync_mode: boolfieldAskResponseaddedjob_id: str | Nonefield
Dependencies
- Added
redisandarq
Downloads
- Valkey caching for LLM responses (