feat: Redis caching + async queue for LLM scaling (v1.6.0) · f75f165911 - aoc

feat: Redis caching + async queue for LLM scaling (v1.6.0)

Release / build-and-push (push) Successful in 1m24s

Details

CI / lint-and-test (push) Failing after 29s

Details

- Add async Redis client singleton (redis_client.py) for caching and arq pool
- Add arq job functions (jobs.py) for background LLM processing
- Cache ask/explain LLM responses with TTL (1h ask, 24h explain)
- Add async mode to /api/ask: enqueue job, return job_id, poll /api/jobs/{id}
- Add GET /api/jobs/{job_id} endpoint for job status polling
- Add arq worker service to docker-compose (dev + prod)
- Switch from Redis to Valkey (BSD fork) in Docker Compose
- Add REDIS_URL config setting
- Add tests for cache hit, async mode, and job status

This commit is contained in:

Tomas Kracmar

2026-04-22 09:55:05 +02:00

parent 47e0dfc2ca

commit f75f165911

16 changed files with 498 additions and 14 deletions

VERSION

+1 -1

View File

@@ -1 +1 @@
 .5.0
 .6.0

feat: Redis caching + async queue for LLM scaling (v1.6.0) Release / build-and-push (push) Successful in 1m24s Details CI / lint-and-test (push) Failing after 29s Details

feat: Redis caching + async queue for LLM scaling (v1.6.0)

Release / build-and-push (push) Successful in 1m24s

Details

CI / lint-and-test (push) Failing after 29s

Details