cqrenet/aoc - aoc - CQRE.NET Git

Author	SHA1	Message	Date
tomas.kracmar	e2cea50d87	hotfix(v1.7.9): auth diagnostics and rate-limit exemptions - Exempt /api/config/auth, /api/config/features, /health, /metrics from rate limiting - Fix generic exception handler to return proper JSON for HTTPException instead of re-raising - Add startup log with auth_enabled and version - Add frontend console logging for auth config fetch errors - Show 'Auth: OFF' or 'Auth: misconfigured' on auth button instead of empty text - Add backend debug logging to /api/config/auth endpoint	2026-04-27 10:09:44 +02:00
tomas.kracmar	d01e7801ed	security: v1.7.7 hardening release - Add WEBHOOK_CLIENT_SECRET validation for Graph webhooks - Add Redis-backed rate limiting (fetch/ask/write/default tiers) - Validate LLM_BASE_URL to prevent SSRF (HTTPS only, block private IPs) - Enforce non-wildcard CORS when AUTH_ENABLED=true - Add Content-Security-Policy headers - Fix audit middleware to use verified JWT claims via contextvars - Cap bulk_tags updates to 10,000 documents - Return generic error messages to clients (no internal detail leakage) - Strict AlertCondition Pydantic model for alert rules - Security warning on MCP stdio server startup - Remove MongoDB/Redis host ports from docker-compose - Remove mongo_query from /ask API response	2026-04-27 09:16:57 +02:00
tomas.kracmar	e348881083	feat: Admin Operations SIEM — alerts, notifications, pre-built rules - Add pluggable notification system (webhook, Slack, Teams) with retry - Add alert deduplication: same rule + actor within 15 min = one alert - Add 10 pre-built admin-ops rule templates seeded on startup: - Failed Conditional Access, After-Hours Admin Activity - New Application Registration, Admin Role Assignment - License Change, Bulk User Deletion - Device Compliance Failure, Exchange Transport Rule Change - Service Principal Credential Added, External Sharing Enabled - Add /api/alerts, /api/alerts/{id}/status, /api/alerts/summary endpoints - Add alert dashboard to frontend with status filters and ack/resolve buttons - Add alert summary badge in hero header (high/medium/low counts) - New env vars: ALERT_WEBHOOK_URL, ALERT_WEBHOOK_FORMAT, ALERT_DEDUPE_MINUTES	2026-04-22 14:12:36 +02:00
tomas.kracmar	0eebcd0765	feat: clickable pills, configurable page size, CQRE.NET branding - Service/category pills are now clickable: click to filter by that service - Result pills (Success, Failure, etc.) are now clickable: click to filter by that result - Click again to clear the filter (toggle behavior) - Change default page size from 100 to 25 - Add DEFAULT_PAGE_SIZE config (env var, default 25), exposed via /api/config/features - Change footer brand from CQRE to CQRE.NET - Add pill--clickable hover styles - Bump CSS cache-buster to v=10	2026-04-22 11:53:01 +02:00
tomas.kracmar	cbd46adaa6	style: ruff format	2026-04-22 10:08:32 +02:00
tomas.kracmar	f75f165911	feat: Redis caching + async queue for LLM scaling (v1.6.0) - Add async Redis client singleton (redis_client.py) for caching and arq pool - Add arq job functions (jobs.py) for background LLM processing - Cache ask/explain LLM responses with TTL (1h ask, 24h explain) - Add async mode to /api/ask: enqueue job, return job_id, poll /api/jobs/{id} - Add GET /api/jobs/{job_id} endpoint for job status polling - Add arq worker service to docker-compose (dev + prod) - Switch from Redis to Valkey (BSD fork) in Docker Compose - Add REDIS_URL config setting - Add tests for cache hit, async mode, and job status	2026-04-22 09:55:05 +02:00
tomas.kracmar	2fffe3aec2	feat: operation-level privacy gating instead of broad service-level - Replace broad service-level hiding with fine-grained operation-level gating - PRIVACY_SENSITIVE_OPERATIONS config: hide specific operations across ALL services - PRIVACY_SERVICES still works for broad service-level blocking (optional) - Users without PRIVACY_SERVICE_ROLES: * Don't see sensitive operations in /api/filter-options * Can't query sensitive operations via /api/events or /api/ask * Get 403 on /api/events/{id}/explain for sensitive events - Exchange/Teams services remain visible; only privacy ops are hidden - Update .env.example with new operation-level config docs	2026-04-22 08:23:46 +02:00
tomas.kracmar	b2f4cabef4	feat: service-level role gating for privacy-sensitive services (Option A) - Add PRIVACY_SERVICES and PRIVACY_SERVICE_ROLES config variables - Add user_can_access_privacy_services(claims) helper in auth.py - /api/events filters out privacy services for users without required roles - /api/filter-options excludes privacy services from dropdown options - /api/ask excludes privacy services from NLQ queries - /api/events/{id}/explain returns 403 for privacy events if unauthorized - Teams added to default noisy service exclusion (frontend + backend) - Update .env.example with privacy config documentation - Add tests for event filtering, filter-options exclusion, and explain 403	2026-04-22 07:26:21 +02:00
tomas.kracmar	e069869a94	feat: exclude Teams from defaults + GUID resolution in explain - Add Teams to noisy services excluded by default (frontend + backend ask) - Exchange, SharePoint, and Teams now unchecked by default in filters - Enhance explain endpoint with GUID resolution: * Extract UUIDs from raw event JSON recursively * Resolve directory objects via Graph API (user, group, SP, device) * Include resolved names in LLM prompt so explanations reference human-readable names instead of raw GUIDs - Add asyncio import for to_thread wrapper around sync Graph calls	2026-04-22 07:12:10 +02:00
tomas.kracmar	fb2386e190	feat: saved searches (bookmarks) - Add saved_searches_collection to database.py with index on created_by+created_at - New routes/saved_searches.py: GET /api/saved-searches, POST, DELETE - Saved searches are scoped per user (created_by = token sub) - Mount router in main.py - Frontend: Save filters button, saved search pills with load/delete - loadSavedSearches called on initApp - applySavedSearch restores filters and validates services against current options - Add CSS for saved-searches row - Add tests for CRUD, delete 404, and name validation	2026-04-22 07:04:07 +02:00
tomas.kracmar	658ddd0aac	feat: copy raw event and AI explain in modal - Add POST /api/events/{id}/explain endpoint that fetches event + related events and asks the LLM for a plain-language explanation with security context - Add 'Copy' button to raw event modal (uses navigator.clipboard) - Add 'Explain' button to raw event modal (only when AI_FEATURES_ENABLED) - Show explanation in modal with markdown rendering - Add CSS for modal actions and explanation panel - Add tests for explain endpoint (404, no LLM key, mocked LLM success)	2026-04-21 22:26:26 +02:00
tomas.kracmar	5122739c01	feat: MCP server over SSE with OIDC auth - Extract shared MCP tool handlers to mcp_common.py - mcp_server.py now uses shared handlers (stdio transport for local dev) - New routes/mcp.py: SSE transport behind existing OIDC Bearer auth - Mount MCP ASGI app at /mcp in main.py when AI_FEATURES_ENABLED - /mcp/sse -> establishes SSE stream (requires valid token when auth enabled) - /mcp/messages/ -> receives MCP client messages - Update README with SSE MCP docs - Add tests for mount existence, auth, and message routing	2026-04-21 07:38:12 +02:00
tomas.kracmar	60b6ad15c4	Release v1.3.0: AI feature flag and MCP server - Add AI_FEATURES_ENABLED config flag to gate AI/natural-language features - Conditionally register /api/ask router based on AI_FEATURES_ENABLED - Add GET /api/config/features endpoint for frontend feature detection - Update frontend to hide Ask panel when AI features are disabled - Implement standalone MCP server (backend/mcp_server.py) with tools: * search_events, get_event, get_summary, ask - Add mcp dependency to requirements.txt - Update .env.example, AGENTS.md, and ROADMAP.md - Bump VERSION to 1.3.0	2026-04-20 18:11:26 +02:00
tomas.kracmar	b4e504a87b	feat: intent-aware querying + smart sampling for large audit datasets - Add keyword-based intent extraction: 'device' → Intune, 'user' → Directory, etc. - Broad questions without intent auto-exclude noisy services (Exchange, SharePoint) - Smart stratified sampling: failures always included, high-value services prioritised - Fetch up to 1000 events from MongoDB, then curate best 200 for the LLM - Excluded services noted in LLM prompt and query_info so the admin knows the scope	2026-04-20 17:41:21 +02:00
tomas.kracmar	a255be93fe	feat: aggregate large event sets before sending to LLM. When a query matches >50 events, the LLM now receives: - Aggregated counts by service, operation, result, and actor - A list of failures (up to 10) - The 50 most recent raw events as samples This scales to thousands of events without blowing the token budget or losing signal. The LLM gets a bird's-eye view plus concrete examples. Also updates the system prompt to handle both individual event lists and aggregated overviews correctly.	2026-04-20 16:23:55 +02:00
tomas.kracmar	cfe9397cc5	feat: raise LLM event limit to 200 and show total count awareness - Bump LLM_MAX_EVENTS default from 50 to 200 - Add total_matched count to /api/ask response - Include 'Showing X of Y total' header in LLM prompt so the model knows when its view is a subset and avoids false certainty - Update system prompt to instruct acknowledging scale when truncated - Update test mocks to accept new total parameter	2026-04-20 16:13:52 +02:00
tomas.kracmar	cf0283b20b	feat: natural language queries respect UI filters (v1.2.0) - AskRequest now accepts optional filter fields: services, actor, operation, result, start, end, include_tags, exclude_tags - ask_question merges NL-extracted constraints with explicit UI filters - Frontend sends active filter state with every ask request - Show filter hint below ask input when filters are active - Add tests for service+result filtering and actor filtering in /api/ask Bump version to 1.2.0	2026-04-20 16:07:35 +02:00
tomas.kracmar	4303b8f02c	fix: use max_completion_tokens and remove temperature for Azure OpenAI compat - Replace max_tokens with max_completion_tokens (required by newer Azure models) - Remove hardcoded temperature (not supported by all model types) - Add response body logging on LLM API errors for easier debugging	2026-04-20 15:55:00 +02:00
tomas.kracmar	9ec193ea13	feat: expose LLM error reason in /api/ask response and UI - Add llm_error field to AskResponse so users know why AI summarisation was skipped - Show orange warning banner in frontend when LLM is not configured or call fails - Update AskEndpoint tests to assert llm_error presence	2026-04-20 15:45:32 +02:00
tomas.kracmar	be319688f6	feat: add Azure OpenAI / MS Foundry support for /api/ask - Add LLM_API_VERSION config for Azure api-version query param - Detect Azure endpoints and use api-key header instead of Bearer - Handle base URLs that already include /chat/completions path - Update .env.example with Azure OpenAI guidance	2026-04-20 15:28:12 +02:00
tomas.kracmar	22d237fbfb	style: apply ruff fixes	2026-04-20 15:21:34 +02:00
tomas.kracmar	0ef50c91f7	feat: natural language query + production hardening Features: - Add /api/ask endpoint for plain-language audit log queries - Regex-based time/entity extraction (no LLM required for parsing) - LLM-powered narrative summarisation with OpenAI-compatible APIs - Graceful fallback to structured bullet lists when LLM is unavailable - Frontend ask panel with markdown rendering and cited events Production: - Harden Dockerfile: non-root user, gunicorn+uvicorn workers - Add docker-compose.prod.yml with internal networks and health checks - Add nginx reverse proxy with security headers - MongoDB no longer exposed externally in production Tests: - 29 new tests for ask parsing, query building, and endpoint behaviour - Fix conftest monkeypatch for routes.ask events collection Bump version to 1.1.0	2026-04-20 15:10:55 +02:00
tomas.kracmar	4713b43afe	style: apply ruff formatting to all backend files	2026-04-16 18:58:41 +02:00
tomas.kracmar	b86539399b	fix(ci): resolve ruff SIM108 lint error and use github.token for registry login	2026-04-16 18:55:52 +02:00
tomas.kracmar	3761aa6d74	feat(tags): add bulk tagging and tag-based filtering - Add include_tags/exclude_tags query params to /api/events - Add POST /api/events/bulk-tags endpoint with append/replace modes - Frontend: add Include tags / Exclude tags filter inputs - Frontend: add Bulk tag matching button with prompt for tag and mode - Update filter layout to accommodate new tag fields - Add tests for tag filtering and bulk tag append/replace	2026-04-16 18:50:57 +02:00
tomas.kracmar	82bafc06c9	fix(auth): resolve JWT InvalidSignatureError and improve frontend UX - Fix auth by using idToken fallback when accessToken audience mismatches - Add PyJWT verification with audience-aware token selection in frontend - Source health: track last_attempt_time and error status per source - Frontend: fix modal outside x-data scope, add circular-safe JSON stringify - Frontend: support multi-select service filter with All/None toggles - Frontend: improve filter layout into organized rows - Frontend: fix text overflow and result pill colors (success/succeeded) - Intune: normalize application actors (auditActorType=Application) - Add cache-control middleware for HTML/API responses - Update tests for multi-service filtering and source health	2026-04-16 11:32:45 +02:00
tomas.kracmar	b35cac42e0	feat: implement Phase 4 enhancements - Migrate frontend to Alpine.js for reactive state management - Add source health dashboard in UI and /api/source-health endpoint - Add event tagging (PATCH /api/events/{id}/tags) and commenting (POST /api/events/{id}/comments) - Add CSV/JSON export from the UI - Add rule-based alerting engine (rules.py) with CRUD endpoints (/api/rules) - Add SIEM export via webhook (siem.py) - Add AOC audit trail middleware logging all mutations to aoc_audit collection - Update config with SIEM_ENABLED, SIEM_WEBHOOK_URL, ALERTS_ENABLED - Add tests for rules engine, tags, comments, and source health	2026-04-14 15:38:39 +02:00
tomas.kracmar	b0198012eb	feat: implement Phase 3 scaling - Replace skip-based pagination with cursor-based pagination (timestamp|_id cursors) - Add Prometheus /metrics endpoint with request latency, fetch volume, and error counters - Implement incremental fetch watermarking per source (watermarks collection in MongoDB) - Add Graph change notification webhook endpoint (/api/webhooks/graph) - Add correlation ID middleware for distributed tracing (x-request-id header) - Update frontend to use cursor-based pagination with Prev/Next navigation - Update tests for cursor pagination, metrics, webhooks, and watermark mocking	2026-04-14 14:58:50 +02:00
tomas.kracmar	9271b4e461	feat: implement Phase 2 stabilization - Cache Graph API tokens with expiry-aware reuse in graph/auth.py - Add tenacity-based retry/backoff wrapper (utils/http.py) and apply to all Graph/source API calls - Add Pydantic request/response models (models/api.py) and FastAPI query constraints - Add unit tests for event_model, auth and integration tests for API endpoints - Configure ruff linter/formatter in pyproject.toml - Add GitHub Actions CI pipeline (.github/workflows/ci.yml) - Add requirements-dev.txt with pytest, mongomock, httpx, ruff - Clean up typing imports and fix ruff linting across codebase	2026-04-14 12:02:28 +02:00
tomas.kracmar	4f6e16d64d	feat: implement Phase 1 hardening - Verify JWT signatures via JWKS in auth.py - Fix broken frontend auth button references - Add Pydantic Settings for env validation (RETENTION_DAYS, CORS_ORIGINS) - Create MongoDB indexes + TTL on startup - Add /health endpoint and CORS middleware - Escape regex input in event queries - Fix dedupe() return calculation in maintenance.py - Replace basic logging with structured structlog JSON logs - Update README and add ROADMAP.md	2026-04-14 11:48:29 +02:00
tomas.kracmar	205b69713e	Added authentication	2025-11-29 14:19:34 +01:00
tomas.kracmar	47f4a22bef	Added periodic fetch	2025-11-29 09:48:50 +01:00
tomas.kracmar	90f0e14f6e	First version	2025-11-28 21:43:44 +01:00

33 Commits
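
The deduplication rule described in e348881083 ("same rule + actor within 15 min = one alert") can be illustrated with a minimal in-memory sketch. This is not the repository's code: the class and method names are hypothetical, and it assumes the window is measured from the last alert that was actually emitted, which the commit message does not specify. Only the 15-minute default and the ALERT_DEDUPE_MINUTES setting come from the log above.

```python
from datetime import datetime, timedelta

# Default window from the commit message; the real value is read from
# the ALERT_DEDUPE_MINUTES env var (how it is loaded is not shown here).
DEDUPE_MINUTES = 15


class AlertDeduper:
    """Hypothetical sketch: at most one alert per (rule, actor) per window."""

    def __init__(self, window_minutes: int = DEDUPE_MINUTES) -> None:
        self.window = timedelta(minutes=window_minutes)
        # Maps (rule_id, actor) to the time the last alert was emitted.
        self._last_alert: dict[tuple[str, str], datetime] = {}

    def should_alert(self, rule_id: str, actor: str, at: datetime) -> bool:
        key = (rule_id, actor)
        last = self._last_alert.get(key)
        if last is not None and at - last < self.window:
            return False  # same rule + actor inside the window: suppress
        self._last_alert[key] = at
        return True


deduper = AlertDeduper()
t0 = datetime(2026, 4, 22, 14, 0)
first = deduper.should_alert("admin-role-assignment", "alice@example.com", t0)
repeat = deduper.should_alert(
    "admin-role-assignment", "alice@example.com", t0 + timedelta(minutes=5)
)
later = deduper.should_alert(
    "admin-role-assignment", "alice@example.com", t0 + timedelta(minutes=20)
)
other_actor = deduper.should_alert(
    "admin-role-assignment", "bob@example.com", t0 + timedelta(minutes=5)
)
```

A persistent deployment would presumably keep this state in the database rather than in process memory, so that deduplication survives restarts and works across workers.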