CAP-124 — Metrics, alerting & log aggregation

Category: Platform & infrastructure
Business goal: Not yet linked to a business goal
Satisfying module: MOD-076
Mode: ALERT
BD owner: BD09 Technology
Human needed: Review only

Collects time-series metrics from all services, evaluates SLO and alerting rules every 15 seconds, and routes pages to the on-call engineer through the configured channels. Aggregates structured logs from all services into a central, searchable store with 90-day hot retention and 7-year cold retention to meet regulatory requirements. SLO dashboards give real-time visibility into the reliability of all platform components.
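
As an illustration only, the rule evaluation and retention tiers described above might be sketched as follows. The names `AlertRule`, `evaluate_rules`, and `storage_tier` are hypothetical and are not taken from MOD-076. The sketch assumes simple "value above threshold" rules and assumes the 7-year cold retention is measured from ingestion time, ignoring leap days.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Dict, List, Tuple

# Values taken from the capability description.
EVALUATION_INTERVAL = timedelta(seconds=15)
HOT_RETENTION = timedelta(days=90)
# Assumption: 7 years counted from ingestion, leap days ignored.
COLD_RETENTION = timedelta(days=7 * 365)


@dataclass
class AlertRule:
    """Hypothetical rule shape: page `channel` when `metric` exceeds `threshold`."""
    name: str
    metric: str
    threshold: float
    channel: str


def evaluate_rules(rules: List[AlertRule],
                   latest_values: Dict[str, float]) -> List[Tuple[str, str]]:
    """Return (channel, rule name) pairs for every rule that is firing.

    Intended to be called once per EVALUATION_INTERVAL by a scheduler.
    Metrics with no current sample are skipped rather than treated as firing.
    """
    pages = []
    for rule in rules:
        value = latest_values.get(rule.metric)
        if value is not None and value > rule.threshold:
            pages.append((rule.channel, rule.name))
    return pages


def storage_tier(ingested_at: datetime, now: datetime) -> str:
    """Classify a log record as 'hot', 'cold', or 'expired' by its age."""
    age = now - ingested_at
    if age <= HOT_RETENTION:
        return "hot"
    if age <= COLD_RETENTION:
        return "cold"
    return "expired"
```

A scheduler would call `evaluate_rules` once per `EVALUATION_INTERVAL` and hand each returned pair to the paging integration. `storage_tier` shows how a lifecycle job could decide whether a record stays in the searchable store, moves to cold storage, or becomes eligible for deletion.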