CAP-124 — Metrics, alerting & log aggregation¶
| Category | Platform & infrastructure |
| Business goal | Not yet linked to a business goal |
| Satisfying module | MOD-076 |
| Mode | ALERT |
| BD owner | BD09 Technology |
| Human needed | Review only |
Collects time-series metrics from all services, evaluates SLO and alerting rules on a 15-second evaluation cycle, and routes pages to the on-call engineer via configured channels. Aggregates structured logs from all services into a central searchable store with 90-day hot retention and 7-year cold retention for regulatory purposes. SLO dashboards provide real-time reliability visibility across all platform components.