Warning
🚧 Work in Progress: This page is currently under construction. Content may be incomplete or subject to change. To contribute, see the contribution guide.
Alert Catalog
Registry of all configured production alerts.
Alerts by system
| System | Alert | Condition | Severity | Notification channel |
|---|---|---|---|---|
| BigQuery / Airflow | DAG failure | task_state = failed | P2 | Teams tech-incidents + email |
| BigQuery / Airflow | DAG delayed | execution exceeds SLA by 30 min | P3 | Data squad email |
| GCP | Cost above threshold | billing > R$ X | P3 | Leadership email |
| (fill in) | (fill in) | (fill in) | (fill in) | (fill in) |
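The two Airflow conditions in the table can be sketched as plain predicates. This is only an illustration of how the conditions might be read, not the actual alert configuration: the function names, the `SLA_GRACE` constant, and the interpretation of "exceeds SLA by 30 min" as "runtime is more than 30 minutes past the SLA" are assumptions.

```python
from datetime import datetime, timedelta

# Assumption: the 30 min in the "DAG delayed" row is a grace period on top of the SLA.
SLA_GRACE = timedelta(minutes=30)


def dag_failed(task_state: str) -> bool:
    """'DAG failure' row (P2): the task ended in the 'failed' state."""
    return task_state == "failed"


def dag_delayed(started_at: datetime, sla: timedelta, now: datetime) -> bool:
    """'DAG delayed' row (P3): runtime is more than SLA_GRACE past the SLA."""
    return (now - started_at) > sla + SLA_GRACE
```

With a hypothetical one-hour SLA, a run that started at 08:00 would count as delayed from 09:31 onward under this reading.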
How to add a new alert
- Configure the alert in the monitoring platform (GCP Monitoring / Azure Monitor)
- Document it in the table above with: system, alert, condition, severity, notification channel
- Test the alert in staging before activating in production
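The documentation step above could be supported by a small helper that checks a new entry before it is added to the table. This is a minimal sketch, not an existing tool: the `AlertEntry` class, its methods, and the rule that severities look like `P` followed by a digit (inferred from the P2 and P3 values in the table) are all assumptions.

```python
import re
from dataclasses import astuple, dataclass


@dataclass(frozen=True)
class AlertEntry:
    """Hypothetical record mirroring the five columns of the catalog table."""

    system: str
    alert: str
    condition: str
    severity: str
    channel: str

    def validate(self) -> list:
        """Return a list of problems; an empty list means the entry is complete."""
        problems = [
            f"{name} is empty"
            for name, value in vars(self).items()
            if not value.strip()
        ]
        # Assumption: severities follow the P<digit> pattern seen in the table.
        if self.severity.strip() and not re.fullmatch(r"P\d", self.severity):
            problems.append(f"severity {self.severity!r} does not look like P<digit>")
        return problems

    def to_row(self) -> str:
        """Render the entry as a row in the same pipe-table layout as the catalog."""
        return "| " + " | ".join(astuple(self)) + " |"


# Example using a row that already exists in the catalog.
entry = AlertEntry(
    system="BigQuery / Airflow",
    alert="DAG failure",
    condition="task_state = failed",
    severity="P2",
    channel="Teams tech-incidents + email",
)
if not entry.validate():
    print(entry.to_row())
```

Keeping the check separate from the rendering means an incomplete entry can be reported with all of its problems at once instead of failing on the first missing field.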