This standard aims to keep failures low and resilience high as changes are introduced, by measuring and managing Change Failure Rate (CFR), a core DORA metric. It enables high-velocity delivery without compromising quality, stability, or trust.
Aligned to our "Resilience Over Uptime" and "Secure by Design" policies, this standard drives investment in robust testing, observability, and safe deployment practices. Without it, change introduces risk blindly, erodes confidence, and limits the ability to innovate at pace.

| Category | Description |
|---|---|
| People & Culture | Change failures are seen as individual mistakes. There is a culture of blame or silence. |
| Process & Governance | Failures are not systematically logged. Post-incident learning is informal or skipped. |
| Technology & Tools | No automated rollback or failure detection. Monitoring is reactive and fragmented. |
| Measurement & Metrics | CFR is not tracked. No agreed definition of what counts as a failed change. |

| Category | Description |
|---|---|
| People & Culture | Teams acknowledge change-related incidents and start to share learnings. Safe dialogue is emerging. |
| Process & Governance | Some post-mortems occur after major failures. Processes vary by team or severity. |
| Technology & Tools | Rollbacks or patches are possible but often manual. Basic monitoring flags regressions. |
| Measurement & Metrics | Teams start logging incidents and linking to deployments. Failure tracking is ad hoc. |

| Category | Description |
|---|---|
| People & Culture | Accountability for CFR is shared. Teams reflect on quality and resilience in retros. |
| Process & Governance | A clear definition of failed change is established. Quality reviews include CFR trends. |
| Technology & Tools | Staging, feature flags, and rollback automation are applied to mitigate impact. |
| Measurement & Metrics | CFR is tracked as a delivery metric. Trends are reviewed and discussed regularly. |

| Category | Description |
|---|---|
| People & Culture | CFR metrics are used to guide investment in quality and reliability. Teams are proactive. |
| Process & Governance | CFR thresholds trigger deeper reviews. Learning is shared across the organisation. |
| Technology & Tools | Failures trigger automated remediation steps. Deployments are validated pre- and post-release. |
| Measurement & Metrics | CFR dashboards exist. Root causes are categorised. Remediation time is measured. |

| Category | Description |
|---|---|
| People & Culture | Teams champion resilience culture. CFR drives experiments and quality improvements. |
| Process & Governance | CFR data informs policy and controls. Resilience playbooks evolve through shared learning. |
| Technology & Tools | Failure insights inform platform evolution. Adaptive tooling supports safer changes. |
| Measurement & Metrics | CFR trends influence architectural decisions. Patterns drive cross-cutting initiatives. |

A failed change is one that introduces:
This definition applies across software deployments, infrastructure changes, data platform updates, and operational configuration changes.
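
This standard does not spell out a formula, so the following is only a minimal sketch. It assumes CFR is calculated in the way DORA commonly describes it: failed changes divided by total changes over a period. The `Change` record, its field names, and the `kind` values are hypothetical. The `failed` flag stands in for whatever failed-change definition a team has agreed.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date
from typing import Dict, Optional, Sequence


@dataclass(frozen=True)
class Change:
    """One change to production (hypothetical record shape, not defined by this standard)."""
    change_id: str
    kind: str          # assumed labels: "deployment", "infrastructure", "data", "config"
    deployed_on: date
    failed: bool       # True if the change met the team's agreed failed-change definition


def change_failure_rate(changes: Sequence[Change]) -> Optional[float]:
    """Return failed changes / total changes, or None when there are no changes."""
    if not changes:
        return None
    failed = sum(1 for c in changes if c.failed)
    return failed / len(changes)


def cfr_by_kind(changes: Sequence[Change]) -> Dict[str, Optional[float]]:
    """Break CFR down by change kind so each type of change can be reviewed separately."""
    grouped = defaultdict(list)
    for c in changes:
        grouped[c.kind].append(c)
    return {kind: change_failure_rate(items) for kind, items in grouped.items()}


# Illustrative data only; these records are made up for the example.
changes = [
    Change("chg-001", "deployment", date(2024, 5, 1), failed=False),
    Change("chg-002", "infrastructure", date(2024, 5, 2), failed=True),
    Change("chg-003", "data", date(2024, 5, 3), failed=False),
    Change("chg-004", "config", date(2024, 5, 6), failed=False),
]

cfr = change_failure_rate(changes)
print(f"CFR: {cfr:.0%}")  # 1 failed out of 4 changes -> prints "CFR: 25%"
```

Returning `None` for an empty period, rather than `0.0`, keeps "no changes shipped" distinct from "no changes failed" on a dashboard.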