Comprehensive and Detailed Explanation From Exact Extract:
Anti-fragility focuses on an organization’s ability to respond, adapt, learn, and recover from incidents. The most useful metrics relate to incident detection, response, reliability, and recovery. Deployment frequency, while important in DevOps and DORA metrics, does not directly measure anti-fragility.
From the SRE Workbook, Incident Response section:
“Improving antifragility requires better detection, better recovery mechanisms, and clear reliability goals.”
Key metrics relevant to anti-fragility:
MTTD (Mean Time To Detect) — quicker detection improves resilience
MTTR/RPO — recoverability measures
SLOs — define acceptable reliability thresholds and guide learning
Deployment frequency primarily measures delivery velocity, not resilience.
The Site Reliability Engineering Book emphasizes:
“Antifragility is improved by learning from incidents and strengthening recovery mechanisms rather than by increasing release cadence.”
Why other options are correct for anti-fragility:
A. Mean Time To Detect — critical for detecting failures quickly
B. SLOs — define boundaries for reliability and failure tolerance
D. Recovery Point Objective — measures potential loss during failures
Thus, C is the least useful metric for improving antifragility.
[References:, SRE Workbook, “Incident Response”, Site Reliability Engineering Book, “Postmortem Culture”, Google DORA Research (role of deployment frequency vs. resilience metrics), , ]