Question 9 of 10Pro Only
What strategies can an SRE team use to reduce Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR) for production incidents?
Sample answer preview
MTTD and MTTR are the two most actionable metrics for measuring and improving incident response. MTTD measures how long it takes from when a problem starts to when the team becomes aware of it. MTTR measures how long from detection to full recovery.
MTTDMTTRsynthetic monitoringanomaly detectionrollbackfeature flags