Question 9 of 10

What strategies can an SRE team use to reduce Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR) for production incidents?

Sample answer preview

MTTD and MTTR are the two most actionable metrics for measuring and improving incident response. MTTD measures how long it takes from when a problem starts to when the team becomes aware of it. MTTR measures how long it takes from detection to full recovery.
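As a rough illustration of the two definitions above, the sketch below computes both metrics from incident timestamps. The `Incident` record, its field names, and the sample data are hypothetical and exist only for this example; real incident trackers expose their own schemas, and the start time is usually an estimate made after the fact.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Incident:
    # Hypothetical record; field names are assumptions for this sketch.
    started_at: datetime    # when the problem began (often estimated afterwards)
    detected_at: datetime   # when the team became aware of it
    recovered_at: datetime  # when service was fully restored


def mean_duration(durations):
    """Average a non-empty list of timedeltas."""
    return sum(durations, timedelta()) / len(durations)


def mttd(incidents):
    """Mean time from problem start to detection."""
    return mean_duration([i.detected_at - i.started_at for i in incidents])


def mttr(incidents):
    """Mean time from detection to full recovery."""
    return mean_duration([i.recovered_at - i.detected_at for i in incidents])


# Made-up sample data, for illustration only.
incidents = [
    Incident(datetime(2024, 1, 1, 10, 0), datetime(2024, 1, 1, 10, 10), datetime(2024, 1, 1, 10, 40)),
    Incident(datetime(2024, 1, 2, 9, 0), datetime(2024, 1, 2, 9, 20), datetime(2024, 1, 2, 10, 20)),
]

print(mttd(incidents))  # 0:15:00
print(mttr(incidents))  # 0:45:00
```

Tracking the two numbers separately shows whether time is being lost before the team notices a problem or after it starts responding.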

MTTD, MTTR, synthetic monitoring, anomaly detection, rollback, feature flags
