Mean Time to Recover (MTTR)
The mean time to recover (MTTR) metric measures how long it takes to recover from a failure. High-quality code and robust testing and deployment processes help avoid failures, but some failures are caused by external factors. It is therefore important to be prepared to detect a failure quickly and to have processes in place that minimize the time required to recover from it. MTTR is typically calculated as the average outage duration across incidents, expressed in minutes or hours.
The MTTR metric is measured during the Deploy and Operate phases.
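As a rough illustration of the calculation, the sketch below averages outage durations over a set of incidents. The incident timestamps and the function name mean_time_to_recover are hypothetical and exist only for this example; a real setup would likely pull detection and restoration times from a monitoring or incident-tracking system.

```python
from datetime import datetime, timedelta

# Hypothetical incident records: (failure detected, service restored).
incidents = [
    (datetime(2024, 1, 3, 9, 0), datetime(2024, 1, 3, 9, 45)),      # 45 min outage
    (datetime(2024, 1, 17, 14, 30), datetime(2024, 1, 17, 16, 0)),  # 90 min outage
    (datetime(2024, 2, 2, 22, 15), datetime(2024, 2, 2, 22, 30)),   # 15 min outage
]


def mean_time_to_recover(incidents):
    """Return the average outage duration as a timedelta."""
    if not incidents:
        raise ValueError("at least one incident is required")
    # Sum the individual outage durations, starting from a zero timedelta.
    total_downtime = sum((end - start for start, end in incidents), timedelta())
    return total_downtime / len(incidents)


mttr = mean_time_to_recover(incidents)
mttr_minutes = mttr.total_seconds() / 60
print(f"MTTR: {mttr_minutes:.0f} minutes")  # prints "MTTR: 50 minutes"
```

Here the three outages total 150 minutes, so the MTTR is 50 minutes. Whether the clock starts at the moment of failure or at detection is a choice each team has to make; this sketch assumes detection time.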