One of the foundations of incident management in site reliability engineering (SRE) practice is the incident retrospective. It documents all the learnings from an incident and serves as a checklist for follow-up actions.
Read full article on The New Stack