(adapted from Amazon Web Services)
Mission: Improve the overall quality of our systems by documenting events to identify root causes and address them through trackable action items.
(<= 3 paragraphs)
(>=1 graphs/tables illustrating impact of event)
(embedded mermaid is nice, but links to images is also fine)
gantt
title Timeline of Issue Arrival
dateFormat YYYY-MM-DD HH:mm
(start point): 2021-05-06 11:58, 1h
fallout event 1 : 2021-05-06 15:08, 1h
discussion (detection point): 2021-05-06 15:11, 1d
fallout event 2 : 2021-05-07 16:02, 1h
fix deployed (mitigation point) : 2021-05-08 10:00, 1h
(1-2 paragraph summary of user-facing impact/experience during the event)
(Note: See terminology cheat sheet below the four Q&A's)
(Detection/Mitigation/Diagnosis/Resolution: Four questions, see below.)
Question: How was the event detected (e.g. an alarm? manual?)
Question: How could time to detection be improved? As a thought exercise, how would you have cut the time in half?
Question: How did you reach the point where you knew how to mitigate the impact (here called the "decision point")?
Question: How could time to mitigation be improved? As a thought exercise, how would you have cut the time in half?
Term | Definition |
---|---|
Start Time | when your users first started being impacted |
Detection Time | when the team came to know there was impact. |
Response Time | when a person first started actively working on the problem (not merely acknowledged it) |
Mitigation Time | when the problem was resolved from the user’s point of view |
Resolution Time | when the incident response is “finished” from the responder’s point of view |
(Explain how incident was managed. Include event start and end times, not just team's perception of event.)
Timestamp (Time Zone here) | Event |
---|---|
2021-05-05 22:09 | event description |
2021-05-06 11:48 | event description |
(wikipedia; dig down until root cause is identified)
The Five Whys yield lessons, and lessons yield actions.
(link back to lessons)
(link back to actions, and include near term deadlines)
or
or
By clicking below, you agree to our terms of service.
New to HackMD? Sign up
Syntax | Example | Reference | |
---|---|---|---|
# Header | Header | 基本排版 | |
- Unordered List |
|
||
1. Ordered List |
|
||
- [ ] Todo List |
|
||
> Blockquote | Blockquote |
||
**Bold font** | Bold font | ||
*Italics font* | Italics font | ||
~~Strikethrough~~ | |||
19^th^ | 19th | ||
H~2~O | H2O | ||
++Inserted text++ | Inserted text | ||
==Marked text== | Marked text | ||
[link text](https:// "title") | Link | ||
 | Image | ||
`Code` | Code |
在筆記中貼入程式碼 | |
```javascript var i = 0; ``` |
|
||
:smile: | ![]() |
Emoji list | |
{%youtube youtube_id %} | Externals | ||
$L^aT_eX$ | LaTeX | ||
:::info This is a alert area. ::: |
This is a alert area. |
On a scale of 0-10, how likely is it that you would recommend HackMD to your friends, family or business associates?
Please give us some advice and help us improve HackMD.
Do you want to remove this version name and description?
Syncing