Monitoring
Monitoring applications
Key for where information can be obtained: (K)ubernetes, (5) ae5_tools, (S)ystem/OS, (A)pplication (Ac: canned/automatic, Au: user/custom functionality), (W)ebserver, (P)roxy in front of your apps, (?) unknown
- Ensure they stay up
- (Ac) Currently up
- (K) Uptime (duration and percentage)
- (P/Ac/?) User failure rate (number and percentage)
- Resource utilization (under/over-utilization)
- (K) Memory usage
- (K) CPU usage
- (K) Number of times memory limits were reached (approximated by k8s restarts)
- Popularity
- (Ac) Number of visitors
- (Ac) Duration of visits
- (Ac) Bounce rate?
- (Ac) Open sessions
- Responsiveness
- (Ac) Time to first render
- (Ac) Responsiveness inside application
- Fine-grained tracking/monitoring
- (Au) Which parts of the app are used by visitors?
- (Au) What error conditions are encountered?
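A few of the metrics in the list above can be sketched in Python. This is a minimal illustration, not a definitive implementation: the function names and the sample data are hypothetical, and the pod dictionary is assumed to follow the shape of Kubernetes pod-list JSON (`status.containerStatuses[].restartCount` and `lastState.terminated.reason`).

```python
def uptime_percentage(up_seconds: float, total_seconds: float) -> float:
    """Uptime as a percentage of the observed window."""
    if total_seconds <= 0:
        raise ValueError("total_seconds must be positive")
    return 100.0 * up_seconds / total_seconds


def failure_rate(failed_requests: int, total_requests: int) -> float:
    """User failure rate as a percentage; 0.0 when there was no traffic."""
    if total_requests == 0:
        return 0.0
    return 100.0 * failed_requests / total_requests


def restart_summary(pod_list: dict) -> dict:
    """Per-pod restart totals, used as a proxy for memory limits being hit.

    Restarts can have other causes, so the last termination reason is also
    checked for "OOMKilled" where the data includes it.
    """
    summary = {}
    for pod in pod_list.get("items", []):
        name = pod["metadata"]["name"]
        statuses = pod.get("status", {}).get("containerStatuses", [])
        restarts = sum(cs.get("restartCount", 0) for cs in statuses)
        oom_killed = any(
            cs.get("lastState", {}).get("terminated", {}).get("reason") == "OOMKilled"
            for cs in statuses
        )
        summary[name] = {"restarts": restarts, "oom_killed": oom_killed}
    return summary


# Hypothetical sample data, made up for illustration only.
SAMPLE_POD_LIST = {
    "items": [
        {
            "metadata": {"name": "app-a"},
            "status": {
                "containerStatuses": [
                    {
                        "restartCount": 3,
                        "lastState": {"terminated": {"reason": "OOMKilled"}},
                    }
                ]
            },
        },
        {
            "metadata": {"name": "app-b"},
            "status": {"containerStatuses": [{"restartCount": 0, "lastState": {}}]},
        },
    ]
}

if __name__ == "__main__":
    print(restart_summary(SAMPLE_POD_LIST))
    print(round(uptime_percentage(23 * 3600, 24 * 3600), 2))
    print(failure_rate(5, 200))
```

In practice the pod list could be loaded with `json.loads` from the output of `kubectl get pods -o json`; the sample dictionary only stands in for that output.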
Domain conditions
- (Au) Some metric exceeds threshold
- (Au) App raises explicit error condition
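The two domain conditions above could look roughly like this inside an application. This is a sketch under stated assumptions: `DomainConditionError`, `Alert`, `check_thresholds`, and the metric names are all hypothetical and not part of any existing API.

```python
from dataclasses import dataclass


class DomainConditionError(Exception):
    """Hypothetical explicit error condition raised by the app itself."""


@dataclass(frozen=True)
class Alert:
    metric: str
    value: float
    threshold: float


def check_thresholds(metrics: dict, thresholds: dict) -> list:
    """Return an Alert for every metric whose value exceeds its threshold.

    Metrics without a configured threshold, and thresholds without a
    reported metric, are ignored.
    """
    alerts = []
    for name, limit in thresholds.items():
        value = metrics.get(name)
        if value is not None and value > limit:
            alerts.append(Alert(name, value, limit))
    return alerts


def require_non_empty(rows: list) -> list:
    """Example of an explicit, app-raised error condition."""
    if not rows:
        raise DomainConditionError("query returned no rows")
    return rows


if __name__ == "__main__":
    # Hypothetical metric names and limits.
    print(check_thresholds({"queue_depth": 120, "latency_ms": 80},
                           {"queue_depth": 100, "latency_ms": 250}))
```

Keeping the threshold check as a pure function over plain dictionaries makes it easy to reuse for any metric the app author chooses to expose.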
Indicators
- Traffic light
- Speedometer/Gauge
- Donut
- KPI (numeric absolute/change)
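As a rough illustration of how raw values might feed two of these indicators, the sketch below maps a value to a traffic-light colour and computes a KPI with absolute and relative change. The function names, the colour names, and the warning/critical limits are assumptions made for the example.

```python
from typing import Optional


def traffic_light(value: float, warn: float, crit: float) -> str:
    """Map a value to a colour, assuming higher values are worse."""
    if value >= crit:
        return "red"
    if value >= warn:
        return "amber"
    return "green"


def kpi(current: float, previous: float) -> dict:
    """KPI with absolute change and percentage change.

    The percentage change is None when the previous value is zero,
    because the relative change is undefined in that case.
    """
    change = current - previous
    change_pct: Optional[float] = None
    if previous != 0:
        change_pct = 100.0 * change / previous
    return {"value": current, "change": change, "change_pct": change_pct}


if __name__ == "__main__":
    # Hypothetical limits: warn at 70% memory use, critical at 90%.
    print(traffic_light(0.95, warn=0.7, crit=0.9))
    print(kpi(120, 100))
```

A gauge or donut would typically consume the same inputs (a value plus its limits or total), so only the rendering differs between these indicator types.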