Is the workload healthy?¶
Periodically verifying workload health and configuring alerts when there's a problem is important for service credibility. The industry has coined this term Observability, and it refers to the collection of logs and the presentation of data from these logs in Dashboards.
This video from Grafana1 explains the concept and how it relates to service reliability.
YouTube video: Grafana for Beginners - What is Observability2
