- Cloud Observability in Action - Michael Hausenblas - 2024 - End-to-end observability: logs, metrics, traces; Prometheus, OpenTelemetry.
- Observability with Grafana - Rob Chapman & Peter Holmes - 2024 - Implementing observability using the Grafana stack.
- Monitoring Cloud-Native Applications - Nicolas M. Chaillan - 2021 - Kubernetes monitoring with Prometheus, InfluxDB, and Grafana.
- Site Reliability Engineering - Beyer, Jones, Petoff, Murphy - 2016 - SLOs, toil reduction, and operating at scale — free online.
- Prometheus: Up & Running - Brian Brazil - 2018 - Metrics and monitoring with Prometheus.