Where we talk about what Service Level Objectives actually are and why they are so important in the field of Site Reliability Engineering. We cover the definition of an SLO, how they relate to error budgets, and take a look at various implementations of time series databases' support for calculating accurate percentiles.


Comments for the episode are welcome - at the bottom of the show notes for the episode there is a Disqus setup, or you can email us at [email protected].


Sponsors for Episode 98:

42 Lines is a DevOps consulting firm specializing in
Observability, Cloud Migration, Cost Control, Security Practices, and Team
Mentoring.


Links for Episode 98:

Atlassian Incident Management
High Availibility Percentage Calculation
Google SRE Book: Embracing Risk
Quantile Definition
Four Golden Signals
Histograms at Scale
VictoriaMetrics Histograms
Circonus Log-Linear Histograms
T-Digests