Pitfalls in Measuring SLOs - With Animation

Presented by

Liz Fong-Jones, Principal Developer Advocate & Danyel Fisher, Principal Design Researcher

About this talk

We built support for SLOs (Service Level Objectives) against our event store so we could monitor our own complex distributed system. In the process, we learned that there were a number of important aspects we had not anticipated from carefully reading the SRE workbook. This talk is the story of the missing pieces, the unexpected pitfalls, and how we solved those problems. We’ll share how we brought the theory of SLOs into practice and how we iterated along the way. We’ll discuss implementing the SLO feature and burn alerts, and our experience working with the SRE team who started using the alerts. Our hope is that when you buy or build your SLO tools, you’ll know what to look for and how to get started; that implementors will be able to start from more solid ground; and that we will be able to advance the state of SLO support for all teams that wish to implement them.

Honeycomb Learn is a webcast series designed to educate DevOps teams about what observability is and how to get started. Observability-driven development is the ability to ask any question about your production systems so you can better understand and debug incidents when they occur. With it, teams ship code confidently, learn continuously, and spend more time innovating.