Introduction to Site Reliability Engineering | Datadog

Introduction to Site Reliability Engineering

Join Datadog on Friday, March 27, from 11:30 a.m. – 12:30 p.m. EDT for a virtual introduction to Site Reliability Engineering.

In this session, we'll start with the basics of SRE, including some common terminology and theory, then dive into practical examples—including lessons learned from our own journey here at Datadog. We'll discuss the relationship between SRE and DevOps, what success looks like (and how to measure it), and the power of error budgets to help make your systems and applications more reliable. We'll also cover how to identify and nurture both internal and external talent in order to build a cross-functional team.

SRE is a large, complex topic, and you're sure to have questions; we'll end the session by opening the floor to a live Q&A to dive into the topics you're curious about.

Speakers

event/waldo_grunenwald.png
Waldo Grunenwald Datadog
See more info
event/daniel_maher.png
Daniel Maher Datadog
See more info
event/leo_cavaille.png
Léo Cavaillé Datadog
See more info
event/waldo_grunenwald.png

Waldo Grunenwald

Technical Evangelist
Datadog

Waldo is a Tech Evangelist for Datadog, which means that he gets to travel and meet people, and advocate on their behalf. He is a recovering SRE and Operations Engineer, has been active in the DevOps community for quite some time, and is keen to help organizations stop hurting themselves. Despite being a raging introvert, he enjoys public speaking. In his spare time, he enjoys collecting hobbies that he doesn't have the time to engage in. He hates writing about himself in the third person, and aspires to one day be a better bio writer.

event/daniel_maher.png

Daniel Maher

Technical Evangelist
Datadog

Dan is a veteran of the original dotcom bubble and has since worked in a variety of environments from start-ups to global corporations, including stints as a founder, university lecturer, and a day labourer. Today, Dan is a member of the Devopsdays global team, and a technical evangelist at Datadog.

event/leo_cavaille.png

Léo Cavaillé

Engineering Manager Resiliency/Reliability
Datadog

Léo is the Engineering Manager for Resilience and Reliability teams at Datadog. He’s passionate about incident response and resilience engineering, previously a Staff SRE Léo worked on opening the first new Datadog region in Europe and the migration of the infrastructure to Kubernetes. He also worked as a lead software engineer in the first team that built the APM product from the ground up.

RSVP

By attending this event I acknowledge that I may be exposed to information which the Exhibitor, Datadog, Inc., considers confidential and wishes to limit the disclosure of. I hereby agree to keep such information confidential and to not disclose such information to any third parties. I further represent that I am not employed by a competitor of the Exhibitor and am not attending this Event with the intent to access any such confidential information or gain a competitive advantage for my employer. I hereby give permission to the Exhibitor to use my image, likeness, appearance, voice, and any written or spoken testimonials given by me in connection with the marketing and promotion of this Event and/or any Datadog products or Events.