DEV Community

The GeekNarrator

Site Reliability Engineering Masterclass with Luis Mineiro

In this episode I talk to Luis Mineiro, Senior Director at Delivery Hero about Site Reliability Engineering. 

We have busted a lot of myths around Site Reliability Engineering as a concept. 

We have talked about the following: 

 00:00 Introduction 

02:00 What is SRE? What is NOT SRE? 

10:00 DevOps vs SRE vs Developers 

13:10 When do we need a dedicated SRE team? 

20:00 Traits of a Strong SRE culture 

29:00 Observability vs Monitoring vs Alerting 

41:40 Adding Observability to a Sandiwch Shop 

53:00 How to add distributed tracing to the Sandwich shop? 

57:10 How do we define SLO's for the Sandwich shop? 

01:01:00 How to I define timeouts between services? 

01:05:00 How do I determine cost of adding 9's to my SLOs? 

01:15:00 How do I transition from a Developer to SRE?  

References:  SRE Books from Google:  https://sre.google/books

Luis Linkedin: https://www.linkedin.com/in/lmineiro/

The GeekNarrator Page: https://www.linkedin.com/company/the-...

Kaivalya Apte Linkedin: https://www.linkedin.com/in/kaivalya-... 

I hope you like the video and learn a lot about SRE.  

Cheers,  The GeekNarrator

Episode source