The Stack Overflow Podcast
Podcast #111 – SRE: Occasionally Maintaining Infrastructure That You Hate
Welcome to a very special episode of The Stack Overflow Podcast recorded Thursday, June 8 at Stack Overflow HQ in NYC. Why is this a very special episode, you ask? It's all about Site Reliability Engineers (SRE). Joel and Jay are away (slackers) so David is running the show along with Ilana and 3 members of our very own SRE team. Tom Limoncelli, Mark Henderson, and Jason Harvey are here to discuss how we build and maintain Stack Overflow and the Stack Exchange network. So much great stuff in this episode:
- Tom's rants are much calmer than Joel's, but just as infuriating. Stop hiding buttons, Google!
- Always tip your bartender and use a "USB condom".
- Tom and David are fascinated by our Ping Pong Buddy, for some reason.
- We're still waiting for Tom's protocol droid. I, for one, welcome our new robot overlords.
- Differences between Stack Overflow and other SRE teams. *cough, we're NOT in the cloud, cough*
- Special SRE edition of Failure is Not an Option. (Apparently Mark is a master criminal. We're not sure that he moved out of Australia voluntarily…)
- In the News: How not to react when someone makes a mistake.
- Oh, and btw, the SRE team is hiring!
- Joel on Software post referenced in Tom's rant. Short and to the point.
- Web Pages That Suck is still a thing, albeit outdated (2015).
- OMTR - Otterbox uniVERSE and Resurgence battery cases.
- Did we mention that we're hiring!
- Buy Tom's Book: The Practice of System and Network Administration: Vol 1 and Vol 2
- From the News: via Reddit - First Day Database Destruction. Who's fault was it, really?