

Slight Reliability
Stephen Townshend
Learning SRE, one day at a time.
Episodes
Mentioned books

Nov 21, 2023 • 45min
Slight Reliability Episode 76 - Sampling Distributed Traces with Paige Cruz
Send us Fan MailPaige Cruz (from Chronosphere) is back. This week we discuss sampling. What is sampling? Why do it? What kinds of sampling are there?You can check out Chronosphere's cloud native observability platform here: https://chronosphere.io/You can find Paige on:LinkedIn: https://www.linkedin.com/in/paigerduty/X: https://twitter.com/paigerdutyYou can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/X: https://twitter.com/the_kiwi_sreInstagram: https://www.instagram.com/slight_reliability/

Nov 20, 2023 • 38min
Slight Reliability Episode 79 - Incident Story Time with Valeska Victoria
Send us Fan MailThis week Valeska Victoria returns to share some of her experiences working as an SRE at eBay.We look at the cascading effect of production issues in complex integrated environments (how there's often no single root cause), developer literacy of how infrastructure works, the importance of ownership and accountability of reliability, and much more.You can find Valeska on: LinkedIn: https://www.linkedin.com/in/valeska-victoria/You can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/X: https://twitter.com/the_kiwi_sreInstagram: https://www.instagram.com/slight_reliability/

Nov 16, 2023 • 32min
Slight Reliability Episode 78 - Developer Experience with Ankit Jain
Send us Fan MailThis week I chat with Ankit Jain from aviator.co about developer experience.We define developer experience and developer productivity, and how this applies to SRE. We discuss the growing expectation on developers and how this leads to frustration and burnout. We also explore how to measure developer experience and how to start working to make improvements.You can check out Aviator's developer experience platform here: https://www.aviator.co/You can find Ankit on:LinkedIn: https://www.linkedin.com/in/ankitjaindce/You can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/X: https://twitter.com/the_kiwi_sreInstagram: https://www.instagram.com/slight_reliability/

Nov 16, 2023 • 5min
December 2023 Update
Send us Fan MailA brief mid-week update on my changing circumstances and the future of the podcast.

Nov 15, 2023 • 32min
Slight Reliability Episode 77 - SRE to DevRel with Liz Fong-Jones
Send us Fan MailThis week I had the privilege of interviewing Liz Fong-Jones from honeycomb.io about DevRel, Developer Advocacy, and how that applies to SRE.We discuss the difference between Developer Relations (DevRel) and Developer Advocacy, how Liz got into advocacy, how DevRel helps companies and the community, and some tips on how to get traction with SRE practices in your organisation.You can check out Honeycomb's observability platform here: https://www.honeycomb.io/You can find Liz on:LinkedIn: https://www.linkedin.com/in/efong/Website: https://www.lizthegrey.com/ (all her social/links are here)You can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/X: https://twitter.com/the_kiwi_sreInstagram: https://www.instagram.com/slight_reliability/

Nov 14, 2023 • 39min
Slight Reliability Episode 75 - Enterprise SRE with Steve McGhee
Send us Fan MailThis week I had the honour of chatting with Steve McGhee (former Google SRE, current Google Reliability Advocate, and co-author of Enterprise Roadmap to SRE).We discuss the evolution of SRE from where it began at Google and how it is being adopted by enterprises around the world now (and why this is happening). We talk about getting leadership support and how we get reliability taken seriously, the lies we tell ourselves to justify incidents and issues, leveraging transformation projects to bring SRE to life, how SLOs can act as the fulcrum between dev and ops, the fallacy of the pyramid model of reliability... and so much more.You can find Steve at on:LinkedIn: https://www.linkedin.com/in/stevemcghee/X: https://twitter.com/stevemcgheeYou can find Steve's book "Enterprise Roadmap to SRE" here: https://sre.google/resources/practices-and-processes/enterprise-roadmap-to-sre/Steve also mentions the book "A Seat at the Table": https://itrevolution.com/product/a-seat-at-the-table/You can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/X: https://twitter.com/the_kiwi_sreInstagram: https://www.instagram.com/slight_reliability/

Oct 31, 2023 • 9min
Slight Reliability Episode 74 - The Hidden Side of Vendor Lock-In
Send us Fan MailThis week on Slight Reliability Stephen discusses observability vendor lock-in. What is it? What does OpenTelemetry do to help? What areas are yet to be solved?You can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/Twitter: https://twitter.com/the_kiwi_sreYouTube: https://www.youtube.com/c/SlightReliabilityInstagram: https://www.instagram.com/slight_reliability/TikTok: https://www.tiktok.com/@the_kiwi_sre

Oct 24, 2023 • 32min
Slight Reliability Episode 73 - Enterprise SLOs with Brian Singer
Send us Fan MailThis week we sit down and talk about SLOs with CPO and co-founder of Nobl9 Brian Singer.We talk about the importance of reviewing operational effectiveness, getting buy in from leadership, using SLOs to reduce noise, how to implement SLOs within different cultures and structures, the parallels between security and reliability... and much more.You can check out Nobl9's reliability and SLO platform here: https://www.nobl9.com/You can find Brian on LinkedIn: https://www.linkedin.com/in/briantsinger/You can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/Twitter: https://twitter.com/the_kiwi_sreInstagram: https://www.instagram.com/slight_reliability/

Oct 17, 2023 • 42min
Slight Reliability Episode 72 - Rapid Incident Response with Valeska Victoria
Send us Fan MailThis week Stephen chats with Valeska Victoria about her time working as an SRE at eBay.Valeska shares her data driven approach to SRE, having a voice as a less experienced engineer, handling incidents under high pressure, leveraging large language models to rapidly find the information you need during an incident, and much more.You can check out PromptOps here: https://www.promptops.com/You can find Valeska on LinkedIn: https://www.linkedin.com/in/valeska-victoria/You can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/Twitter: https://twitter.com/the_kiwi_sreInstagram: https://www.instagram.com/slight_reliability/

Oct 10, 2023 • 29min
Slight Reliability Episode 71 - Implementing SRE with Dr. Vlad Ukis
Send us Fan MailThis week Stephen chats with Dr. Vlad Ukis about his journey discovering, and then implementing SRE practices at Siemens Healthineers (which led to him writing a book). They discuss how the evolution of infrastructure necessitates a shift in how we operate, the power of selling SRE practices, the SRE infrastructure used to build SLOs and reliability capabilities, how he implemented SLOs, and much more.You can find Vlad's book "Establishing SRE Foundations" here: https://www.amazon.com/Establishing-Foundations-Step-Step-Organizations/dp/0137424604 You can find Vlad on LinkedIn: https://www.linkedin.com/in/dr-vladyslav-ukis-5172ba32/You can find the official Slight Reliability podcast website at: https://slightreliability.com/You can find Stephen at:LinkedIn: https://www.linkedin.com/in/stephentownshend/Twitter: https://twitter.com/the_kiwi_sreInstagram: https://www.instagram.com/slight_reliability/


