86 episodes

Learning SRE, one day at a time.

Slight Reliability Stephen Townshend

    • Technology
    • 5.0 • 2 Ratings

Learning SRE, one day at a time.

    Slight Reliability Episode 84 - Clinical Troubleshooting with Dan Slimmon

    Slight Reliability Episode 84 - Clinical Troubleshooting with Dan Slimmon

    This week I chat with Dan Slimmon about applying the approach doctors use to treat patient symptoms during incident response.
    You can find Dan's blog at https://blog.danslimmon.com/ or connect with him on LinkedIn here: https://www.linkedin.com/in/danslimmon/
    You can find the official Slight Reliability podcast website at: https://slightreliability.com/
    You can find Stephen at:
    LinkedIn: https://www.linkedin.com/in/stephentownshend/
    Twitter: https://twitter.com/the_kiwi_sre
    YouTube: https://www.youtube.com/c/SlightReliability
    Instagram: https://www.instagram.com/slight_reliability/
    TikTok: https://www.tiktok.com/@the_kiwi_sre
    This episode was sponsored by SquaredUp. SquaredUp combines all your data with awesome dashboards, analytics, health rollup, and notifications, into a unified observability portal. Using a data mesh architecture, SquaredUp is a beautifully simple way to get instant access to the insights that matter, whenever you need them. If you want to know more head over to https://squaredup.com/ to sign up for your free account.

    • 27 min
    Slight Reliability Episode 83 - An Unfulfilled Promise with Itiel Shwartz

    Slight Reliability Episode 83 - An Unfulfilled Promise with Itiel Shwartz

    This week I hear about all things Kubernetes from Komodor CTO and co-founder Itiel Shwartz. We chat about the promise that was made when Kubernetes first entered the industry, the challenge of getting developers engaged and capable of working in Kubernetes, my hate/hate relationship with Helm but its important contribution to the Kubernetes project, Kubernetes observability, and so much more.

    You can find the Kubernetes for Humans podcast here:
    https://komodor.com/blog/the-kubernetes-for-humans-podcast/
    Or find out more about Komodor here:
    https://komodor.com/
    Or find Itiel on LinkedIn: https://www.linkedin.com/in/itiel-shwartz-18542853/

    You can find the official Slight Reliability podcast website at: https://slightreliability.com/
    You can find Stephen at:
    LinkedIn: https://www.linkedin.com/in/stephentownshend/
    Twitter: https://twitter.com/the_kiwi_sre
    YouTube: https://www.youtube.com/c/SlightReliability
    Instagram: https://www.instagram.com/slight_reliability/
    TikTok: https://www.tiktok.com/@the_kiwi_sre

    This episode was sponsored by SquaredUp. SquaredUp combines all your data with awesome dashboards, analytics, health rollup, and notifications, into a unified observability portal. Using a data mesh architecture, SquaredUp is a beautifully simple way to get instant access to the insights that matter, whenever you need them. If you want to know more head over to https://squaredup.com/ to sign up for your free account.

    • 30 min
    Slight Reliability Episode 82 - CI/CD with Amin Astaneh

    Slight Reliability Episode 82 - CI/CD with Amin Astaneh

    This week I sit down and have a discussion with Amin Astaneh (from Certo Modo) about CI/CD. We cover the power of the standard change as a way to navigate ITIL while still implementing DevOps practices, what to monitor to make your CI/CD observable, single piece flow, testing in production, and so much more.

    You can find Amin on his company website https://certomodo.io, LinkedIn: https://www.linkedin.com/in/aminastaneh/ and Twitter: https://twitter.com/aastaneh

    You can find the official Slight Reliability podcast website at: https://slightreliability.com/
    You can find Stephen at:
    LinkedIn: https://www.linkedin.com/in/stephentownshend/
    Twitter: https://twitter.com/the_kiwi_sre
    YouTube: https://www.youtube.com/c/SlightReliability
    Instagram: https://www.instagram.com/slight_reliability/
    TikTok: https://www.tiktok.com/@the_kiwi_sre

    This episode was sponsored by SquaredUp. SquaredUp combines all your data with awesome dashboards, analytics, health rollup, and notifications, into a unified observability portal. Using a data mesh architecture, SquaredUp is a beautifully simple way to get instant access to the insights that matter, whenever you need them. If you want to know more head over to https://squaredup.com/ to sign up for your free account.

    • 25 min
    Slight Reliability Episode 81 - Incident Management in Non-Prod Environments

    Slight Reliability Episode 81 - Incident Management in Non-Prod Environments

    "Environment issues are just incidents that happened to occur in a non-production environment"... so why do we treat them so differently?

    In this first episode of the 2024 season I reflect on how we handle incidents in non-prod environments.

    (Note: Had a few issues with noise suppression in OBS Studio cutting off the start of some words, will sort it for the next episode)

    You can find Stephen at:

    LinkedIn: https://www.linkedin.com/in/stephentownshend/
    Twitter: https://twitter.com/the_kiwi_sre
    YouTube: https://www.youtube.com/c/SlightReliability
    Instagram: https://www.instagram.com/slight_reliability/
    TikTok: https://www.tiktok.com/@the_kiwi_sre

    • 10 min
    Slight Reliability Episode 80 - What's Been Bugging Niall Murphy

    Slight Reliability Episode 80 - What's Been Bugging Niall Murphy

    This week I speak with co-author of the original SRE book + the SRE workbook, and renowned speaker Niall Murphy.

    We chat about the state of SRE in the current macro-economic climate and how we're not yet doing a very good job at articulating the value of SRE to leaders, the relationship that velocity and reliability have, the value of new features versus reliability improvements, and *much* more.

    You can find Niall at:

    LinkedIn: https://www.linkedin.com/in/niallm/
    X: https://twitter.com/niallm
    Website: https://relyabilit.ie/

    (and his company Stanza: https://www.stanza.systems/)

    You can find the official Slight Reliability podcast website at: https://slightreliability.com/
    You can find Stephen at:
    LinkedIn: https://www.linkedin.com/in/stephentownshend/
    X: https://twitter.com/the_kiwi_sre
    Instagram: https://www.instagram.com/slight_reliability/

    • 36 min
    Slight Reliability Episode 76 - Sampling Distributed Traces with Paige Cruz

    Slight Reliability Episode 76 - Sampling Distributed Traces with Paige Cruz

    Paige Cruz (from Chronosphere) is back. This week we discuss sampling. What is sampling? Why do it? What kinds of sampling are there?

    You can check out Chronosphere's cloud native observability platform here: https://chronosphere.io/

    You can find Paige on:

    LinkedIn: https://www.linkedin.com/in/paigerduty/
    X: https://twitter.com/paigerduty

    You can find the official Slight Reliability podcast website at: https://slightreliability.com/
    You can find Stephen at:
    LinkedIn: https://www.linkedin.com/in/stephentownshend/
    X: https://twitter.com/the_kiwi_sre
    Instagram: https://www.instagram.com/slight_reliability/

    • 45 min

Customer Reviews

5.0 out of 5
2 Ratings

2 Ratings

Top Podcasts In Technology

Lex Fridman Podcast
Lex Fridman
Hard Fork
The New York Times
All-In with Chamath, Jason, Sacks & Friedberg
All-In Podcast, LLC
The Gatekeepers
BBC Radio 4
Acquired
Ben Gilbert and David Rosenthal
Darknet Diaries
Jack Rhysider

You Might Also Like