17 episodes

The Software Misadventures Podcast is where we sit down with software and devops experts to hear their stories from the trenches about how software breaks in production.

The show hosts - Ronak, Austin and Guang - have seen first hand how stressful it is when something breaks in production but they have also found it to be the best opportunity to learn about a system more deeply. On this show, they discuss with fellow engineers the principles and practical tips to build resilient software as well as advice to grow as technical leaders.

Learn more at https://softwaremisadventures.com.

Software Misadventures Ronak Nathani, Austin Ouyang, Guang Yang

    • Technology
    • 4.7 • 7 Ratings

The Software Misadventures Podcast is where we sit down with software and devops experts to hear their stories from the trenches about how software breaks in production.

The show hosts - Ronak, Austin and Guang - have seen first hand how stressful it is when something breaks in production but they have also found it to be the best opportunity to learn about a system more deeply. On this show, they discuss with fellow engineers the principles and practical tips to build resilient software as well as advice to grow as technical leaders.

Learn more at https://softwaremisadventures.com.

    Cory Watson - Leading observability teams at Twitter & Stripe, how to succeed in a new org, effective ways to advocate for your team and more - #16

    Cory Watson - Leading observability teams at Twitter & Stripe, how to succeed in a new org, effective ways to advocate for your team and more - #16

    Cory is currently a Solutions Engineer at Jeli.io. Prior to this, he led observability teams at Twitter & Stripe. We talk to him about how he got his start in customer support and the role it played in the later part of his career. We discuss his time at Twitter where there was a power outage in the data center on the day he joined. We also discuss how he created & led the observability team at Stripe, how one can succeed in a new org & what are some effective ways to advocate for your team.

    • 1 hr 24 min
    Ashwin Kumar - On learning new things by breaking them down, the secret to winning >$100k from hackathons, the art of storytelling, and much more - #15

    Ashwin Kumar - On learning new things by breaking them down, the secret to winning >$100k from hackathons, the art of storytelling, and much more - #15

    Ashwin is a Startup Partnership Lead at Stripe. From web development to co-founding a YC startup, to deep learning, Ashwin has a knack for picking up new skills extremely quickly. In this episode, we chat about the methods he employed to successfully make these transitions, learnings/tips from winning 30+ hackathons in a row, and what engineers can gain from better story-telling.

    • 1 hr 14 min
    Bruno Connelly - Building and leading the global SRE org at LinkedIn - #14

    Bruno Connelly - Building and leading the global SRE org at LinkedIn - #14

    Bruno Connelly is a VP of Engineering at LinkedIn. In this episode, we talk about his journey in tech - from teaching himself how to code to leading the SRE org at LinkedIn. We also discuss the early days at LinkedIn, building technical stability, growing the org globally and his leadership style.

    Website link: https://softwaremisadventures.com/bruno

    Music Credits:
    Vlad Gluschenko — Forest
    License: Creative Commons Attribution 3.0
    Unported: https://creativecommons.org/licenses/by/3.0/deed.en

    • 1 hr 6 min
    Lorin Hochstein - On how Netflix learns from incidents, software as socio-technical systems, writing persuasively and more - #13

    Lorin Hochstein - On how Netflix learns from incidents, software as socio-technical systems, writing persuasively and more - #13

    With 5+ years of experience building resilient systems at the Netflix scale, Lorin joins the show to chat about his favorite incident story, the path that led him to doing chaos engineering (and later away from it), and advocating for a dedicated analyst to talk to people after an incident. Throughout the conversation, Lorin shares his philosophy and tips on how to learn from incidents, what engineers can gain from writing better, and why some metrics may not be as useful as you think.

    • 1 hr 24 min
    Spoons (Daniel Spoonhower) - On building Lightstep, being customer focused, developing systems at Google scale and much more - #12

    Spoons (Daniel Spoonhower) - On building Lightstep, being customer focused, developing systems at Google scale and much more - #12

    Spoons is the Co-founder and Chief Architect of Lightstep. He joins the show to talk about building systems at Google scale and various aspects that make Google a weird place than other companies. We talked about Spoons's journey of leaving Google and deciding to join Lightstep as a co-founder. We dig into the challenges during the early days of Lightstep and discuss the importance of speaking to customers to build the right product.

    • 1 hr 14 min
    Emmanuel Ameisen - On production ML at Stripe scale, leading 100+ ML projects, iterating fast, and much more - #11

    Emmanuel Ameisen - On production ML at Stripe scale, leading 100+ ML projects, iterating fast, and much more - #11

    Having led 100+ ML projects at Insight and built ML systems at Stripe scale, Emmanuel joins the show to chat about how to build useful ML products and what happens next when the model is in production. Throughout the conversation, Manu shares stories and advice on topics like the common mistakes people make when starting a new ML project, what’s similar and different about the lifecycle of ML systems compared to traditional software, and writing a technical book.

    • 1 hr 12 min

Customer Reviews

4.7 out of 5
7 Ratings

7 Ratings

Top Podcasts In Technology

You Might Also Like