24 episodes

On OpenObservability Talks we discuss harnessing the power of open source to advance observability initiatives for DevOps practitioners around the world.

If you'd like to propose a topic for an episode, reach out through https://forms.gle/kRsBNasVHG2xccMr8

You can find us on Twitter @openobserv

OpenObservability Talks Dotan Horovits, Jonah Kowall

    • Technology

    Observability for Developers Demystified - OpenObservability Talks S2E12

    Developers hate monitoring, but we need it. We need it at many points of the software development lifecycle: before deprecating an API, before launching a new feature, after launching the feature, and more. In fact, developers' monitoring needs can vary much more than those of classic Ops monitoring.

    In this episode I’ll host Liran Haimovitch to discuss how to determine what developers should be monitoring, the difference between observability for Dev and for Ops, and how observability fits into our current dev tools, dev stack and dev processes.

    Liran is the Co-Founder and CTO of Rookout. He’s an Observability and Instrumentation expert with a deep understanding of Java, Python, Node, and C++. Liran has broad experience in cybersecurity and compliance from his past roles. When not coding, you can find Liran hosting his podcast, speaking at conferences, writing about his tech adventures, and trying out the local cuisine when traveling.

    The episode was live-streamed on 10 May 2022 and the video is available at https://youtu.be/OaHQp-qnVN0 

    OpenObservability Talks episodes are released monthly, on the last Thursday of each month, and are available for listening on your favorite podcast app and on YouTube.

    We live-stream the episodes on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat.
    https://www.twitch.tv/openobservability
    https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg

    Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://openobservability.io/

    Show Notes:


    Which data do we need to collect for our observability
    How is observability for dev different from ops
    How does observability fit into dev tool stack
    Snapshots provide deep-dive telemetry signal
    Dynamic instrumentation
    Snapshots support in programming languages and runtimes
    Open source standardization around snapshots
    The cost associated with observability
    Google is applying to contribute Istio to the CNCF
    Shopify case study for observability team

    Resources:


    Istio applying to the CNCF: https://istio.io/latest/blog/2022/istio-has-applied-to-join-the-cncf/
    Shopify case study for Observability team: https://ericmustin.substack.com/p/notes-on-an-observability-team?s=r

    Socials:


    Twitter: https://twitter.com/OpenObserv
    Twitch: https://www.twitch.tv/openobservability
    YouTube: https://www.youtube.com/channel/UCLKOtaBdQAJVRJqhJDuOlPg
    Website: https://openobservability.io/

    • 58 min
    OpenSearch 2.0 and beyond with Eli - OpenObservability Talks S2E11

    OpenSearch is a community-driven, open-source search and analytics suite derived from Apache 2.0 licensed Elasticsearch 7.10.2 & Kibana 7.10.2. The OpenSearch project started just over a year ago and is now the open-source alternative to ELK, which is no longer open source. The team has spent much of the last year getting the project going, but there was innovation as well. We will cover and discuss what OpenSearch has accomplished, but more importantly what’s coming next, including a big 2.0 release. We are joined in this episode by Eli Fisher, who is the product lead at AWS, working on the OpenSearch project. He’ll dive into recent launches, including several observability features, and innovations planned for 2.0 and beyond.   

    The podcast episodes are available for listening on your favorite podcast app and on this YouTube channel.  

    We live-stream the episodes, and you’re welcome to join the stream here on YouTube Live or at https://www.twitch.tv/openobservability.

    Have you got an interesting topic you'd like to share in an episode? Reach out to us and submit your proposal at https://forms.gle/9LDkYCmegyS5D8Li7

    • 1 hr 1 min
    SLO Driven Engineering: from Dev to Prod - OpenObservability Talks S2E10

    Google’s SRE Book popularized the concept of Service Level Objective (SLO) and the SLO-driven approach. But what does it really mean to make SLO driven decisions? How can we generate observability and synchronize teams around joint SLOs? And how can we automate SLOs and integrate them into the software release pipeline?

    In this episode I’ll host Andreas Grabner. We’ll discuss the SRE practices, and how to automate SLO from dev all the way to prod. We’ll talk about the open source efforts to standardize the process under the Continuous Delivery Foundation, and about Keptn, the new CNCF open source project that promises to help with this automation.

    Andreas Grabner (@grabnerandi) has 20+ years of experience as a software developer, tester and architect and is an advocate for high-performing cloud-scale applications. He is a contributor and DevRel for the CNCF open source project Keptn (www.keptn.sh). Andreas is also a regular contributor to the DevOps community, a frequent speaker at technology conferences, and regularly publishes articles on blog.dynatrace.com or Medium. In his spare time you can most likely find him on one of the salsa dancefloors of the world.

    The episode was live-streamed on 15 March 2022 and the video is available at https://youtu.be/J81byOpVqrk 

    Show Notes:


    What’s SRE
    Where is SRE placed in the organization
    SRE vs. DevOps
    Good and bad SLOs
    How to define SLOs top-down
    Who owns SLO definition, monitoring, remediation
    Where is SRE within less mature organizations
    Keptn OSS project background
    Who uses and contributes to Keptn project
    What’s the CDF (Continuous Delivery Foundation)
    Creating a standard CD event format under the CDF (CDF Events SIG)
    Cloud Native Observability survey by the CNCF

    Resources:


    SLO in the age of microservices:
    Keptn OSS project: https://keptn.sh/
    Keptn 0.14.0 major release
    TechWorld with Nana on Keptn
    CD Foundation - SIG Events: https://github.com/cdfoundation/sig-events
    PurePerformance podcast
    Cloud Native Observability survey by the CNCF

    • 59 min
    Building web-scale observability at Slack, Pinterest & Twitter - OpenObservability Talks S2E09

    What does it take to build observability in a web-scale company such as Slack, Pinterest and Twitter?

    On this episode of OpenObservability Talks I'll host Suman Karumuri to hear how he built these systems from the ground up at these #BigTech companies, about his recent research papers, and more.

    Suman Karumuri is a Sr. Staff Software Engineer and the tech lead for Observability at Slack. He is an expert in distributed tracing and was a tech lead of Zipkin and a co-author of the OpenTracing standard, a Linux Foundation project via the CNCF. Previously, he spent several years building and operating petabyte-scale log search, distributed tracing and metrics systems at Pinterest, Twitter and Amazon. In his spare time, he enjoys board games, hiking and playing with his kids.

    The episode was live-streamed on 16 February 2022 and the video is available at https://youtu.be/IvidkV3TfYg 

    Show Notes:

    * Who owns observability in large organizations?

    * The gaps in the current way of handling metrics

    * MACH research paper for metrics storage engine

    * The gaps in the current way of handling logs, and Slack KalDB

    * SlackTrace - Slack's in-house tracing system

    Resources:


    Research paper: building Observability Data Management Systems
    CIDR paper: Video
    SlackTrace blog post, talk.
    Logging at Twitter
    Pintrace: A Distributed Tracing Pipeline talk by Suman at LISA
    Observability Engineering book
    Observability Trends for 2022
    Yelp engineering with Elasticsearch and Lucene

    • 58 min
    SaaS Observability Done Right - OpenObservability Talks S2E08

    SaaS (software as a service) is a popular model for many businesses today. SaaS businesses need agility to move fast and remain competitive. This means agility in the software IT stack, but also agility in the business models and product-led growth (PLG). Observability plays a key role in enabling SaaS organizations to move fast.

    Achieving this agility, however, raises specific observability requirements. On this episode of OpenObservability Talks we’ll host Aviad Mizrachi, the CTO and Co-Founder of Frontegg, to help us map these requirements. Having worked with dozens of SaaS businesses across many verticals, Aviad brings a wealth of experience in how today’s SaaS is built and operated, and will share his insights and best practices on how to design and build the observability stack right.

    Aviad has been a developer for the last 20 years. He has held several management and architecture positions at startups such as Vicon and HTS as well as at larger companies such as NICE and CheckPoint. Today at Frontegg, Aviad works closely with many customers to help them build their SaaS solutions.

    The episode was live-streamed on YouTube Live and Twitch on 11 Jan 2022 and the video is available at https://www.youtube.com/watch?v=ZcneTMeBPeg 

    Show Notes:


    What characteristics in today’s SaaS businesses dictate/influence the tech choices
    How are SaaS systems built? Tech stack and architecture
    Which observability is needed for SaaS?
    Kubernetes & infra observability
    Availability, responsiveness, low latency are critical in SaaS
    Product and business observability
    Observability has many stakeholders
    Recommended tooling for SaaS
    Correlating different data signals
    Persistence and the cost of storage
    Final tips for SaaS observability
    AWS recent outages and learnings
    Log4j recent CVEs 

    Resources:


    AWS outages and learnings: https://horovits.medium.com/retrospect-on-the-aws-outage-and-resilient-cloud-based-architecture-cc513a32747

    • 58 min
    Prometheus Pitfalls and the Rise of Continuous Profiling - OpenObservability Talks S2E07

    We’ve grown to rely on “the three pillars” for observability - logs, metrics and traces. Popular frameworks such as Prometheus have helped popularize these practices. But now people are starting to realize that it’s not enough.

    On this episode Dotan Horovits will host Frederic Branczyk for a discussion about the unspoken pitfalls of Prometheus and the challenges of current observability coverage. We will also discuss the rise of Continuous Profiling as a new observability signal, what it’s about and where it can help. We’ll also review the recent launch of Parca, an open source project for continuous profiling that traces its roots to Red Hat’s internal ConProf open source tool.

    Frederic is the founder and CEO of Polar Signals. Before founding Polar Signals he was a senior principal engineer and the main architect for all things Observability at Red Hat, which he joined through the CoreOS acquisition. Frederic is a Prometheus and Thanos maintainer as well as the tech lead for the special interest group for instrumentation in Kubernetes. In a previous life, he was a security researcher working on key management solutions as well as intrusion detection systems. When not working on software Frederic enjoys obsessing over brewing a perfect cup of coffee.

    The episode was live-streamed at https://www.twitch.tv/openobservability on 16 December 2021 and the video is available at https://www.youtube.com/watch?v=G02g63oI0IA 

    OpenObservability Talks episodes are released monthly, on the last Thursday of each month. The episodes are also live-streamed on Twitch and YouTube Live - tune in to see us live, and pitch in with your comments and questions on the live chat.

    Show Notes:


    The limitations of the three pillars model of observability
    Prometheus strengths and pitfalls
    How to start with continuous profiling
    How to correlate between different telemetry
    Parca OSS intro
    eBPF turned out perfect for instrumenting continuous profiling
    Parca OSS future plan
    How is the performance penalty of continuous profiling kept low
    What's the solution for high cardinality in Prometheus?
    Will Parca OSS be contributed to an established OSS foundation?
    Prometheus Agent mode released
    OTEL operator now has an instrumentation CR
    Continuous profiling support for interpreted languages

    Resources:


    https://www.parca.dev/
    https://github.com/google/pprof
    https://increment.com/containers/observing-containers-pillars-of-observability/
    https://ebpf.io/
    https://research.google/pubs/pub36575/

    • 1 hr 1 min
