98 episodes

Technical interviews about software topics.

Data – Software Engineering Daily

    • Tech News

    Infrastructure Management with Joey Parsons

    At Airbnb, infrastructure management is standardized across the organization. Platform engineering teams build tools that allow the other teams throughout the organization to work more effectively. A platform engineering team handles problems such as continuous integration, observability, and service discovery. Other teams throughout a company use the tools that a platform engineering team builds. For

    • 1h 14 min
    Data Infrastructure Investing with Eric Anderson

    In a modern data platform, distributed streaming systems are used to read data coming off of an application in real time. There is a wide variety of streaming systems, including Kafka Streams, Apache Samza, Apache Flink, Spark Streaming, and more. When Eric Anderson joined the show back in 2016, he was working at Google on Google

    • 1h 12 min
    Materialize: Streaming SQL on Timely Data with Arjun Narayan and Frank McSherry

    Distributed stream processing frameworks are used to rapidly ingest and aggregate large volumes of incoming data. These frameworks often require the application developer to write imperative logic describing how that data should be processed.  For example, a high volume of clickstream data that is getting buffered to Kafka needs to have a stream processing system

    • 1h 11 min
    Great Expectations: Data Pipeline Testing with Abe Gong

    A data pipeline is a series of steps that takes large data sets and creates usable results from them. At the beginning of a data pipeline, a data set might be pulled from a database, a distributed file system, or a Kafka topic. Throughout a data pipeline, different data sets are joined, filtered, and statistically

    • 1h 8 min
    Data Warehouse ETL with Matthew Scullion

    A data warehouse provides low-latency access to large volumes of data. A data warehouse is a crucial piece of infrastructure for a large company, because it can be used to answer complex questions involving a large number of data points. But a data warehouse usually cannot hold all of a company’s data at any

    • 57 min
    Flink and BEAM Stream Processing with Maximilian Michels

    Distributed stream processing systems are used to read large volumes of data and perform operations across those data streams.  These stream processing systems often build off of the MapReduce algorithm for collecting and aggregating large volumes of data, but instead of processing a calculation over a single large batch of data, they process data on

    • 51 min
