100 episodes

Technical interviews about software topics.

Data – Software Engineering Daily Data – Software Engineering Daily

    • News
    • 5.0 • 4 Ratings

Technical interviews about software topics.

    Faking Data Using Tonic.ai with Ian Coe and Adam Kamor

    Faking Data Using Tonic.ai with Ian Coe and Adam Kamor

    Ian Coe CEO Adam Kamor Head of Engineering Companies that gather data about their users have an ethical obligation and legal responsibility to protect the personally identifiable information in their dataset.  Ideally, developers working on a software application wouldn’t need access to production data. Yet without high-quality example data, many technology groups stumble on avoidable

    • 41 min
    Lakehouse Data Stack with Raj Bains

    Lakehouse Data Stack with Raj Bains

    As companies move to Spark and a Lakehouse architecture, they are realizing that the data tools are lagging way behind.  You need to be a programmer to effectively use Spark and Airflow. There are some low-code ETL tools, but is that enough?  Companies want to treat their data pipelines like mission-critical apps.  They want DevOps

    • 1 hr
    RudderStack Engineering with Soumaydeb Mitra

    RudderStack Engineering with Soumaydeb Mitra

    Customer data pipelines power the backend of many successful web platforms. In a customer data pipeline, data is collected from sources such as mobile apps and cloud SaaS tools, transformed and munged using data engineering, stored in data warehouses, and piped to analytics, advertising platforms, and data infrastructure. RudderStack is an open source customer data

    • 52 min
    Apache Hudi with Vinoth Chandar

    Apache Hudi with Vinoth Chandar

    The data lake architecture has become broadly adopted in a relatively short period of time.  In a nutshell, that means data in it’s raw format stored in cloud object storage.  Modern software and data engineers have no shortage of options for accessing their data lake, but that list shrinks quickly if you care about features

    • 43 min
    Couchbase Architecture with Ravi Mayuram

    Couchbase Architecture with Ravi Mayuram

    Couchbase is a distributed NoSQL cloud database. Since its creation, Couchbase has expanded into edge computing, application services, and most recently a database-as-a-service called Capella.  Couchbase started as an in-memory cache and needed to be rearchitected to be a persistent storage system. In this episode, I interview Ravi Mayuram, SVP Products and Engineering at Couchbase

    • 58 min
    Trifacta with Joe Hellerstein

    Trifacta with Joe Hellerstein

    If you haven’t encountered a data quality problem, then you haven’t yet worked on a large enough project.  Invariably, a gap exists between the state of raw data and what an analyst or machine learning engineer needs to solve their problem.  Many organizations needing to automate data preparation workflows look to Trifacta as a solution. 

    • 41 min

Customer Reviews

5.0 out of 5
4 Ratings

4 Ratings

Top Podcasts In News

The New York Times
New York Times Opinion
Chris Cuomo
CBC Podcasts
NPR
BBC World Service

You Might Also Like

Tobias Macey
Software Engineering Daily
Michael Kennedy (@mkennedy)
Real Python
Kyle Polich
Jon Krohn and Guests on Machine Learning, A.I., and Data-Career Success