136 episodes

The leading podcast on how to build a successful open source company.

Learn from the founders of HashiCorp, Chronosphere, Vercel, MongoDB, DBT, mobile.dev and more!

Open Source Startup Podcast Robby (Cowboy VC) & Tim (Essence VC)

    • Technology

The leading podcast on how to build a successful open source company.

Learn from the founders of HashiCorp, Chronosphere, Vercel, MongoDB, DBT, mobile.dev and more!

    E136: Creating the Vector Database for AI Application Developers

    E136: Creating the Vector Database for AI Application Developers

    Jeff Huber is Co-Founder of Chroma, the open source vector database. Their open source project, also called chroma, has 13K stars on GitHub.

    Chroma has raised $20M from investors including Quiet Ventures and Bloomberg Beta.

    In this episode, we dig into why vector databases are important for AI applications & why AI workloads are different, how their partnership with LangChain helped with early growth, why data is really the only tool a user has to change modern AI's behavior & more!

    • 39 min
    E135: Riding the Homebrew Wave

    E135: Riding the Homebrew Wave

    John Britton & Mike McQuaid are Co-Founders of Workbrew, the company that provides additional features and support for companies using Homebrew. Homebrew's main project, brew, is a wildly popular open source project with 40K GitHub stars and provides the missing package manager for macOS (or Linux).

    In this episode, we dig into John & Mike's history with Homebrew and their time together at GitHub, how Homebrew has kept projects simple over time and avoided feature creep, how Homebrew has managed to get a lot of value from contributors, how their ICP has shifted from mac admins to dev and security teams & more!

    • 42 min
    E134: Making Complex Data RAG-Ready with Unstructured

    E134: Making Complex Data RAG-Ready with Unstructured

    Brian Raymond is Founder & CEO of Unstructured, the platform to extract and transform complex data for use with every major vector database and LLM framework. Their open source project has 7K stars on GitHub and includes libraries and APIs that let users build custom preprocessing pipelines for labeling, training, and production machine learning pipelines. Today, they have over 6M downloads and 50K companies using their tools.

    Unstructured has raised $65M from investors including Bain, Essence VC, and Menlo Ventures.

    In this episode, we dig into Brian's process of talking to 100 data scientists before launching Unstructured, why the long tail of data matters for LLMs, competing with their own open source, why being a "boring company" is valuable for today's LLM stack, why they liked having government design partners, why world-class design & marketing are huge differentiators for open source companies & more!

    • 37 min
    E133: Reinventing Authorization with Google's Zanzibar Paper

    E133: Reinventing Authorization with Google's Zanzibar Paper

    Jake Moshenko is Co-Founder & CEO of AuthZed, the scalable authorization platform based on Google's Zanzibar white paper. Their open source permissions database spiceDB has 5K stars on GitHub and enables fine-grained access control for customer applications.

    AuthZed has raised $4M from investors including Work-Bench and Amplify.

    In this episode, we dig into the Zanzibar approach to auth, branding themselves as a database, building for big companies from the get-go, their Hacker News launch and how getting on the front page kickstarted their project's growth, monetizing early & more!

    • 39 min
    E132: From General Purpose to Specialized Databases

    E132: From General Purpose to Specialized Databases

    Joran Dirk Greef is Founder & CEO of TigerBeetle, the open source financial transactions database. Their project, also called tigerbeetle, has over 7K stars and is a database designed for mission-critical workloads and performance.

    TigerBeetle has raised $6M from investors including Amplify.

    In this episode, we discuss why general purpose databases don't scale for high volume transactional workloads - and the need for specialized databases generally, open source vs. source available, the enterprise commercial stack of management, monitoring, security, and identity, their unique take on monetization & more!

    • 40 min
    E131: Why the Next Generation of Time Series Databases Will Be Multimodal

    E131: Why the Next Generation of Time Series Databases Will Be Multimodal

    Niko West is Co-Founder & CEO of Rerun, the open source visualization engine for streams of multimodal data.

    Rerun has raised over $3M from investors including Costanoa.

    In this episode, we discuss how Rerun found early success in gaming, why building in Rust was important, how open source expanded the segments Rerun could serve, why they thought about monetization early, the importance of visual and video content & more!

    • 34 min

Top Podcasts In Technology

Blood, Sweat & CPMs
Freestar
FT Tech Tonic
Financial Times
Latent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and al
Alessio + swyx
Security Weekly Podcast Network (Audio)
Security Weekly Productions
Faces of Digital Health
Tjasa Zajc
That IT show
thatitshow

You Might Also Like

Kubernetes Podcast from Google
Abdel Sghiouar, Kaslin Fields
Latent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and al
Alessio + swyx
Software Engineering Radio - the podcast for professional software developers
se-radio@computer.org
Thoughtworks Technology Podcast
Thoughtworks
Software Engineering Daily
Software Engineering Daily
No Priors: Artificial Intelligence | Technology | Startups
Conviction | Pod People