Making AI Work: Fine-Tuning, Inference, Memory | Sharon Zhou, CEO, Lamini

The MAD Podcast with Matt Turck

In this episode, we reconnect with Sharon Zhou, co-founder and CEO of Lamini, to dive deep into the ever-evolving world of enterprise AI.

We discuss how the AI hype is evolving and what enterprises are doing to stay ahead, break down the different players in the Inference market, explore how Memory Tuning is reducing hallucinations in AI models, the role of agents in enterprise AI, and the challenges of making them real-time and reliable.

Lamini

Website - https://www.lamini.ai

Twitter - https://x.com/laminiai

Sharon Zhou

LinkedIn - https://www.linkedin.com/in/zhousharon

Twitter - https://x.com/realsharonzhou

FIRSTMARK

Website - https://firstmark.com

Twitter - https://twitter.com/FirstMarkCap

Matt Turck (Managing Director)

LinkedIn - https://www.linkedin.com/in/turck/

Twitter - https://twitter.com/mattturck

(00:00) Intro

(02:18) The state of the AI market in July, 2024

(10:51) What is Lamini?

(11:43) What is Inference?

(15:36) GPU shortage in the enterprise

(18:06) AMD vs Nvidia

(22:10) What is Lamini's final product?

(25:30) What is Memory Tuning?

(29:01) What is LoRA?

(32:39) More on Memory Tuning

(35:51) Sharon's perspective on AI agents

(40:01) What is next for Lamini?

(41:54) Reasoning vs pure compute in AI

Para ouvir episódios explícitos, inicie sessão.

Fique por dentro deste podcast

Inicie sessão ou crie uma conta para seguir podcasts, salvar episódios e receber as atualizações mais recentes.

Selecionar um país ou região

África, Oriente Médio e Índia

Ásia‑Pacífico

Europa

América Latina e Caribe

Estados Unidos e Canadá