11 FEB
T1, E6
23 MIN

Machine Learning Engineering: Training vs. Inference

As iOS engineers, we respect the science of training models, but we live in the trenches of inference. In this episode, we explore the "Great Divide" between creating a model and running it. We break down the memory mechanics of O(N) training vs O(1) inference, dissect compiler optimizations like kernel fusion, and explain exactly how the Apple Neural Engine cheats bandwidth physics using quantization.

Sitio web del episodio

Programa

Sandboxed - On-Device AI for iOS
Frecuencia

Cada semana
Publicado

11 de febrero de 2026, 6:00 a.m. UTC
Duración

23 min
Temporada

1
Episodio

6
Clasificación

Apto

Machine Learning Engineering: Training vs. Inference

Información