As iOS engineers, we respect the science of training models, but we live in the trenches of inference. In this episode, we explore the "Great Divide" between creating a model and running it. We break down the memory mechanics of O(N) training vs O(1) inference, dissect compiler optimizations like kernel fusion, and explain exactly how the Apple Neural Engine cheats bandwidth physics using quantization.
Information
- Show
- Frequency: Weekly
- Published: February 11, 2026, 6:00 a.m. UTC
- Duration: 23 min
- Season: 1
- Episode: 6
- Rating: Clean
