OpenAI recently released its first open-weights model since GPT-2, entering a field led by DeepSeek and Alibaba's Qwen. YC's Ankit Gupta breaks down everything you need to know about these top OSS models, including what sets them apart under the hood. He'll compare their approaches to mixture-of-experts, long-context training, and post-training techniques that shape reasoning and alignment, and explore how different design choices lead to surprisingly similar performance.
Information
- Show
- Channel
- Frequency: Updated weekly
- Published: August 29, 2025, 2:07 PM [UTC]
- Length: 13 minutes
- Rating: Clean