Explain The Explainables

Explain Large Language Models: History, Attention, Transformers, ChatGPT, Deepseek and Explainability

Join us for a groundbreaking episode of Explain the Explainables by your favorite podcast host, Fatih Bildirici PhD(c), unbound where we explore the fascinating world of Large Language Models, starting with the revolutionary 2017 paper "Attention Is All You Need." From its Beatles-inspired title to becoming the foundation of modern AI with over 160,000 citations, we'll uncover how this breakthrough transformed machine learning forever.

In this episode, we dive deep into the history, philosophy and architecture that powers tools like ChatGPT and modern AI systems. Through engaging storytelling and clear examples, we'll explore how machines learned to understand human language, the significance of the Transformer architecture, and what it means for our future.

Whether you're a tech enthusiast or simply curious about AI, this episode offers an accessible journey into one of the most transformative technologies of our time. We'll also feature insights from explainability side.

Don't miss this illuminating exploration of how a single paper revolutionized artificial intelligence and set the stage for the AI revolution we're experiencing today.

Recommended resources:

  • Mywebsite: https://fbildirici.github.io
  • Andrej Karpathy's Intro to LLMs: https://www.youtube.com/watch?v=zjkBMFhNj_g
  • Attention is All You Need: https://arxiv.org/abs/1706.03762
  • Explainability of LLMs: https://dl.acm.org/doi/10.1145/3639372