AI Bites: The Academic Series

Jack Lakkapragada

Welcome to AI Bites. This podcast features AI-generated deep dives into the world’s most prestigious computer science curricula. Based on personal study notes and publicly available course material from Stanford University (CS124, CS221, and more), these episodes use Google’s NotebookLM to transform dense academic topics into conversational summaries. Perfect for learning on the go, whether you're commuting or at the gym. Disclaimer: This is an independent, AI-generated study resource and is not officially affiliated with Stanford University.

  1. 1d ago

    EP 48 | CS224N: RAG and Language Agents

    Up until now, we’ve looked at Language Models as isolated brains trapped in a box. In this episode, we cross the threshold into the absolute bleeding edge of AI: giving models a search engine to browse the web, memory to remember past conversations, and tools to execute code. We break down the inner workings of Retrieval-Augmented Generation (RAG) and the anatomy of truly autonomous Language Agents. Key Topics: The Knowledge Problem & RAG: Why forcing LLMs to memorize everything leads to hallucinations, how the Retriever-Reader framework (DPR vs. BM25) fixes it, and why stuffing too many documents into a model triggers the "Lost in the Middle" problem. The Anatomy of an Agent: How we transform a standard text-predictor into an active agent using a core LLM surrounded by an external environment, reasoning protocols, memory structures, and tools. Reasoning & Planning (ReAct vs. Reflexion): Unpacking the massive breakthrough of the ReAct (Reason + Act) framework, and how self-correction loops and multi-agent debates drastically reduce AI hallucinations. The Cognitive Architecture (Memory & Tool Use): Distinguishing between Episodic, Semantic, and Procedural memory (including how MemGPT acts like an Operating System). Plus, how models like Toolformer teach themselves to use external APIs. The Python "While True" Loop: Demystifying the engineering behind agents by looking at the simple code loops that power them, and the massive challenges the industry faces in trying to evaluate open-ended AI behavior. Note: This is an AI-generated discussion created using Google's NotebookLM, based on publicly available Stanford University course material (specifically CS224N) and personal study notes from my learning journey.

    22 min
  2. Jun 11

    EP 46 | CS224N: Post-training

    How do we turn a raw, chaotic text-predictor into a helpful, conversational AI assistant? In this episode, we dive into the massive pipeline of Post-training. We explore the transition from Instruction Fine-Tuning to complex Reinforcement Learning, and why teaching an AI to be "helpful" sometimes inadvertently teaches it to lie. Key Topics: The Alignment Problem: Why a raw foundational model is just a "document completer" and how Instruction Fine-Tuning (IFT) begins the process of teaching it to follow user commands. RLHF & Reward Models: How we use pairwise human comparisons to train a Reward Model, and how PPO is used to optimize the AI's behavior without breaking its grammar. Reward Hacking & Hallucinations: The dark side of RLHF. We explore why heavily incentivizing models to sound authoritative leads to massive real-world failures, like Bing's sports hallucinations and Google Bard's $100 Billion stock drop. The DPO Breakthrough: How researchers removed the unstable reinforcement learning step entirely with Direct Preference Optimization, creating the new open-source standard. Ethical Realities: A candid look at the human cost of AI alignment, from low-wage "digital sweatshops" to the severe annotator biases that bleed directly into modern models. Note: This is an AI-generated discussion created using Google's NotebookLM, based on publicly available Stanford University course material (specifically CS224N) and personal study notes from my learning journey.

    22 min
  3. May 29

    EP 43 | CS224N: Language Models and RNNs

    We are continuing our journey through Stanford's CS224N by exploring the absolute foundation of modern natural language processing. In this episode, we break down Language Models and Recurrent Neural Networks (RNNs), unpacking how the simple task of predicting the next word ultimately taught machines to learn facts, logic, and arithmetic. Key Topics: Language Modeling & n-grams: The core concept of next-word prediction and why the pre-deep learning era of statistical n-gram models ultimately failed due to sparsity, storage bloat, and "goldfish memory." The RNN Breakthrough: How the industry moved past fixed-window models to Recurrent Neural Networks, allowing machines to process sequences of any length by reusing the exact same weight matrix at every time step. Exploding & Vanishing Gradients: The mathematical hurdles that broke early RNNs. We explore why taking massive SGD steps (exploding) or forgetting long-distance dependencies (vanishing) required fixes like gradient clipping and LSTMs. Neural Machine Translation (NMT): A look at the Sequence-to-Sequence (Seq2Seq) Encoder-Decoder architecture that revolutionized machine translation between 2014 and 2016—and the massive "Bottleneck Problem" it created for future engineers to solve. Note: This is an AI-generated discussion created using Google's NotebookLM, based on publicly available Stanford University course material (specifically CS224N) and personal study notes from my learning journey.

    9 min

About

Welcome to AI Bites. This podcast features AI-generated deep dives into the world’s most prestigious computer science curricula. Based on personal study notes and publicly available course material from Stanford University (CS124, CS221, and more), these episodes use Google’s NotebookLM to transform dense academic topics into conversational summaries. Perfect for learning on the go, whether you're commuting or at the gym. Disclaimer: This is an independent, AI-generated study resource and is not officially affiliated with Stanford University.