13 FEB
17 MIN

"Intro to Large Language Models" - Andrej Karpathy's Tech Talk Learning

Andrej Karpathy's talk, "Intro to Large Language Models," demystifies LLMs by portraying them as systems with two key components:a parameters file (the weights of the neural network) anda run file (the code that runs the network). The creation of these files starts with a computationally intensive training process, where a large amount of internet text is compressed into the model's parameters. The scaling laws show that LLM performance depends on the number of parameters and the amount of training data.Karpathy reviews how LLMs are evolving to incorporate external tools and multiple modalities. He presents his view of LLMs as the kernel process of an emerging operating system and also discusses the security challenges of LLMs, including jailbreak attacks, prompt injection attacks, and data poisoning.

Episode Webpage

Show

Large Language Model (LLM) Talk
Frequency

Updated daily
Published

13 February 2025 at 07:06 UTC
Length

17 min
Rating

Clean

"Intro to Large Language Models" - Andrej Karpathy's Tech Talk Learning

Information