The New Claude 3.5 Sonnet: Better, Yes, But Not Just in the Way You Might Think
A new state of the art LLM (at least for creative writing and basic reasoning) but what lies behind the numbers that were put out? Is it for real, and are AI agents about to grab your mouse and shake your cursor?
Plus, results on my own Simple Bench, and new tools from Runway (Act-One), HeyGen (Zoom Calls) and an updated NotebookLM. AI, without the hype.
Weights and Biases' Weave: https://wandb.me/ai_explained
Information
- Show
- FrequencyUpdated Weekly
- PublishedOctober 28, 2024 at 1:00 PM UTC
- Length23 min
- Season1
- Episode1
- RatingClean