208 episodes

Weekly summaries and discussion about the most interesting developments in AI, deep learning, robotics, and more!

Last Week in AI Skynet Today

    • Technology

Weekly summaries and discussion about the most interesting developments in AI, deep learning, robotics, and more!

    #170 - new Sora rival, OpenAI robotics, understanding GPT4, AGI by 2027?

    #170 - new Sora rival, OpenAI robotics, understanding GPT4, AGI by 2027?

    Our 170th episode with a summary and discussion of last week's big AI news!
    With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)
    Feel free to leave us feedback here.
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + Links:
    Tools & Apps(00:03:33) KLING is the latest AI video generator that could rival OpenAI's Sora
    (00:09:16) ‘Apple Intelligence’ will automatically choose between on-device and cloud-powered AI
    (00:12:21) Udio introduces new udio-130 music generation model and more advanced features
    (00:14:38) Perplexity AI’s new feature will turn your searches into shareable pages
    (00:16:35) ElevenLabs’ AI generator makes explosions or other sound effects with just a prompt
    (00:18:37) Google’s updated AI-powered NotebookLM expands to India, UK and over 200 other countries

    Applications & Business(00:19:40) OpenAI is restarting its robotics research group
    (00:25:01) Saudi fund invests in China effort to create rival to OpenAI
    (00:29:34) UAE seeks ‘marriage’ with US over artificial intelligence deals
    (00:33:01) Zoox to test self-driving cars in Austin and Miami 
    (00:35:49) Microsoft Lays Off 1,500 Workers, Blames "AI Wave"
    (00:38:28) Avengers, assemble—Google, Intel, Microsoft, AMD and more team up to develop an interconnect standard to rival Nvidia's NVLink

    Projects & Open Source(00:40:39) GLM-4-9B-Chat-1M
    (00:46:37) Hugging Face and Pollen Robotics show off first project: an open source robot that does chores
    (00:49:40) Zyphra debuts Zyda, a 1.3T language modeling dataset it claims outperforms Pile, C4, arxiv
    (00:51:59) Stability AI debuts new Stable Audio Open for sound design

    Research & Advancements(00:54:05) Scaling and evaluating sparse autoencoders
    (01:04:54) Improving Alignment and Robustness with Short Circuiting
    (01:12:11) Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach
    (01:16:20) GPT-4 didn't ace the bar exam after all, MIT research suggests — it didn't even break the 70th percentile

    Policy & Safety(01:20:11) Former OpenAI researcher foresees AGI reality in 2027
    (01:28:03) OpenAI Insiders Warn of a ‘Reckless’ Race for Dominance
    (01:33:52) Testing and mitigating elections-related risks
    (01:36:26) Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

    Synthetic Media & Art(01:43:23) The Uncanny Rise of the World's First AI Beauty Pageant

    (01:46:25) Outro + AI Song

    • 1 hr 48 min
    #169 - Google's Search Errors, OpenAI news & DRAMA, new leaderboards

    #169 - Google's Search Errors, OpenAI news & DRAMA, new leaderboards

    Our 168th episode with a summary and discussion of last week's big AI news!
    Feel free to leave us feedback here: https://forms.gle/ngXvXZpNJxaAprDv6
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + Links:
    (00:00:00) Intro / Banter
    (00:02:55) Response to listener comments / corrections
    Tools & Apps
    (00:04:33) Google’s A.I. Search Errors Cause a Furor Online
    (00:10:56) Telegram gets an in-app Copilot bot
    (00:13:13) Opera is adding Google's Gemini AI to its browser
    (0016:13) Amazon plans to give Alexa an AI overhaul — and a monthly subscription price
    (00:19:15) Microsoft Edge will translate and dub YouTube videos as you’re watching them
    (00:21:12) Iyo thinks its gen AI earbuds can succeed where Humane and Rabbit stumbled

    Applications & Business
    (00:24:57) PwC agrees deal to become OpenAI's first reseller and largest enterprise user
    (00:30:07) Vox Media and The Atlantic sign content deals with OpenAI
    (00:36:27) OpenAI launches programs making ChatGPT cheaper for schools and nonprofits
    (00:40:03) Huawei patent reveals 3nm-class process technology plans — China continues to move forward despite US sanctions
    (00:44:32) Nvidia, Powered by A.I. Boom, Reports Soaring Revenue and Profits
    (00:48:16) Elon Musk’s xAI raises $6 billion in latest funding round

    Projects & Open Source
    (00:51:13) Scale AI publishes its first LLM Leaderboards, ranking AI model performance in specific domains
    (00:56:04) Cohere For AI Launches Aya 23, 8 and 35 Billion Parameter Open Weights Release
    (01:00:45) Who will make AlphaFold3 open source? Scientists race to crack AI model
    (01:04:07) Mistral releases Codestral, its first generative AI model for code

    Research & Advancements
    (01:09:23) The Road Less Scheduled
    (01:14:10) Training Compute of Frontier AI Models Grows by 4-5x per Year
    (01:21:33) gzip Predicts Data-dependent Scaling Laws
    (01:25:51) Neural Scaling Laws for Embodied AI
    (01:28:47) Contextual Position Encoding: Learning to Count What’s Important
    (01:33:09) New AI products much hyped but not much used, study says

    Policy & Safety
    (01:37:00) Ex-OpenAI board member reveals what led to Sam Altman's brief ousting
    (01:46:36) OpenAI researcher who resigned over safety concerns joins Anthropic
    (01:49:16) Leaked OpenAI Documents Show Sam Altman Was Clearly Aware of Silencing Former Employees
    (01:54:33) OpenAI Board Forms Safety and Security Committee
    (01:58:07) Robocaller Who Used AI to Clone Biden’s Voice Fined $6 Million
    (01:59:08) Hacker Releases Jailbroken "Godmode" Version of ChatGPT
    (02:00:46) China Creates $47.5 Billion Chip Fund to Back Nation’s Firms

    Synthetic Media & Art
    (02:02:23) Alphabet, Meta Offer Millions to Partner With Hollywood on AI

    (02:04:21) Outro + AI Song

    • 2 hrs 6 min
    #168 - OpenAI vs Scar Jo + safety researchers, MS AI updates, cool Anthropic research

    #168 - OpenAI vs Scar Jo + safety researchers, MS AI updates, cool Anthropic research

    Our 168th episode with a summary and discussion of last week's big AI news!
    With guest host Gavin Purcell from AI for Humans podcast!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + Links:
    (00:00:00) Intro / Banter + Response to listener comments / corrections
    Tools & Apps
    (00:08:00) OpenAI says Sky voice in ChatGPT will be paused after concerns it sounds too much like Scarlett Johansson
    (00:16:14) Microsoft’s Copilot assistant is getting a GPT-4o upgrade + Recall is Microsoft’s key to unlocking the future of PCs
    (00:21:36) ElevenLabs Launches AI-Voiced Screen Reader App
    (00:22:40) Adobe Lightroom gets a magic eraser, and it’s impressive
    (00:25:07) Microsoft, Khan Academy provide free AI assistant for all educators in US
    (00:27:40) Microsoft Paint is getting an AI-powered image generator that responds to your text prompts and doodles

    Applications & Business
    (00:29:16) OpenAI founders Sam Altman and Greg Brockman go on the defensive after top safety researchers quit
    (00:36:58) OpenAI, WSJ Owner News Corp Strike Content Deal Valued at Over $250 Million
    (00:41:27) CoreWeave Raises $7.5 Billion in Debt for AI Computing Push
    (00:44:13) Google announced Trillium, its sixth generation of Tensor processors.
    (00:45:09) Inflection AI reveals new team and plan to embed emotional AI in business bots
    (00:47:01) Data-labeling startup Scale AI raises $1B as valuation doubles to $13.8B

    Projects & Open Source
    (00:48:35) Abacus AI Releases Smaug-Llama-3-70B-Instruct: The New Benchmark in Open-Source Conversational AI Rivaling GPT-4 Turbo
    (00:52:24) Introducing New Chatbot Arena Category: Hard Prompts
    (00:54:56) Microsoft brings out a small language model that can look at pictures

    Research & Advancements
    (00:56:05) New Anthropic Research Sheds Light on AI's 'Black Box'
    (01:04:03) Chameleon: Mixed-Modal Early-Fusion Foundation Models
    (01:08:14) SpeechVerse: A Large-scale Generalizable Audio Language Model
    (01:09:05) CAT3D: Create Anything in 3D with Multi-View Diffusion Models
    (01:11:17) Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning
    (01:12:10) SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

    Policy & Safety
    (01:15:01) World’s first major law for artificial intelligence gets final EU green light
    (01:17:18) Colorado governor signs sweeping AI regulation bill
    (01:22:10) Senators Propose $32 Billion in Annual A.I. Spending but Defer Regulation
    (01:23:25) Google DeepMind launches new framework to assess the dangers of AI models
    (01:25:05) Tech giants pledge AI safety commitments — including a ‘kill switch’ if they can’t mitigate risks

    Synthetic Media & Art
    (01:28:32) Sony Music warns tech companies over ‘unauthorized’ use of its content to train AI
    (01:32:34) Hollywood agency CAA aims to help stars manage their own AI likenesses
    (01:38:28) What Do You Do When A.I. Takes Your Voice?

    (01:42:01) Outro + AI Song

    • 1 hr 44 min
    #167 - GPT-4o, Project Astra, Veo, OpenAI Departures, Interview with Andrey

    #167 - GPT-4o, Project Astra, Veo, OpenAI Departures, Interview with Andrey

    Our 167th episode with a summary and discussion of last week's big AI news!
    With guest host Daliana Liu (https://www.linkedin.com/in/dalianaliu/) from The Data Scientist Show!
    And a special one-time interview with Andrey in the latter part of the podcast.
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Intro / Banter
    Tools & Apps
    (00:03:42) OpenAI releases GPT-4o, a faster model that’s free for all ChatGPT users
    (00:12:06) Project Astra is the future of AI at Google
    (00:18:06) Google is redesigning its search engine — and it’s AI all the way down
    (00:19:39) Google unveils Veo and Imagen 3, its latest AI media creation models
    (00:23:36) Google Unveils Music AI Sandbox Making Loops From Prompts
    (00:26:27) Anthropic AI Launches a Prompt Engineering Tool that Generates Production-Ready Prompts in the Anthropic Console

    Applications & Business
    (00:31:02) OpenAI’s Chief Scientist and Co-Founder Is Leaving the Company
    (00:35:15) Mike Krieger joins Anthropic as Chief Product Officer
    (00:36:28) $16k G1 humanoid rises up to smash nuts, twist and twirl
    (00:41:02) GM's Cruise to start testing robotaxis in Phoenix area with human safety drivers on board
    (00:42:52) US agency probes Amazon-owned Zoox self-driving vehicles after two crashes
    (00:43:58) Waymo’s robotaxis under investigation after crashes and traffic mishaps

    Projects & Open Source
    (00:44:48) Introducing PaliGemma, Gemma 2, and an Upgraded Responsible AI Toolkit
    (00:46:24) Falcon 2: UAE’s Technology Innovation Institute Releases New AI Model Series, Outperforming Meta’s New Llama 3
    (00:48:00) License to Call: Introducing Transformers Agents 2.0

    Research & Advancements
    (00:49:22) The Platonic Representation Hypothesis
    (00:53:08) SUTRA: Scalable Multilingual Language Model Architecture

    Policy & Safety
    (00:54:46) Bipartisan Senate bill on AI security would bolster voluntary cyber reporting processes
    (00:56:17) U.K. agency releases tools to test AI model safety
    (00:57:25) Protesters Are Fighting to Stop AI, but They’re Split on How to Do It

    Synthetic Media & Art
    (00:58:54) Google’s invisible AI watermark will help identify generative text and video
    (01:00:50) How One Author Pushed the Limits of AI Copyright
    (01:03:27) Stellaris gets an DLC about AI that features AI-created voices, director insists it's 'ethical' and 'we're pretty good at exploring dystopian sci-fi and don't want to end up there ourselves'
    (01:04:46) At the AI Film Festival, humanity triumphed over tech

    (01:06:37) Daliana Interviews Andrey
    (01:42:00) AI Outro Song

    • 1 hr 43 min
    #166 - new AI song generator, Microsoft's GPT4 efforts, AlphaFold3, xLSTM, OpenAI Model Spec

    #166 - new AI song generator, Microsoft's GPT4 efforts, AlphaFold3, xLSTM, OpenAI Model Spec

    Our 166th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    (00:00:00) Intro / Banter
    Tools & Apps(00:04:23) ElevenLabs previews music-generating AI model
    (00:09:31) Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts
    (00:13:00) SoundHound AI and Perplexity Partner to Bring Online LLMs to Next Gen Voice Assistants Across Cars and IoT Devices
    (00:14:50) Stability AI sows gen AI discord with Stable Artisan
    (00:16:35) Apple Will Revamp Siri to Catch Up to Its Chatbot Competitors
    (00:18:54) Alibaba rolls out latest version of its large language model to meet robust AI demand

    Applications & Business(00:19:34) OpenAI and Stack Overflow partner to bring more technical knowledge into ChatGPT
    (00:17:31) New Microsoft AI model may challenge GPT-4 and Google Gemini
    (00:31:08) Wayve, an A.I. Start-Up for Autonomous Driving, Raises $1 Billion
    (00:32:00) Motional delays commercial robotaxi plans amid restructuring
    (00:33:54) The rise of the Chinese AI unicorns doing battle with OpenAI

    Projects & Open Source(00:35:25) Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
    (00:40:12) DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
    (00:44:31) OpenVoice V2: Evolving Multilingual Voice Cloning with Enhanced Style Control and Cross-Lingual Capabilities
    (00:45:20) Granite Code Models: A Family of Open Foundation Models for Code Intelligence
    (00:46:00) Hugging Face launches LeRobot open source robotics code library
    (00:48:50) Vibe-Eval: A new open and hard evaluation suite for measuring progress of multimodal language models

    Research & Advancements(00:50:02) Google DeepMind’s Groundbreaking AI for Protein Structure Can Now Model DNA
    (00:57:20) xLSTM: Extended Long Short-Term Memory
    (01:06:35) StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
    (01:07:55) Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
    (01:11:48) KAN: Kolmogorov-Arnold Networks

    Policy & Safety(01:13:20) US lawmakers unveil bill to make it easier to restrict exports of AI models
    (01:17:30) OpenAI’s Model Spec outlines some basic rules for AI
    (01:20:18) Robot dogs armed with AI-targeting rifles undergo US Marines Special Ops evaluation
    (01:25:15) OpenAI Releases ‘Deepfake’ Detector to Disinformation Researchers

    Synthetic Media & Art(01:28:15) Audible’s Test of AI-Voiced Audiobooks Tops 40,000 Titles
    (01:32:30) TikTok will automatically label AI-generated content created on platforms like DALL·E 3
    (01:33:23) Katy Perry's Fan-Made AI Image Is So Real It Fooled the World Into Thinking She Was at the Met Gala
    (01:35:32) South Korean woman falls for deepfake Elon Musk, loses $50K in romance scam 
    (01:37:18) Why young Russian women appear so eager to marry Chinese men

    (01:40:18) AI Outro Song

    • 1 hr 41 min
    #165 - Sora challenger, Astribot's S1, Med-Gemini, Refusal in LLMs

    #165 - Sora challenger, Astribot's S1, Med-Gemini, Refusal in LLMs

    Our 165th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps(00:01:27) GitHub releases an AI-powered tool aiming for a 'radically new way of building software'
    (00:07:05) China unveils Sora challenger able to produce videos from text similar to OpenAI tool, though much shorter
    (00:12:23) ChatGPT’s AI ‘memory’ can remember the preferences of paying customers
    (00:14:21) Rabbit R1 review: Avoid this AI gadget
    (00:18:30) Amazon Q, a generative AI-powered assistant for businesses and developers, is now generally available
    (00:19:54) Yelp’s Assistant AI bot will do all the talking to help users find service providers

    Applications & Business(00:21:31) Video of super-fast, super-smooth humanoid robot will drop your jaw
    (00:25:22) Tesla’s 2 million car Autopilot recall is now under federal scrutiny
    (00:29:32) Tesla shares soar as Elon Musk returns from China with FSD 'Game Changer'
    (00:32:11) OpenAI inks strategic tie-up with UK’s Financial Times, including content use
    (00:35:21) OpenAI Startup Fund quietly raises $15M
    (00:37:00) Huawei backs HBM memory manufacturing in China to sidestep crippling US sanctions that restrict AI development

    Research & Advancements(00:39:20) Capabilities of Gemini Models in Medicine
    (00:45:34) Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
    (00:52:20) NExT: Teaching Large Language Models to Reason about Code Execution
    (00:55:08) SenseNova 5.0: China’s latest AI model surpasses OpenAI’s GPT-4
    (00:57:20) Octopus v4: Graph of language models
    (01:00:28) Better & Faster Large Language Models via Multi-token Prediction

    Policy & Safety(01:03:15) Refusal in LLMs is mediated by a single direction
    (01:09:19) Rishi Sunak promised to make AI safe. Big Tech’s not playing ball.
    (01:15:09) DOE Announces New Actions to Enhance America’s Global Leadership in Artificial Intelligence
    (01:18:21) The Chips Act is rebuilding US semiconductor manufacturing, so far resulting in $327 billion in announced projects
    (01:20:50) Analysis-Second global AI safety summit faces tough questions, lower turnout
    (01:24:03) Sam Altman, Jensen Huang, and more join the federal AI safety board

    Synthetic Media & Art(01:26:30) Air Head creators say OpenAI's Sora finicky to work with, needs hundreds of prompts, serious VFX work for under 2 minutes of cohesive story ↺
    (01:29:50) Eight newspaper publishers sue OpenAI over copyright infringement

    • 1 hr 32 min

Top Podcasts In Technology

Tehnična podpora
RTVSLO – Val 202
Lex Fridman Podcast
Lex Fridman
Search Engine
PJ Vogt, Audacy, Jigsaw
TED Radio Hour
NPR
The Neuron: AI Explained
The Neuron
Darknet Diaries
Jack Rhysider

You Might Also Like

This Day in AI Podcast
Michael Sharkey, Chris Sharkey
Practical AI: Machine Learning, Data Science
Changelog Media
The AI Podcast
NVIDIA
The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis
Nathaniel Whittemore
Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Sam Charrington