205 episodes

Weekly summaries and discussion about the most interesting developments in AI, deep learning, robotics, and more!

Last Week in AI Skynet Today

    • Technology

Weekly summaries and discussion about the most interesting developments in AI, deep learning, robotics, and more!

    #167 - GPT-4o, Project Astra, Veo, OpenAI Departures, Interview with Andrey

    #167 - GPT-4o, Project Astra, Veo, OpenAI Departures, Interview with Andrey

    Our 167th episode with a summary and discussion of last week's big AI news!
    With guest host Daliana Liu (https://www.linkedin.com/in/dalianaliu/) from The Data Scientist Show!
    And a special one-time interview with Andrey in the latter part of the podcast.
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Intro / Banter
    Tools & Apps
    (00:03:42) OpenAI releases GPT-4o, a faster model that’s free for all ChatGPT users
    (00:12:06) Project Astra is the future of AI at Google
    (00:18:06) Google is redesigning its search engine — and it’s AI all the way down
    (00:19:39) Google unveils Veo and Imagen 3, its latest AI media creation models
    (00:23:36) Google Unveils Music AI Sandbox Making Loops From Prompts
    (00:26:27) Anthropic AI Launches a Prompt Engineering Tool that Generates Production-Ready Prompts in the Anthropic Console

    Applications & Business
    (00:31:02) OpenAI’s Chief Scientist and Co-Founder Is Leaving the Company
    (00:35:15) Mike Krieger joins Anthropic as Chief Product Officer
    (00:36:28) $16k G1 humanoid rises up to smash nuts, twist and twirl
    (00:41:02) GM's Cruise to start testing robotaxis in Phoenix area with human safety drivers on board
    (00:42:52) US agency probes Amazon-owned Zoox self-driving vehicles after two crashes
    (00:43:58) Waymo’s robotaxis under investigation after crashes and traffic mishaps

    Projects & Open Source
    (00:44:48) Introducing PaliGemma, Gemma 2, and an Upgraded Responsible AI Toolkit
    (00:46:24) Falcon 2: UAE’s Technology Innovation Institute Releases New AI Model Series, Outperforming Meta’s New Llama 3
    (00:48:00) License to Call: Introducing Transformers Agents 2.0

    Research & Advancements
    (00:49:22) The Platonic Representation Hypothesis
    (00:53:08) SUTRA: Scalable Multilingual Language Model Architecture

    Policy & Safety
    (00:54:46) Bipartisan Senate bill on AI security would bolster voluntary cyber reporting processes
    (00:56:17) U.K. agency releases tools to test AI model safety
    (00:57:25) Protesters Are Fighting to Stop AI, but They’re Split on How to Do It

    Synthetic Media & Art
    (00:58:54) Google’s invisible AI watermark will help identify generative text and video
    (01:00:50) How One Author Pushed the Limits of AI Copyright
    (01:03:27) Stellaris gets an DLC about AI that features AI-created voices, director insists it's 'ethical' and 'we're pretty good at exploring dystopian sci-fi and don't want to end up there ourselves'
    (01:04:46) At the AI Film Festival, humanity triumphed over tech

    (01:06:37) Daliana Interviews Andrey
    (01:42:00) AI Outro Song

    • 1 hr 43 min
    #166 - new AI song generator, Microsoft's GPT4 efforts, AlphaFold3, xLSTM, OpenAI Model Spec

    #166 - new AI song generator, Microsoft's GPT4 efforts, AlphaFold3, xLSTM, OpenAI Model Spec

    Our 166th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    (00:00:00) Intro / Banter
    Tools & Apps(00:04:23) ElevenLabs previews music-generating AI model
    (00:09:31) Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts
    (00:13:00) SoundHound AI and Perplexity Partner to Bring Online LLMs to Next Gen Voice Assistants Across Cars and IoT Devices
    (00:14:50) Stability AI sows gen AI discord with Stable Artisan
    (00:16:35) Apple Will Revamp Siri to Catch Up to Its Chatbot Competitors
    (00:18:54) Alibaba rolls out latest version of its large language model to meet robust AI demand

    Applications & Business(00:19:34) OpenAI and Stack Overflow partner to bring more technical knowledge into ChatGPT
    (00:17:31) New Microsoft AI model may challenge GPT-4 and Google Gemini
    (00:31:08) Wayve, an A.I. Start-Up for Autonomous Driving, Raises $1 Billion
    (00:32:00) Motional delays commercial robotaxi plans amid restructuring
    (00:33:54) The rise of the Chinese AI unicorns doing battle with OpenAI

    Projects & Open Source(00:35:25) Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
    (00:40:12) DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
    (00:44:31) OpenVoice V2: Evolving Multilingual Voice Cloning with Enhanced Style Control and Cross-Lingual Capabilities
    (00:45:20) Granite Code Models: A Family of Open Foundation Models for Code Intelligence
    (00:46:00) Hugging Face launches LeRobot open source robotics code library
    (00:48:50) Vibe-Eval: A new open and hard evaluation suite for measuring progress of multimodal language models

    Research & Advancements(00:50:02) Google DeepMind’s Groundbreaking AI for Protein Structure Can Now Model DNA
    (00:57:20) xLSTM: Extended Long Short-Term Memory
    (01:06:35) StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
    (01:07:55) Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
    (01:11:48) KAN: Kolmogorov-Arnold Networks

    Policy & Safety(01:13:20) US lawmakers unveil bill to make it easier to restrict exports of AI models
    (01:17:30) OpenAI’s Model Spec outlines some basic rules for AI
    (01:20:18) Robot dogs armed with AI-targeting rifles undergo US Marines Special Ops evaluation
    (01:25:15) OpenAI Releases ‘Deepfake’ Detector to Disinformation Researchers

    Synthetic Media & Art(01:28:15) Audible’s Test of AI-Voiced Audiobooks Tops 40,000 Titles
    (01:32:30) TikTok will automatically label AI-generated content created on platforms like DALL·E 3
    (01:33:23) Katy Perry's Fan-Made AI Image Is So Real It Fooled the World Into Thinking She Was at the Met Gala
    (01:35:32) South Korean woman falls for deepfake Elon Musk, loses $50K in romance scam 
    (01:37:18) Why young Russian women appear so eager to marry Chinese men

    (01:40:18) AI Outro Song

    • 1 hr 41 min
    #165 - Sora challenger, Astribot's S1, Med-Gemini, Refusal in LLMs

    #165 - Sora challenger, Astribot's S1, Med-Gemini, Refusal in LLMs

    Our 165th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps(00:01:27) GitHub releases an AI-powered tool aiming for a 'radically new way of building software'
    (00:07:05) China unveils Sora challenger able to produce videos from text similar to OpenAI tool, though much shorter
    (00:12:23) ChatGPT’s AI ‘memory’ can remember the preferences of paying customers
    (00:14:21) Rabbit R1 review: Avoid this AI gadget
    (00:18:30) Amazon Q, a generative AI-powered assistant for businesses and developers, is now generally available
    (00:19:54) Yelp’s Assistant AI bot will do all the talking to help users find service providers

    Applications & Business(00:21:31) Video of super-fast, super-smooth humanoid robot will drop your jaw
    (00:25:22) Tesla’s 2 million car Autopilot recall is now under federal scrutiny
    (00:29:32) Tesla shares soar as Elon Musk returns from China with FSD 'Game Changer'
    (00:32:11) OpenAI inks strategic tie-up with UK’s Financial Times, including content use
    (00:35:21) OpenAI Startup Fund quietly raises $15M
    (00:37:00) Huawei backs HBM memory manufacturing in China to sidestep crippling US sanctions that restrict AI development

    Research & Advancements(00:39:20) Capabilities of Gemini Models in Medicine
    (00:45:34) Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
    (00:52:20) NExT: Teaching Large Language Models to Reason about Code Execution
    (00:55:08) SenseNova 5.0: China’s latest AI model surpasses OpenAI’s GPT-4
    (00:57:20) Octopus v4: Graph of language models
    (01:00:28) Better & Faster Large Language Models via Multi-token Prediction

    Policy & Safety(01:03:15) Refusal in LLMs is mediated by a single direction
    (01:09:19) Rishi Sunak promised to make AI safe. Big Tech’s not playing ball.
    (01:15:09) DOE Announces New Actions to Enhance America’s Global Leadership in Artificial Intelligence
    (01:18:21) The Chips Act is rebuilding US semiconductor manufacturing, so far resulting in $327 billion in announced projects
    (01:20:50) Analysis-Second global AI safety summit faces tough questions, lower turnout
    (01:24:03) Sam Altman, Jensen Huang, and more join the federal AI safety board

    Synthetic Media & Art(01:26:30) Air Head creators say OpenAI's Sora finicky to work with, needs hundreds of prompts, serious VFX work for under 2 minutes of cohesive story ↺
    (01:29:50) Eight newspaper publishers sue OpenAI over copyright infringement

    • 1 hr 32 min
    #164 - Meta AI, Phi-3, OpenELM, Bollywood Deepfakes

    #164 - Meta AI, Phi-3, OpenELM, Bollywood Deepfakes

    Our 164th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps
    (00:04:02) Meta, in Its Biggest A.I. Push, Places Smart Assistants Across Its Apps
    (00:07:26) Microsoft launches Phi-3, its smallest AI model yet
    (00:15:35) The Ray-Ban Meta Smart Glasses have multimodal AI now
    (00:17:32) OpenAI winds down AI image generator that blew minds and forged friendships in 2022
    (00:18:44) Baidu claims 200 million users for Ernie chatbot after only 13 months
    (00:21:13) The new Adobe Photoshop gets an in-app image generator, major Generative Fill upgrades

    Applications & Business
    (00:22:22) Intel & The Pentagon Deepen Ties To Develop World’s Most Advanced Chips
    (00:27:58) Meta Says It Plans to Spend Billions More on A.I.
    (00:31:36) OpenAI CEO Sam Altman invests in solar power firm Exowatt to fuel AI datacenters
    (00:33:58) Google consolidates AI-focused DeepMind, Research teams
    (00:36:22) Microsoft and OpenAI bet $100 billion to free themselves from the shackles and overreliance on the world's most profitable semiconductor chip brand for AI chips

    Projects & Open Source
    (00:39:03) Apple releases OpenELM: small, open source AI models designed to run on-device
    (00:44:12) Snowflake launches Arctic, an open ‘mixture-of-experts’ LLM to take on DBRX, Llama 3

    Research & Advancements
    (00:48:08) The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
    (00:55:11) Groq’s breakthrough AI chip achieves blistering 800 tokens per second on Meta’s LLaMA 3
    (00:59:52) Microsoft shows off VASA-1, an AI framework that makes human headshots talk, sing
    (01:01:59) Intel Builds World’s Largest Neuromorphic System to Enable More Sustainable AI

    Policy & Safety
    (01:05:11) Deepfakes of Bollywood stars spark worries of AI meddling in India election
    (01:08:51) LLM Agents can Autonomously Exploit One-day Vulnerabilities
    (01:15:27) The Necessity of AI Audit Standards Boards
    (01:19:45) A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
    (01:22:45) COERCING LLMS TO DO AND REVEAL (ALMOST) ANYTHING
    (01:26:40) China acquired recently banned Nvidia chips in Super Micro, Dell servers, tenders show

    Synthetic Media & Art
    (01:29:08) Drake threatened with lawsuit over diss track featuring AI Tupac

    • 1 hr 31 min
    #163 - Llama 3, Grok-1.5 Vision, new Atlas robot, RHO-1, Medium ban

    #163 - Llama 3, Grok-1.5 Vision, new Atlas robot, RHO-1, Medium ban

    Our 163rd episode with a summary and discussion of last week's big AI news!
    Note: apology for this one coming out a few days late, got delayed in editing it -Andrey
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Intro / Banter
    Tools & Apps
    (00:02:16) Meta releases Llama 3, claims it’s among the best open models available
    (00:14:01) Elon Musk’s xAI Unveils Grok-1.5 Vision, Beats OpenAI’s GPT-4V
    (00:17:55) Reka releases Reka Core, its multimodal language model to rival GPT-4 and Claude 3 Opus
    (00:21:50) Cohere Compass Private Beta: A New Multi-Aspect Embedding Model
    (00:23:48) Amazon Music’s Maestro lets listeners make AI playlists
    (00:24:36) Snap plans to add watermarks to images created with its AI-powered tools

    Applications & Business
    (00:25:52) Boston Dynamics unveils new Atlas robot for commercial use
    (00:30:32) TSMC’s $65 billion bet still leaves US missing piece of chip puzzle
    (00:36:30) U.S. blacklists Intel's and Nvidia's key partner in China — three other Chinese firms also included in the blacklist for helping the military
    (00:38:37) Elon Musk says the next-generation Grok 3 model will require 100,000 Nvidia H100 GPUs to train
    (00:40:22) Dr. Andrew Ng appointed to Amazon’s Board of Directors
    (00:41:55) Collaborative Robotics Locks Up $100M, Latest Robot Startup To Raise Big

    Projects & Open Source
    (00:44:08) OpenEQA: Embodied Question Answering in the Era of Foundation Models
    (00:50:03) Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

    Research & Advancements
    (00:51:21) RHO-1: Not All Tokens Are What You Need
    (00:57:21) Scaling Laws for Fine-Grained Mixture of Experts
    (01:03:20) Chinchilla Scaling: A replication attempt
    (01:07:18) China develops new light-based chiplet that could power artificial general intelligence — where AI is smarter than humans
    (01:10:45) OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

    Policy & Safety
    (01:13:44) U.S. Commerce Secretary Gina Raimondo Announces Expansion of U.S. AI Safety Institute Leadership Team
    (01:17:18) NSA Publishes Guidance for Strengthening AI System Security
    (01:19:19) Foundational Challenges in Assuring Alignment and Safety of Large Language Models
    (01:24:11) Former OpenAI Board Member Calls for Audits of Top AI Companies
    (01:27:35) SoA survey reveals a third of translators and quarter of illustrators losing work to AI

    Synthetic Media & Art
    (01:30:25) Medium bans AI-generated content from its paid Partner Program

    • 1 hr 33 min
    #162 - Udio Song AI, TPU v5, Mixtral 8x22, Mixture-of-Depths, Musicians sign open letter

    #162 - Udio Song AI, TPU v5, Mixtral 8x22, Mixture-of-Depths, Musicians sign open letter

    Our 162nd episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps
    (00:02:50) AI-Music Arms Race: Meet Udio, the Other ChatGPT for Music
    (00:07:42) Anthropic launches external tool use for Claude AI, enabling stock ticker integrations and more
    (00:11:51) Building LLMs for Code Repair
    (00:14:16) Early Reviews of Humane AI Pin Aren’t Impressed
    (00:16:23) Microsoft 365’s Copilot gets a GPT-4 Turbo upgrade and improved image generation
    (00:18:41) AI editing tools are coming to all Google Photos users

    Applications & Business
    (00:19:21) Google announces the Cloud TPU v5p, its most powerful AI accelerator yet
    (00:23:32) Meta unveils its newest custom AI chip as it races to catch up
    (00:27:27) Intel Unveils New AI Accelerator in Bid to Challenge Nvidia
    (00:30:46) Adobe Is Buying Videos for $3 Per Minute to Build AI Model
    (00:32:55) OpenAI transcribed over a million hours of YouTube videos to train GPT-4
    (00:36:23) Waymo will launch paid robotaxi service in Los Angeles on Wednesday
    (00:37:23) OpenAI removes Sam Altman's ownership of its Startup Fund

    Projects & Open Source
    (00:39:51) Mistral AI Stuns With Surprise Launch of New Mixtral 8x22B Model
    (00:43:54) Google updates its Gemma AI model family with variants for coding and research
    (00:47:04) Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

    Research & Advancements
    (00:52:08) Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
    (00:57:41) Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
    (01:03:31) Octopus v2: On-device language model for super agent
    (01:07:54) Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
    (01:09:54) Many-shot Jailbreaking

    Policy & Safety
    (01:15:08) Schiff unveils AI training transparency measure
    (01:20:25) Linwei Ding was a Google software engineer. He was also a prolific thief of trade secrets, say prosecutors.
    (01:26:11) Responsible Reporting for Frontier AI Development
    (01:30:08) US govt wants to talk to tech companies about AI electricity demands — eyes nuclear fusion and fission
    (01:32:39) Washington state judge blocks use of AI-enhanced video as evidence in possible first-of-its-kind ruling
    (01:36:45) Trudeau announces $2.4 billion for AI-related investments

    Synthetic Media & Art
    (01:39:26) Billie Eilish, Pearl Jam, Nicki Minaj Among 200 Artists Calling for Responsible AI Music Practices

    Fun!
    (01:41:52) OpenAI's Sora just made its first music video and it's like a psychedelic trip

    • 1 hr 45 min

Top Podcasts In Technology

iOS Today (Audio)
TWiT
Go Time: Golang, Software Engineering
Changelog Media
My AudioNerds
Devvon Terrell
Apple Events (audio)
Apple
Note to Self
WNYC Studios
Hard Fork
The New York Times

You Might Also Like

This Day in AI Podcast
Michael Sharkey, Chris Sharkey
Practical AI: Machine Learning, Data Science
Changelog Media
The AI Podcast
NVIDIA
The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis
Nathaniel Whittemore
Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Sam Charrington