205 episodes

Weekly summaries and discussion about the most interesting developments in AI, deep learning, robotics, and more!

Last Week in AI Skynet Today

    • Technology

Weekly summaries and discussion about the most interesting developments in AI, deep learning, robotics, and more!

    #167 - GPT-4o, Project Astra, Veo, OpenAI Departures, Interview with Andrey

    #167 - GPT-4o, Project Astra, Veo, OpenAI Departures, Interview with Andrey

    Our 167th episode with a summary and discussion of last week's big AI news!
    With guest host Daliana Liu (https://www.linkedin.com/in/dalianaliu/) from The Data Scientist Show!
    And a special one-time interview with Andrey in the latter part of the podcast.
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Intro / Banter
    Tools & Apps
    (00:03:42) OpenAI releases GPT-4o, a faster model that’s free for all ChatGPT users
    (00:12:06) Project Astra is the future of AI at Google
    (00:18:06) Google is redesigning its search engine — and it’s AI all the way down
    (00:19:39) Google unveils Veo and Imagen 3, its latest AI media creation models
    (00:23:36) Google Unveils Music AI Sandbox Making Loops From Prompts
    (00:26:27) Anthropic AI Launches a Prompt Engineering Tool that Generates Production-Ready Prompts in the Anthropic Console

    Applications & Business
    (00:31:02) OpenAI’s Chief Scientist and Co-Founder Is Leaving the Company
    (00:35:15) Mike Krieger joins Anthropic as Chief Product Officer
    (00:36:28) $16k G1 humanoid rises up to smash nuts, twist and twirl
    (00:41:02) GM's Cruise to start testing robotaxis in Phoenix area with human safety drivers on board
    (00:42:52) US agency probes Amazon-owned Zoox self-driving vehicles after two crashes
    (00:43:58) Waymo’s robotaxis under investigation after crashes and traffic mishaps

    Projects & Open Source
    (00:44:48) Introducing PaliGemma, Gemma 2, and an Upgraded Responsible AI Toolkit
    (00:46:24) Falcon 2: UAE’s Technology Innovation Institute Releases New AI Model Series, Outperforming Meta’s New Llama 3
    (00:48:00) License to Call: Introducing Transformers Agents 2.0

    Research & Advancements
    (00:49:22) The Platonic Representation Hypothesis
    (00:53:08) SUTRA: Scalable Multilingual Language Model Architecture

    Policy & Safety
    (00:54:46) Bipartisan Senate bill on AI security would bolster voluntary cyber reporting processes
    (00:56:17) U.K. agency releases tools to test AI model safety
    (00:57:25) Protesters Are Fighting to Stop AI, but They’re Split on How to Do It

    Synthetic Media & Art
    (00:58:54) Google’s invisible AI watermark will help identify generative text and video
    (01:00:50) How One Author Pushed the Limits of AI Copyright
    (01:03:27) Stellaris gets an DLC about AI that features AI-created voices, director insists it's 'ethical' and 'we're pretty good at exploring dystopian sci-fi and don't want to end up there ourselves'
    (01:04:46) At the AI Film Festival, humanity triumphed over tech

    (01:06:37) Daliana Interviews Andrey
    (01:42:00) AI Outro Song

    • 1 hr 43 min
    #166 - new AI song generator, Microsoft's GPT4 efforts, AlphaFold3, xLSTM, OpenAI Model Spec

    #166 - new AI song generator, Microsoft's GPT4 efforts, AlphaFold3, xLSTM, OpenAI Model Spec

    Our 166th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    (00:00:00) Intro / Banter
    Tools & Apps(00:04:23) ElevenLabs previews music-generating AI model
    (00:09:31) Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts
    (00:13:00) SoundHound AI and Perplexity Partner to Bring Online LLMs to Next Gen Voice Assistants Across Cars and IoT Devices
    (00:14:50) Stability AI sows gen AI discord with Stable Artisan
    (00:16:35) Apple Will Revamp Siri to Catch Up to Its Chatbot Competitors
    (00:18:54) Alibaba rolls out latest version of its large language model to meet robust AI demand

    Applications & Business(00:19:34) OpenAI and Stack Overflow partner to bring more technical knowledge into ChatGPT
    (00:17:31) New Microsoft AI model may challenge GPT-4 and Google Gemini
    (00:31:08) Wayve, an A.I. Start-Up for Autonomous Driving, Raises $1 Billion
    (00:32:00) Motional delays commercial robotaxi plans amid restructuring
    (00:33:54) The rise of the Chinese AI unicorns doing battle with OpenAI

    Projects & Open Source(00:35:25) Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
    (00:40:12) DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
    (00:44:31) OpenVoice V2: Evolving Multilingual Voice Cloning with Enhanced Style Control and Cross-Lingual Capabilities
    (00:45:20) Granite Code Models: A Family of Open Foundation Models for Code Intelligence
    (00:46:00) Hugging Face launches LeRobot open source robotics code library
    (00:48:50) Vibe-Eval: A new open and hard evaluation suite for measuring progress of multimodal language models

    Research & Advancements(00:50:02) Google DeepMind’s Groundbreaking AI for Protein Structure Can Now Model DNA
    (00:57:20) xLSTM: Extended Long Short-Term Memory
    (01:06:35) StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
    (01:07:55) Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
    (01:11:48) KAN: Kolmogorov-Arnold Networks

    Policy & Safety(01:13:20) US lawmakers unveil bill to make it easier to restrict exports of AI models
    (01:17:30) OpenAI’s Model Spec outlines some basic rules for AI
    (01:20:18) Robot dogs armed with AI-targeting rifles undergo US Marines Special Ops evaluation
    (01:25:15) OpenAI Releases ‘Deepfake’ Detector to Disinformation Researchers

    Synthetic Media & Art(01:28:15) Audible’s Test of AI-Voiced Audiobooks Tops 40,000 Titles
    (01:32:30) TikTok will automatically label AI-generated content created on platforms like DALL·E 3
    (01:33:23) Katy Perry's Fan-Made AI Image Is So Real It Fooled the World Into Thinking She Was at the Met Gala
    (01:35:32) South Korean woman falls for deepfake Elon Musk, loses $50K in romance scam 
    (01:37:18) Why young Russian women appear so eager to marry Chinese men

    (01:40:18) AI Outro Song

    • 1 hr 41 min
    #165 - Sora challenger, Astribot's S1, Med-Gemini, Refusal in LLMs

    #165 - Sora challenger, Astribot's S1, Med-Gemini, Refusal in LLMs

    Our 165th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps(00:01:27) GitHub releases an AI-powered tool aiming for a 'radically new way of building software'
    (00:07:05) China unveils Sora challenger able to produce videos from text similar to OpenAI tool, though much shorter
    (00:12:23) ChatGPT’s AI ‘memory’ can remember the preferences of paying customers
    (00:14:21) Rabbit R1 review: Avoid this AI gadget
    (00:18:30) Amazon Q, a generative AI-powered assistant for businesses and developers, is now generally available
    (00:19:54) Yelp’s Assistant AI bot will do all the talking to help users find service providers

    Applications & Business(00:21:31) Video of super-fast, super-smooth humanoid robot will drop your jaw
    (00:25:22) Tesla’s 2 million car Autopilot recall is now under federal scrutiny
    (00:29:32) Tesla shares soar as Elon Musk returns from China with FSD 'Game Changer'
    (00:32:11) OpenAI inks strategic tie-up with UK’s Financial Times, including content use
    (00:35:21) OpenAI Startup Fund quietly raises $15M
    (00:37:00) Huawei backs HBM memory manufacturing in China to sidestep crippling US sanctions that restrict AI development

    Research & Advancements(00:39:20) Capabilities of Gemini Models in Medicine
    (00:45:34) Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
    (00:52:20) NExT: Teaching Large Language Models to Reason about Code Execution
    (00:55:08) SenseNova 5.0: China’s latest AI model surpasses OpenAI’s GPT-4
    (00:57:20) Octopus v4: Graph of language models
    (01:00:28) Better & Faster Large Language Models via Multi-token Prediction

    Policy & Safety(01:03:15) Refusal in LLMs is mediated by a single direction
    (01:09:19) Rishi Sunak promised to make AI safe. Big Tech’s not playing ball.
    (01:15:09) DOE Announces New Actions to Enhance America’s Global Leadership in Artificial Intelligence
    (01:18:21) The Chips Act is rebuilding US semiconductor manufacturing, so far resulting in $327 billion in announced projects
    (01:20:50) Analysis-Second global AI safety summit faces tough questions, lower turnout
    (01:24:03) Sam Altman, Jensen Huang, and more join the federal AI safety board

    Synthetic Media & Art(01:26:30) Air Head creators say OpenAI's Sora finicky to work with, needs hundreds of prompts, serious VFX work for under 2 minutes of cohesive story ↺
    (01:29:50) Eight newspaper publishers sue OpenAI over copyright infringement

    • 1 hr 32 min
    #164 - Meta AI, Phi-3, OpenELM, Bollywood Deepfakes

    #164 - Meta AI, Phi-3, OpenELM, Bollywood Deepfakes

    Our 164th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps
    (00:04:02) Meta, in Its Biggest A.I. Push, Places Smart Assistants Across Its Apps
    (00:07:26) Microsoft launches Phi-3, its smallest AI model yet
    (00:15:35) The Ray-Ban Meta Smart Glasses have multimodal AI now
    (00:17:32) OpenAI winds down AI image generator that blew minds and forged friendships in 2022
    (00:18:44) Baidu claims 200 million users for Ernie chatbot after only 13 months
    (00:21:13) The new Adobe Photoshop gets an in-app image generator, major Generative Fill upgrades

    Applications & Business
    (00:22:22) Intel & The Pentagon Deepen Ties To Develop World’s Most Advanced Chips
    (00:27:58) Meta Says It Plans to Spend Billions More on A.I.
    (00:31:36) OpenAI CEO Sam Altman invests in solar power firm Exowatt to fuel AI datacenters
    (00:33:58) Google consolidates AI-focused DeepMind, Research teams
    (00:36:22) Microsoft and OpenAI bet $100 billion to free themselves from the shackles and overreliance on the world's most profitable semiconductor chip brand for AI chips

    Projects & Open Source
    (00:39:03) Apple releases OpenELM: small, open source AI models designed to run on-device
    (00:44:12) Snowflake launches Arctic, an open ‘mixture-of-experts’ LLM to take on DBRX, Llama 3

    Research & Advancements
    (00:48:08) The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
    (00:55:11) Groq’s breakthrough AI chip achieves blistering 800 tokens per second on Meta’s LLaMA 3
    (00:59:52) Microsoft shows off VASA-1, an AI framework that makes human headshots talk, sing
    (01:01:59) Intel Builds World’s Largest Neuromorphic System to Enable More Sustainable AI

    Policy & Safety
    (01:05:11) Deepfakes of Bollywood stars spark worries of AI meddling in India election
    (01:08:51) LLM Agents can Autonomously Exploit One-day Vulnerabilities
    (01:15:27) The Necessity of AI Audit Standards Boards
    (01:19:45) A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
    (01:22:45) COERCING LLMS TO DO AND REVEAL (ALMOST) ANYTHING
    (01:26:40) China acquired recently banned Nvidia chips in Super Micro, Dell servers, tenders show

    Synthetic Media & Art
    (01:29:08) Drake threatened with lawsuit over diss track featuring AI Tupac

    • 1 hr 31 min
    #163 - Llama 3, Grok-1.5 Vision, new Atlas robot, RHO-1, Medium ban

    #163 - Llama 3, Grok-1.5 Vision, new Atlas robot, RHO-1, Medium ban

    Our 163rd episode with a summary and discussion of last week's big AI news!
    Note: apology for this one coming out a few days late, got delayed in editing it -Andrey
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Intro / Banter
    Tools & Apps
    (00:02:16) Meta releases Llama 3, claims it’s among the best open models available
    (00:14:01) Elon Musk’s xAI Unveils Grok-1.5 Vision, Beats OpenAI’s GPT-4V
    (00:17:55) Reka releases Reka Core, its multimodal language model to rival GPT-4 and Claude 3 Opus
    (00:21:50) Cohere Compass Private Beta: A New Multi-Aspect Embedding Model
    (00:23:48) Amazon Music’s Maestro lets listeners make AI playlists
    (00:24:36) Snap plans to add watermarks to images created with its AI-powered tools

    Applications & Business
    (00:25:52) Boston Dynamics unveils new Atlas robot for commercial use
    (00:30:32) TSMC’s $65 billion bet still leaves US missing piece of chip puzzle
    (00:36:30) U.S. blacklists Intel's and Nvidia's key partner in China — three other Chinese firms also included in the blacklist for helping the military
    (00:38:37) Elon Musk says the next-generation Grok 3 model will require 100,000 Nvidia H100 GPUs to train
    (00:40:22) Dr. Andrew Ng appointed to Amazon’s Board of Directors
    (00:41:55) Collaborative Robotics Locks Up $100M, Latest Robot Startup To Raise Big

    Projects & Open Source
    (00:44:08) OpenEQA: Embodied Question Answering in the Era of Foundation Models
    (00:50:03) Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

    Research & Advancements
    (00:51:21) RHO-1: Not All Tokens Are What You Need
    (00:57:21) Scaling Laws for Fine-Grained Mixture of Experts
    (01:03:20) Chinchilla Scaling: A replication attempt
    (01:07:18) China develops new light-based chiplet that could power artificial general intelligence — where AI is smarter than humans
    (01:10:45) OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

    Policy & Safety
    (01:13:44) U.S. Commerce Secretary Gina Raimondo Announces Expansion of U.S. AI Safety Institute Leadership Team
    (01:17:18) NSA Publishes Guidance for Strengthening AI System Security
    (01:19:19) Foundational Challenges in Assuring Alignment and Safety of Large Language Models
    (01:24:11) Former OpenAI Board Member Calls for Audits of Top AI Companies
    (01:27:35) SoA survey reveals a third of translators and quarter of illustrators losing work to AI

    Synthetic Media & Art
    (01:30:25) Medium bans AI-generated content from its paid Partner Program

    • 1 hr 33 min
    #162 - Udio Song AI, TPU v5, Mixtral 8x22, Mixture-of-Depths, Musicians sign open letter

    #162 - Udio Song AI, TPU v5, Mixtral 8x22, Mixture-of-Depths, Musicians sign open letter

    Our 162nd episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps
    (00:02:50) AI-Music Arms Race: Meet Udio, the Other ChatGPT for Music
    (00:07:42) Anthropic launches external tool use for Claude AI, enabling stock ticker integrations and more
    (00:11:51) Building LLMs for Code Repair
    (00:14:16) Early Reviews of Humane AI Pin Aren’t Impressed
    (00:16:23) Microsoft 365’s Copilot gets a GPT-4 Turbo upgrade and improved image generation
    (00:18:41) AI editing tools are coming to all Google Photos users

    Applications & Business
    (00:19:21) Google announces the Cloud TPU v5p, its most powerful AI accelerator yet
    (00:23:32) Meta unveils its newest custom AI chip as it races to catch up
    (00:27:27) Intel Unveils New AI Accelerator in Bid to Challenge Nvidia
    (00:30:46) Adobe Is Buying Videos for $3 Per Minute to Build AI Model
    (00:32:55) OpenAI transcribed over a million hours of YouTube videos to train GPT-4
    (00:36:23) Waymo will launch paid robotaxi service in Los Angeles on Wednesday
    (00:37:23) OpenAI removes Sam Altman's ownership of its Startup Fund

    Projects & Open Source
    (00:39:51) Mistral AI Stuns With Surprise Launch of New Mixtral 8x22B Model
    (00:43:54) Google updates its Gemma AI model family with variants for coding and research
    (00:47:04) Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

    Research & Advancements
    (00:52:08) Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
    (00:57:41) Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
    (01:03:31) Octopus v2: On-device language model for super agent
    (01:07:54) Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
    (01:09:54) Many-shot Jailbreaking

    Policy & Safety
    (01:15:08) Schiff unveils AI training transparency measure
    (01:20:25) Linwei Ding was a Google software engineer. He was also a prolific thief of trade secrets, say prosecutors.
    (01:26:11) Responsible Reporting for Frontier AI Development
    (01:30:08) US govt wants to talk to tech companies about AI electricity demands — eyes nuclear fusion and fission
    (01:32:39) Washington state judge blocks use of AI-enhanced video as evidence in possible first-of-its-kind ruling
    (01:36:45) Trudeau announces $2.4 billion for AI-related investments

    Synthetic Media & Art
    (01:39:26) Billie Eilish, Pearl Jam, Nicki Minaj Among 200 Artists Calling for Responsible AI Music Practices

    Fun!
    (01:41:52) OpenAI's Sora just made its first music video and it's like a psychedelic trip

    • 1 hr 45 min

Top Podcasts In Technology

Waveform: The MKBHD Podcast
Vox Media Podcast Network
MacStories Unwind
viticci@macstories.net (Federico Viticci)
AppStories
Federico Viticci, John Voorhees
Люди и код
Skillbox Media Code
Απλά Ψηφιακά
Απλά Ψηφιακά
ΓΙΑ ΗΛΙΘΙΟΥΣ!
PCsteps.gr

You Might Also Like

This Day in AI Podcast
Michael Sharkey, Chris Sharkey
Practical AI: Machine Learning, Data Science
Changelog Media
The AI Podcast
NVIDIA
The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis
Nathaniel Whittemore
Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Sam Charrington