201 Folgen

Weekly summaries and discussion about the most interesting developments in AI, deep learning, robotics, and more!

Last Week in AI Skynet Today

    • Technologie

Weekly summaries and discussion about the most interesting developments in AI, deep learning, robotics, and more!

    #163 - Llama 3, Grok-1.5 Vision, new Atlas robot, RHO-1, Medium ban

    #163 - Llama 3, Grok-1.5 Vision, new Atlas robot, RHO-1, Medium ban

    Our 163rd episode with a summary and discussion of last week's big AI news!
    Note: apology for this one coming out a few days late, got delayed in editing it -Andrey
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Intro / Banter
    Tools & Apps
    (00:02:16) Meta releases Llama 3, claims it’s among the best open models available
    (00:14:01) Elon Musk’s xAI Unveils Grok-1.5 Vision, Beats OpenAI’s GPT-4V
    (00:17:55) Reka releases Reka Core, its multimodal language model to rival GPT-4 and Claude 3 Opus
    (00:21:50) Cohere Compass Private Beta: A New Multi-Aspect Embedding Model
    (00:23:48) Amazon Music’s Maestro lets listeners make AI playlists
    (00:24:36) Snap plans to add watermarks to images created with its AI-powered tools

    Applications & Business
    (00:25:52) Boston Dynamics unveils new Atlas robot for commercial use
    (00:30:32) TSMC’s $65 billion bet still leaves US missing piece of chip puzzle
    (00:36:30) U.S. blacklists Intel's and Nvidia's key partner in China — three other Chinese firms also included in the blacklist for helping the military
    (00:38:37) Elon Musk says the next-generation Grok 3 model will require 100,000 Nvidia H100 GPUs to train
    (00:40:22) Dr. Andrew Ng appointed to Amazon’s Board of Directors
    (00:41:55) Collaborative Robotics Locks Up $100M, Latest Robot Startup To Raise Big

    Projects & Open Source
    (00:44:08) OpenEQA: Embodied Question Answering in the Era of Foundation Models
    (00:50:03) Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

    Research & Advancements
    (00:51:21) RHO-1: Not All Tokens Are What You Need
    (00:57:21) Scaling Laws for Fine-Grained Mixture of Experts
    (01:03:20) Chinchilla Scaling: A replication attempt
    (01:07:18) China develops new light-based chiplet that could power artificial general intelligence — where AI is smarter than humans
    (01:10:45) OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

    Policy & Safety
    (01:13:44) U.S. Commerce Secretary Gina Raimondo Announces Expansion of U.S. AI Safety Institute Leadership Team
    (01:17:18) NSA Publishes Guidance for Strengthening AI System Security
    (01:19:19) Foundational Challenges in Assuring Alignment and Safety of Large Language Models
    (01:24:11) Former OpenAI Board Member Calls for Audits of Top AI Companies
    (01:27:35) SoA survey reveals a third of translators and quarter of illustrators losing work to AI

    Synthetic Media & Art
    (01:30:25) Medium bans AI-generated content from its paid Partner Program

    • 1 Std. 33 Min.
    #162 - Udio Song AI, TPU v5, Mixtral 8x22, Mixture-of-Depths, Musicians sign open letter

    #162 - Udio Song AI, TPU v5, Mixtral 8x22, Mixture-of-Depths, Musicians sign open letter

    Our 162nd episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps
    (00:02:50) AI-Music Arms Race: Meet Udio, the Other ChatGPT for Music
    (00:07:42) Anthropic launches external tool use for Claude AI, enabling stock ticker integrations and more
    (00:11:51) Building LLMs for Code Repair
    (00:14:16) Early Reviews of Humane AI Pin Aren’t Impressed
    (00:16:23) Microsoft 365’s Copilot gets a GPT-4 Turbo upgrade and improved image generation
    (00:18:41) AI editing tools are coming to all Google Photos users

    Applications & Business
    (00:19:21) Google announces the Cloud TPU v5p, its most powerful AI accelerator yet
    (00:23:32) Meta unveils its newest custom AI chip as it races to catch up
    (00:27:27) Intel Unveils New AI Accelerator in Bid to Challenge Nvidia
    (00:30:46) Adobe Is Buying Videos for $3 Per Minute to Build AI Model
    (00:32:55) OpenAI transcribed over a million hours of YouTube videos to train GPT-4
    (00:36:23) Waymo will launch paid robotaxi service in Los Angeles on Wednesday
    (00:37:23) OpenAI removes Sam Altman's ownership of its Startup Fund

    Projects & Open Source
    (00:39:51) Mistral AI Stuns With Surprise Launch of New Mixtral 8x22B Model
    (00:43:54) Google updates its Gemma AI model family with variants for coding and research
    (00:47:04) Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

    Research & Advancements
    (00:52:08) Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
    (00:57:41) Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
    (01:03:31) Octopus v2: On-device language model for super agent
    (01:07:54) Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
    (01:09:54) Many-shot Jailbreaking

    Policy & Safety
    (01:15:08) Schiff unveils AI training transparency measure
    (01:20:25) Linwei Ding was a Google software engineer. He was also a prolific thief of trade secrets, say prosecutors.
    (01:26:11) Responsible Reporting for Frontier AI Development
    (01:30:08) US govt wants to talk to tech companies about AI electricity demands — eyes nuclear fusion and fission
    (01:32:39) Washington state judge blocks use of AI-enhanced video as evidence in possible first-of-its-kind ruling
    (01:36:45) Trudeau announces $2.4 billion for AI-related investments

    Synthetic Media & Art
    (01:39:26) Billie Eilish, Pearl Jam, Nicki Minaj Among 200 Artists Calling for Responsible AI Music Practices

    Fun!
    (01:41:52) OpenAI's Sora just made its first music video and it's like a psychedelic trip

    • 1 Std. 45 Min.
    #161 - Claude 3 beats GPT-4, Stability CEO resigns, DBRX, TacticAI, UN resolution on AI

    #161 - Claude 3 beats GPT-4, Stability CEO resigns, DBRX, TacticAI, UN resolution on AI

    Our 161st episode with a summary and discussion of last week's big AI news!
    Check out our sponsor, the SuperDataScience podcast. You can listen to SDS across all major podcasting platforms (e.g., Spotify, Apple Podcasts, Google Podcasts) plus there’s a video version on YouTube.
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Note - one extra story we didn't get to but worth knowing from this week: ‘Totally surreal’: OpenAI shares first short films created with new AI tool Sora
    Timestamps + links:
    (00:00:00) Intro / Banter
    Tools & Apps
    (00:05:20) Google starts testing AI overviews from SGE in main Google search interface
    (00:10:00) Adobe’s new GenStudio platform is an AI factory for advertisers
    (00:13:53) Claude-3 Haiku has reached GPT-4 level by our user preference
    (00:15:26) Microsoft Teams is getting smarter Copilot AI features
    (00:17:16) Samsung is beating Apple in the race to bring AI to smartphones
    (00:19:18) Elon Musk says all premium subscribers on X will gain access to AI chatbot Grok this week

    Applications & Business
    (00:22:36) Stability AI CEO resigns to ‘pursue decentralized AI’
    (00:26:43) Amazon spends $2.75 billion on AI startup Anthropic in its largest venture investment yet
    (00:31:16) Intel Gaudi 2 Accelerators Showcase Competitive Performance Per Dollar Against NVIDIA H100 In MLPerf 4.0 GenAI Benchmarks
    (00:33:43) Chip Startup Celestial AI Lands Massive $175M Series C
    (00:36:10) Accenture Invests in Sanctuary AI to Bring AI-Powered, Humanoid Robotics to Work Alongside Humans
    (00:37:44) Humanoid robots are joining the Mercedes-Benz workforce

    Projects & Open Source
    (00:38:40) Introducing DBRX: A New State-of-the-Art Open LLM
    (00:44:06) Common Corpus: A Large Public Domain Dataset for Training LLMs
    (00:46:07) DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset
    (00:48:15) InternLM2 Technical Report

    Research & Advancements
    (00:51:49) TacticAI: an AI assistant for football tactics
    (00:55:44) On the Conversational Persuasiveness of Large Language Models: A Randomized Controlled Trial
    (00:59:26) AutoDev: Automated AI-Driven Development
    (01:02:48) Reverse Training to Nurse the Reversal Curse
    (01:07:43) Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
    (01:09:48) Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction

    Policy & Safety
    (01:11:12) General Assembly adopts landmark resolution on artificial intelligence
    (01:16:24) Israel Deploys Expansive Facial Recognition Program in Gaza
    (01:19:49) LINEARITY OF RELATION DECODING IN TRANSFORMER LANGUAGE MODELS
    (01:24:52) US Weighs Sanctioning Huawei’s Secretive Chinese Chip Network
    (01:27:51) New York City welcomes robotaxis — but only with safety drivers
    (01:28:40) The White House Puts New Guardrails on Government Use of AI

    Synthetic Media & Art
    (01:31:17) BBC Will Stop Using AI For ‘Doctor Who’ Promotion After Receiving Complaints

    • 1 Std. 36 Min.
    #160 - Nvidia's new GPU, Microsoft pays for Inflection AI, Grok-1 open sourced, Jeremie's Action Plan

    #160 - Nvidia's new GPU, Microsoft pays for Inflection AI, Grok-1 open sourced, Jeremie's Action Plan

    Our 160th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps
    (00:01:36) Adobe Substance 3D’s AI features can turn text into backgrounds and textures
    (00:05:04) OpenAI’s chatbot store is filling up with spam
    (00:11:02) Apple’s AI ambitions could include Google or OpenAI

    Applications & Business
    (00:13:31) Nvidia reveals Blackwell B200 GPU, the ‘world’s most powerful chip’ for AI
    (00:19:34) Microsoft to Pay Inflection AI $650 Million After Scooping Up Most of Staff
    (00:24:33) Figure 01: Conversations & Actions in Humanoid Robotics!
    (00:28:07) OpenAI's GPT-4.5 Turbo leaked on search engines and could launch in June
    (00:30:32) Abu Dhabi in talks to invest in OpenAI chip venture
    (00:33:43) Nvidia Announces GR00T, a Foundation Model For Humanoids

    Projects & Open Source
    (00:35:38) Open Release of Grok-1
    (00:41:25) Stability AI brings a new dimension to video with Stable Video 3D
    (00:44:23) Colossal-AI Team Introduces Open-Sora: An Open-Source Library for Video Generation
    (00:45:43) Evolutionary Optimization of Model Merging Recipes

    Research & Advancements
    (00:46:52) DiPaCo: Distributed Path Composition
    (00:53:58) MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
    (00:59:37) PERL: Parameter Efficient Reinforcement Learning from Human Feedback
    (01:01:55) VideoAgent: Long-form Video Understanding with Large Language Model as Agent
    (01:05:38) MusicHiFi: Fast High-Fidelity Stereo Vocoding

    Policy & Safety
    (01:06:44) The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
    (01:12:43) Exclusive: U.S. Must Move ‘Decisively’ to Avert ‘Extinction-Level’ Threat From AI, Government-Commissioned Report Says
    (01:18:18) Evaluating Frontier Models for Dangerous Capabilities
    (01:22:55) Chinese and western scientists identify ‘red lines’ on AI risks
    (01:26:11) Google fined $272M by French government over AI use of news content
    (01:27:58) Elvis Act Signed Into Tennessee Law to Protect Musicians From AI Deepfakes

    Synthetic Media & Art
    (01:30:17) AI-Generated Science
    (01:35:35) YouTube adds new AI-generated content labeling tool

    Fun!
    (01:37:29) 10 of My Most Popular Text-To-Image Series (+Prompts)

    • 1 Std. 39 Min.
    #159 - Inflection-2.5, Devin, OpenAI board update, SIMA, EU AI Act passed

    #159 - Inflection-2.5, Devin, OpenAI board update, SIMA, EU AI Act passed

    Our 159th episode with a summary and discussion of last week's big AI news!
    Check out our sponsor, the SuperDataScience podcast. You can listen to SDS across all major podcasting platforms (e.g., Spotify, Apple Podcasts, Google Podcasts) plus there’s a video version on YouTube.
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Intro / Banter
    Tools & Apps
    (00:03:24) Inflection-2.5: meet the world's best personal AI
    (00:06:37) Introducing Devin, the first AI software engineer
    (00:11:00) DoorDash’s new AI-powered ‘SafeChat+’ tool automatically detects verbal abuse
    (00:12:44) Anthropic releases Claude 3 Haiku, an AI model built for speed and affordability
    (00:13:30) Pika Labs just added sound effects to its generative AI videos — here’s how it sounds
    (00:15:33) Salesforce announces new AI tools for doctors

    Applications & Business
    (00:17:33) Sam Altman Rejoins OpenAI Board Along With Three New Directors
    (00:21:15) Cohere releases powerful ‘Command-R’ language model for enterprise use
    (00:23:16) Building Meta’s GenAI Infrastructure
    (00:25:53) Baidu Launches China's First 24/7 Robotaxi Service

    Projects & Open Source
    (00:26:54) Croissant: a metadata format for ML-ready datasets
    (00:29:40) SaulLM-7B: A pioneering Large Language Model for Law
    (00:31:45) Kai-Fu Lee’s AI Company “01.AI” Announces the Open Source of the Yi-9B Model

    Research & Advancements
    (00:33:50) A generalist AI agent for 3D virtual environments
    (00:39:16) Stealing Part of a Production Language Model
    (00:42:01) Data Interpreter: An LLM Agent For Data Science
    (00:43:54) ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
    (00:44:55) PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

    Policy & Safety
    (00:46:24) World’s first major act to regulate AI passed by European lawmakers
    (00:48:57) US spearheads first UN resolution on artificial intelligence — aimed at ensuring equal access
    (00:51:27) Google restricts election-related queries for its Gemini chatbot

    Synthetic Media & Art
    (00:52:43) Researchers tested leading AI models for copyright infringement using popular books, and GPT-4 performed worst
    (00:55:27) Nvidia Says NeMo AI Platform Complies With Copyright After Authors’ Complaint
    (00:57:23) Five of this year’s Pulitzer finalists are AI-powered

    Fun!
    (00:58:12) I made by Superman action figure talk with Pika Labs’ new AI lip sync tool — watch this

    • 1 Std.
    #158 - Claude 3, Elon Musk sues OpenAI, StarCoder 2, AI-Generated Spam

    #158 - Claude 3, Elon Musk sues OpenAI, StarCoder 2, AI-Generated Spam

    Our 158th episode with a summary and discussion of last week's big AI news!
    Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
    Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
    Timestamps + links:
    Tools & Apps
    (00:01:05) Introducing the next generation of Claude
    (00:16:05) Competition in AI video generation heats up as Deepmind alums unveil Haiper
    (00:19:40) Meta AI creates ahistorical images, like Google Gemini
    (00:22:12) Ideogram Is A New AI Image Generator That Obliterates the Competition, Outperforming MidJourney and Dall-E 3
    (00:24:25) Wix’s new AI chatbot builds websites in seconds based on prompts
    (00:26:34) I used generative AI to turn my story into a comic—and you can too

    Applications & Business
    (00:27:40) Elon Musk sues OpenAI and CEO Sam Altman for putting profits above humanity , OpenAI Fires Back at Musk Allegations With Trove of Emails
    (00:35:42) Inside the Crisis at Google
    (00:38:36) It’s official: Waymo robotaxis are now free to use freeways and leave San Francisco
    (00:40:33) Nvidia's next-gen AI GPUs could draw an astounding 1000 Watts each, a 40 percent increase — Dell spills the beans on B100 and B200 in its earnings call
    (00:42:43) AI chip startup Groq forms new business unit, acquires Definitive Intelligence

    Projects & Open Source
    (00:45:13) StarCoder 2 and The Stack v2: The Next Generation
    (00:49:05) Introducing TripoSR: Fast 3D Object Generation from Single Images
    (00:51:21) H2O AI releases Danube, a super-tiny LLM for mobile applications

    Research & Advancements
    (00:52:27) AtP*: An efficient and scalable method for localizing LLM behaviour to components
    (00:58:28) Stable Diffusion 3: Research Paper
    (01:02:10) Approaching Human-Level Forecasting with Language Models
    (01:05:34) Here Come the AI Worms
    (01:08:20) High-speed humanoid feels like a step change in robotics
    (01:12:26) Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap

    Policy & Safety
    (01:15:18) India reverses AI stance, requires government approval for model launches
    (01:19:16) When Your AIs Deceive You: Challenges with Partial Observability of Human Evaluators in Reward Learning
    (01:24:45) OpenAI signs open letter to build AI responsibly just days after Elon Musk sued the company for putting profit ahead of people
    (01:27:37) AI-generated articles prompt Wikipedia to downgrade CNET’s reliability rating
    (01:29:40) Malicious AI models on Hugging Face backdoor users’ machines
    (01:32:25) China offers AI computing ‘vouchers’ to its underpowered start-ups

    Synthetic Media & Art
    (01:34:18) Trump supporters target black voters with faked AI images
    (01:37:10) AI-Generated Kara Swisher Biographies Flood Amazon
    (01:39:55) Inside the World of AI TikTok Spammers
    (01:43:35) Twitter is becoming a 'ghost town' of bots as AI-generated spam content floods the internet

    Fun!
    (01:46:35) Man tries to steal driverless car in L.A., doesn’t get far: police

    • 1 Std. 48 Min.

Top‑Podcasts in Technologie

Ö1 matrix
ORF Ö1
Ö1 Digital.Leben
ORF Ö1
Darknet Diaries
Jack Rhysider
Deep Questions with Cal Newport
Cal Newport
Dwarkesh Podcast
Dwarkesh Patel
Lex Fridman Podcast
Lex Fridman

Das gefällt dir vielleicht auch

This Day in AI Podcast
Michael Sharkey, Chris Sharkey
Practical AI: Machine Learning, Data Science
Changelog Media
The AI Breakdown: Daily Artificial Intelligence News and Discussions
Nathaniel Whittemore
The AI Podcast
NVIDIA
Super Data Science: ML & AI Podcast with Jon Krohn
Jon Krohn
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Sam Charrington