1 hr 23 min

Adversarial Machine Learning
Oxide and Friends

    • Technology

Nicholas Carlini joined Bryan, Adam, and the Oxide Friends to talk about his work with adversarial machine learning. He's found sequences of seemingly random tokens that cause LLMs to ignore their restrictions! Also: printf is Turing complete?!
In addition to Bryan Cantrill and Adam Leventhal, we were joined by special guest Nicholas Carlini.
If we got something wrong or missed something, please file a PR! Our next show will likely be on Monday at 5p Pacific Time on our Discord server; stay tuned to our Mastodon feeds for details, or subscribe to this calendar. We'd love to have you join us, as we always love to hear from new speakers!

