Thompson Sampling Regret Bounds for Logistic Bandits

Neural intel Pod

Dive into the mathematics of decision-making under uncertainty, exploring how Thompson Sampling helps balance exploration and exploitation in online learning with binary outcomes.
