How Do Multimodal Large Language Models Perform on Clinical Vignette Questions?

JAMA Author Interviews

How did GPT-4 Vision, a model that can work with images and text as input, perform when answering clinical challenge questions from medical journals? Daniel Truhn, MD, MSc, of the University Hospital Aachen in Germany, joins JAMA Editor in Chief Kirsten Bibbins-Domingo, PhD, MD, MAS, to discuss this topic. Related Content:

  • Comparative Analysis of Multimodal Large Language Model Performance on Clinical Vignette Questions

To listen to explicit episodes, sign in.

Stay up to date with this show

Sign in or sign up to follow shows, save episodes, and get the latest updates.

Select a country or region

Africa, Middle East, and India

Asia Pacific

Europe

Latin America and the Caribbean

The United States and Canada