In this article GOOGL META Follow your favorite stocks CREATE FREE ACCOUNT Hands, tablet and doctor with body hologram, overlay and dna research for medical innovation on app. Medic man, nurse and mobile touchscreen for typing on anatomy study or 3d holographic ux in clinic Jacob Wackerhausen | Istock | Getty Images Dr. Scott Gottlieb is a physician and served as the 23rd Commissioner of the U.

S. Food and Drug Administration. He is a CNBC contributor and is a member of the boards of Pfizer and several other startups in health and tech.

He is also a partner at the venture capital firm New Enterprise Associates. Shani Benezra is a senior research associate at the American Enterprise Institute and a former associate producer at CBS News' Face the Nation. Many consumers and medical providers are turning to chatbots, powered by large language models, to answer medical questions and inform treatment choices.

We decided to see whether there were major differences between the leading platforms when it came to their clinical aptitude. To secure a medical license in the United States, aspiring doctors must successfully navigate three stages of the U.S.

Medical Licensing Examination (USMLE), with the third and final installment widely regarded as the most challenging. It requires candidates to answer about 60% of the questions correctly, and historically, the average passing score hovered around 75%. When we subjected the major large language models (LLMs) to the same Step 3 examination.