May 14, 2024

Pediatric Dermatologists Outperform Artificial Intelligence; ChatGPT Demonstrates Comparability in Some Aspects

Author(s): Emma Andrus, Associate Editor

In a comparison of pediatric dermatologists and AI, dermatologists generally performed better.

Artificial intelligence-based tools (AITs) such as OpenAI's Chat Generative Pre-trained Transformer (ChatGPT) have become increasingly important in medical applications. These tools have demonstrated the ability to predict patient outcomes and treatment-associated adverse events, as well as to interpret imaging or lab results, among other capabilities.¹

Aware of these capabilities and the ever-expanding role of AITs in the medical field, researchers Huang et al sought to assess the knowledge and clinical diagnostic capabilities of ChatGPT iterations 3.5 and 4.0 by comparing them with pediatric dermatologists.

In the study, published in Pediatric Dermatology, researchers found that, on average, pediatric dermatologists largely outperformed AITs on multiple-choice, multiple-answer, and case-based questions.² However, the results also showed that ChatGPT, specifically version 4.0, performed comparably in some respects, including on multiple-choice and multiple-answer questions.

Background and Methods

Researchers developed a test of 24 text-based questions, including 16 multiple-choice questions, 2 multiple-answer questions, and 6 case-based questions; case-based questions were free-response.

Questions were developed based on the American Board of Dermatology 2021 Certification Sample Test and the “Photoquiz” section of the journal Pediatric Dermatology, and all questions were first processed through ChatGPT's web interface as of October 2023.

Researchers graded case-based questions on a 0 to 5 scale commonly used to evaluate AITs. Reviewers of responses were blinded to respondents' identities.

Findings

A total of 5 pediatric dermatologists, with an average of 5.6 years of clinical experience, completed the questions posed by researchers.

On average, pediatric dermatologists scored 91.4% on multiple-choice and multiple-answer questions, while ChatGPT version 3.5 averaged 76.2%, a significant advantage for the pediatric dermatologists. However, results for ChatGPT version 4.0 were considered comparable, with iteration 4.0 achieving an average score of 90.5%, just 0.9 percentage points lower than that of the clinicians.

On case-based questions, clinicians performed better than ChatGPT v.3.5, with an average score of 3.81 compared with 3.53. However, clinicians' average case-based scores were not significantly greater than those of ChatGPT v.4.0.

Using these findings as a basis, Huang et al developed a differential best practices list of "dos and don'ts" for clinicians.

They recommend that clinicians DO:

Use ChatGPT to brainstorm a differential diagnosis
Provide detailed and relevant information while maintaining patient privacy
Fact check ChatGPT's responses using reputable sources for medical information
Stay updated on legal and institutional policies surrounding the use of AI tools in health care

They recommend that clinicians DO NOT:

Rely on ChatGPT to provide the single, best diagnosis
Succumb to anchoring bias as a result of ChatGPT's responses
Immediately accept ChatGPT's responses as medical facts
Enter HIPAA-protected information into AI tools like ChatGPT that are not HIPAA-compliant

Conclusions

Researchers recommended that dermatology clinicians become more familiar with AITs as their accuracy continues to improve, noting that they may be useful for fact-based questions and case-based materials.

Though these results are promising, they noted that further research is necessary to better understand the role of ChatGPT in clinical knowledge and reasoning.

Limitations of the study, as noted by researchers, include the potential for the reproducibility of the results to change and the possibility that pediatric dermatologists had prior exposure to the questions and cases used in the study.

"While clinicians currently continue to outperform AITs, incremental advancements in the complexity of these AI algorithms for text and image interpretation offer pediatric dermatology clinicians a valuable addition to their toolbox," according to Huang et al. "In the present circumstance, generative AI is a useful tool but should not be relied upon to draw any final conclusions about diagnosis or therapy without appropriate supervision."

References

1. Haug CJ, Drazen JM. Artificial intelligence and machine learning in clinical medicine, 2023. N Engl J Med. 2023;388(13):1201-1208. doi:10.1056/nejmra2302038
2. Huang CY, Zhang E, Caussade MC, Brown T, Stockton Hogrogian G, Yan AC. Pediatric dermatologists versus AI bots: Evaluating the medical knowledge and diagnostic capabilities of ChatGPT. Pediatr Dermatol. May 9, 2024. Accessed May 13, 2024. doi:10.1111/pde.15649
