Extended Data Fig. 1: Distribution of case vignettes across medical specialties and source datasets.
From: An evaluation framework for clinical use of large language models in patient interaction tasks

(a) CRAFT-MD evaluation dataset, showing the distribution of case vignettes across 12 medical specialties: Dermatology, Hematology and Oncology, Neurology, Gastroenterology, Pediatrics and Neonatology, Cardiology, Infectious Disease, Obstetrics and Gynecology, Urology and Nephrology, Endocrinology, Rheumatology, and Others. (b) Inset pie chart showing the proportion of case vignettes by source of curation (MedQA-USMLE, Derm-Public and Derm-Private). (c) MELD analysis showing the Levenshtein distance between the original and GPT-4-completed case vignettes.
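
For readers unfamiliar with the metric in panel (c), the Levenshtein distance between two strings can be sketched as below. This is a minimal illustration, not the authors' MELD implementation: the function names `levenshtein` and `similarity_percent` are hypothetical, and normalizing by the length of the longer string is an assumption that may differ from the normalization used in the paper.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # iterate rows over the longer string so each row stays short

    # previous[j] holds the distance between the first i-1 chars of a
    # and the first j chars of b; row 0 is the cost of j insertions.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string is i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


def similarity_percent(original: str, completed: str) -> float:
    """Hypothetical helper: 100 means identical text, 0 means nothing shared.
    Assumption: normalize by the length of the longer string."""
    longest = max(len(original), len(completed))
    if longest == 0:
        return 100.0
    return 100.0 * (1 - levenshtein(original, completed) / longest)
```

Under this normalization, a model completion that closely reproduces the original vignette text yields a small distance and a similarity near 100, which is the pattern a memorization check of this kind looks for.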