Fig. 3: Performance evaluation of PD classifiers from speech on external test sets.

a and c respectively demonstrate the AUROC curve and the confusion matrix of our best performing novel fusion model when tested on the dataset collected from PD Care Facility. In contrast, b and d give such visualizations when the model was tested on the participating cohort from Clinical Setup.