Fig. 1: Data collection for the training and validation sets, description of the cancer survey, overview of the predictive model construction, and disease risk scores.

a Cumulative data collection from May 2016 to September 2016. Participants recruited between May and September 2016 were included in the training set. The gray line is the total number of participants from European ancestry. BCC, SCC, and melanoma prediction models were trained on participants 30–90 years old (black line). b Number of skin cancer cases in the training set. c Overview of prediction model construction in the training set. d Disease risk scores and predictive performances evaluated in the validation set.