Fig. 3

BCN20000 Dataset Preparation Pipeline. The process begins with the collection of images and metadata. A neural network is then employed to classify and separate between the image types. Patient identifiers are extracted from ‘Sticker pictures’ using a YOLOv3 network. Dermatoscopy’s diagnosis are revised by multiple reviewers for quality assurance. The resultant BCN20000 dataset is composed exclusively of dermoscopic images and metadata, divided into training and testing sets.