Table 8 Definition of variables included in the lung segmentation metadata file.

From: POLCOVID: a multicenter multiclass chest X-ray database (Poland, 2020–2021)

Variable Name

Definition

source

Name of dataset

source_id

Dataset abbreviation: “POLCOVID” for the POLCOVID dataset; “NIH” for National Institute of Health – Clinical Center12; “SHENZHEN” for Shenzhen No.3 Hospital, Shenzhen, China13; “DHHS” for Department of Health and Human Services of Montgomery County, USA13; “GUANGZHOU” for Guangzhou Women and Children’s Medical Center, Guangzhou, China14.

filename

Anonymized unique file name: for POLCOVID Anonymus_<hospital_id>_<patient_id>_<class_id>.<file_format>; for the remaining datasets the name of the file given by the data provider.

set

Set to which the image was included during the generation of the lung segmentation model: “train” – training set, “validation” – validation set, “hold-out test” – testing set.