Fig. 3: Examples of augmented images that were continuously generated during the training process.

More than four million augmented images were used to train the vision transformer-based model over 250 epochs.
More than four million augmented images were used to train the vision transformer-based model over 250 epochs.