Fig. 6: Unsupervised salmon origin differentiation based on different data fusion strategy, and Supervised learning parameter optimisation based on mid-level data fusion strategy. | Nature Communications

Fig. 6: Unsupervised salmon origin differentiation based on different data fusion strategy, and Supervised learning parameter optimisation based on mid-level data fusion strategy.

From: Data fusion and multivariate analysis for food authenticity analysis

Fig. 6

a Low-level data fusion, using min-max normalisation, PCA score plot of 5 salmon groups with data min-max normalisation. b Mid-level data fusion PCA score plot of 5 salmon groups. c ICP-MS principal compound accumulated explained variance plot. d REIMS principal compound accumulated explained variance plot. e The k value evaluation of k-NN model based on mid-level data fusion, k values between 1 and 20 were tested to find the optimal parameter of the k-NN classifier using different sub-datasets in this study. The optimal k for the k-NN classifier was chosen as k = 5. f Plot cumulative R2 and Q2 per component for the PLS-DA model based on mid-level data fusion. Components 1–50 were computed for parameter optimisation, and 25 was determined to be the optimal component number. g Number of predictors of RF classifier influenced the correct classification rate, npredic 1–200 were tested for five groups to find the best parameters for the RF classifier. npredic = 15 was found to be the best value for RF classifiers, based on mid-level data fusion. h RF classifier correct classification rate was influenced by the number of trees, Ntree = 500 was found to be the best value for RF classifiers, based on mid-level data fusion.

Back to article page