home    about    browse    search    latest    help 
Login | Create Account

Improved classification of fused data: Synergetic effect of partial least squares discriminant analysis (PLS-DA) and common components and specific weights analysis (CCSWA) combination as applied to tomato profiles (NMR, IR and IRMS)

Monakhova, Yulia B.; Hohmann, Monika; Christoph, Norbert; Wachter, Helmut and Rutledge, Douglas N. (2016) Improved classification of fused data: Synergetic effect of partial least squares discriminant analysis (PLS-DA) and common components and specific weights analysis (CCSWA) combination as applied to tomato profiles (NMR, IR and IRMS). Chemometrics and Intelligent Laboratory Systems, pp. 1-6.

Full text not available from this repository.

Document available online at: https://hal.archives-ouvertes.fr/hal-01532592


Summary in the original language of the document

Discriminant analysis (DA) methods are well-known chemometric approaches for solving classification problems in chemistry. Recently, specific multiblock methods, such as common components and specific weights analysis (CCSWA), have been developed which make it possible to enhance the quality of the classification models, by combining data from different analytical platforms. In this study we propose a new data fusion methodology PLS-DA-CCSWA, which combines the discriminant power of the PLS-DA method with the capability of CCSWA to extract the maximum of useful information from the different data blocks. A large dataset (n = 112) of H-1 NMR, infrared and isotope ratio mass spectral profiles of authentic tomato samples was analyzed to demonstrate the principle. The classification model developed was used to predict the tomato production type (organic or conventional). The application of the new method resulted in improved classification performance for test set samples according to the Wilks' lambda test. Moreover, a clear decrease in the standard deviations of the predicted Y-values was observed going from 024 to 0.18 on average between the classical CCSWA and the PLS-DA-CCSWA, respectively. The procedure to determine the number of common components and the number of latent variables is discussed. The PLS-DA-CCSWA method is shown to be preferable to separate PLS-DA and CCSWA approaches for classification based on fused spectroscopic measurements.


EPrint Type:Journal paper
Keywords:Data fusion (en), Common components and specific weights analysis (en), Partial least squares-discriminant analysis (en), Organic products (en), chemometrics (en), organically grown products (en), chimiométrie (fr), classification (fr), produit biologique (fr)
Subjects:"Organics" in general
Research affiliation: France > INRAe - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
ISSN:ISSN: 0169-7439
DOI:10.1016/j.chemolab.2016.05.006
Project ID:HAL-INRAe
Deposited By: PENVERN, Servane
ID Code:41482
Deposited On:12 Aug 2021 10:37
Last Modified:12 Aug 2021 10:37
Document Language:English

Repository Staff Only: item control page