TY - JOUR
T1 - Soft classification of single samples based on multi-analyte spectra
AU - Cheung, Nai Ho
N1 - Funding Information:
The author thanks Michael Huang and Timothy Yeung for useful comments and Brent Ho for inputs on the y to z projection. This study was funded by the Research Grant Council of Hong Kong under grant number HKBU200513 and the Faculty Research Grant of Hong Kong Baptist University.
PY - 2019/12
Y1 - 2019/12
N2 - Chemical fingerprinting based on multi-analyte spectra can be very powerful. An example is the analysis and classification of forensic documents and artworks by means of the plume fluorescence spectra of the pigments. For that purpose, we borrow the concept of Gaussian naïve Bayes classifier and develop a soft classification scheme to estimate the class membership probabilities. It is based on the similarity of the unknown sample to the training observations, as measured by the radial basis function kernel in the class space of orthogonal partial-least-squares discriminant analysis. We apply the scheme to the classification of plume fluorescence spectra of chinese red seal inks. We compare its performance against that of a conventional hard classification scheme. Our scheme gives 98.9% sensitivity, 99.8% specificity, and zero false in-class rate; all better than those of the hard scheme. More importantly, our scheme reports class membership probabilities for each and every test sample. This is especially useful for sorting single samples. For example, we show that samples with assigned probabilities higher than 80% are sorted correctly 99.5% of the time. Their classification is therefore highly reliable. For samples with assigned probabilities below 80%, the correct sorting rate is only 82%. But these cases are few, less than 4% of the samples, and their relatively low membership probabilities still serve as flags for further sampling of the specimen.
AB - Chemical fingerprinting based on multi-analyte spectra can be very powerful. An example is the analysis and classification of forensic documents and artworks by means of the plume fluorescence spectra of the pigments. For that purpose, we borrow the concept of Gaussian naïve Bayes classifier and develop a soft classification scheme to estimate the class membership probabilities. It is based on the similarity of the unknown sample to the training observations, as measured by the radial basis function kernel in the class space of orthogonal partial-least-squares discriminant analysis. We apply the scheme to the classification of plume fluorescence spectra of chinese red seal inks. We compare its performance against that of a conventional hard classification scheme. Our scheme gives 98.9% sensitivity, 99.8% specificity, and zero false in-class rate; all better than those of the hard scheme. More importantly, our scheme reports class membership probabilities for each and every test sample. This is especially useful for sorting single samples. For example, we show that samples with assigned probabilities higher than 80% are sorted correctly 99.5% of the time. Their classification is therefore highly reliable. For samples with assigned probabilities below 80%, the correct sorting rate is only 82%. But these cases are few, less than 4% of the samples, and their relatively low membership probabilities still serve as flags for further sampling of the specimen.
UR - http://www.scopus.com/inward/record.url?scp=85075810271&partnerID=8YFLogxK
U2 - 10.1039/c9ja00292h
DO - 10.1039/c9ja00292h
M3 - Journal article
AN - SCOPUS:85075810271
SN - 0267-9477
VL - 34
SP - 2370
EP - 2377
JO - Journal of Analytical Atomic Spectrometry
JF - Journal of Analytical Atomic Spectrometry
IS - 12
ER -