Multivariate and machine learning approaches for honey botanical origin authentication using near infrared spectroscopy
- 15 January 2019
- journal article
- conference paper
- Published by SAGE Publications in Journal of Near Infrared Spectroscopy
- Vol. 27 (1), 65-74
- https://doi.org/10.1177/0967033518824765
Abstract
In this work the feasibility of near infrared spectroscopy was evaluated combined with chemometric approaches, as a tool for the botanical origin prediction of 119 honey samples. Four varieties related to polyfloral, acacia, chestnut, and linden were first characterized by their physical–chemical parameters and then analyzed in triplicate using a near infrared spectrophotometer equipped with an optical path gold reflector. Three different classifiers were built on distinct multivariate and machine learning approaches for honey botanical classification. A partial least squares discriminant analysis was used as a first approach to build a predictive model for honey classification. Spectra pretreatments named autoscale, standard normal variate, detrending, first derivative, and smoothing were applied for the reduction of scattering related to the presence of particle size, like glucose crystals. The values of the descriptive statistics of the partial least squares discriminant analysis model allowed a sufficient floral group prediction for the acacia and polyfloral honeys but not in the cases of chestnut and linden. The second classifier, based on a support vector machine, allowed a better classification of acacia and polyfloral and also achieved the classification of chestnut. The linden samples instead remained unclassified. A further investigation, aimed to improve the botanical discrimination, exploited a feature selection algorithm named Boruta, which assigned a pool of 39 informative averaged near infrared spectral variables on which a canonical discriminant analysis was assessed. The canonical discriminant analysis accounted a better separation of samples according to the botanical origin than the partial least squares discriminant analysis. The approach used has permitted to achieve a complete authentication of the acacia honeys but not a precise segregation of polyfloral ones. The comparison between the variables important in projection and the Boruta pool showed that the informative wavelengths are partially shared especially in the middle and far band of the near infrared spectral range.Keywords
Funding Information
- Università degli Studi di Padova (CPTA158894/15)
- Fondazione Cariverona (call 2016-SAFIL project)
This publication has 30 references indexed in Scilit:
- A fast chemometric procedure based on NIR data for authentication of honey with protected geographical indicationFood Chemistry, 2013
- Qualitative and Quantitative Control of Honeys Using NMR Spectroscopy and ChemometricsInternational Scholarly Research Notices, 2013
- Classification of Chinese honeys according to their floral origin by near infrared spectroscopyFood Chemistry, 2012
- Variable selection in regression—a tutorialJournal of Chemometrics, 2010
- Review of the most common pre-processing techniques for near-infrared spectraTrAC Trends in Analytical Chemistry, 2009
- Geographical Classification of Honey Samples by Near-Infrared Spectroscopy: A Feasibility StudyJournal of Agricultural and Food Chemistry, 2007
- Authentication of the Botanical Origin of Honey by Near-Infrared SpectroscopyJournal of Agricultural and Food Chemistry, 2006
- Assessing the performance of statistical validation tools for megavariate metabolomics dataMetabolomics, 2006
- Classification of monofloral honeys based on their quality control dataFood Chemistry, 2004
- Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance SpectraApplied Spectroscopy, 1989