TY - JOUR
T1 - Transformations for non-destructive evaluation of brix in mango by reflectance spectroscopy and machine learning
AU - Ernesto, Paiva Peredo
AU - Diego, Gonzales Rodriguez
AU - Herrera, William Trujillo
AU - Quijaite, Juan Jesús Soria
AU - Diana, Quispe Arpasi
AU - Paulino, Christian Ovalle
N1 - Publisher Copyright:
© 2024 Institute of Advanced Engineering and Science. All rights reserved.
PY - 2024/2
Y1 - 2024/2
N2 - Mango is a very popular climacteric fruit in America and Europe. Among the internal properties of the mango, total soluble solids (TSS) are an adequate indicator to estimate the quality of mango, however, the measurement of this indicator requires destructive tests. Several research have addressed similar issues; they have made use of pre-processing transformations without making it clear which of them is statistically better. Here, we created a new spectral database to build machine learning (ML) models. We analyzed a total of 18 principal component regression (PCR) models and 18 partial least squared regression (PLSR) models, where 4 types of transformations, 3 different feature extractors, and 3 different pre-processing techniques are combined. The research proposes a double cross validation (CV) both to determine the optimal number of components and to obtain the final metrics. The best model had a root mean square error (RMSE) of 1.1382 °Brix and a RMSE on the transformed scale of 0.5140. The best model used 4 components, used y2 transformation, reflectance R as the independent variable and MSC as a preprocessing technique.
AB - Mango is a very popular climacteric fruit in America and Europe. Among the internal properties of the mango, total soluble solids (TSS) are an adequate indicator to estimate the quality of mango, however, the measurement of this indicator requires destructive tests. Several research have addressed similar issues; they have made use of pre-processing transformations without making it clear which of them is statistically better. Here, we created a new spectral database to build machine learning (ML) models. We analyzed a total of 18 principal component regression (PCR) models and 18 partial least squared regression (PLSR) models, where 4 types of transformations, 3 different feature extractors, and 3 different pre-processing techniques are combined. The research proposes a double cross validation (CV) both to determine the optimal number of components and to obtain the final metrics. The best model had a root mean square error (RMSE) of 1.1382 °Brix and a RMSE on the transformed scale of 0.5140. The best model used 4 components, used y2 transformation, reflectance R as the independent variable and MSC as a preprocessing technique.
KW - Brix Machine learning Partial least squares Principal component analysis Spectroscopy
UR - http://www.scopus.com/inward/record.url?scp=85183890715&partnerID=8YFLogxK
U2 - 10.11591/ijece.v14i1.pp532-546
DO - 10.11591/ijece.v14i1.pp532-546
M3 - Article
AN - SCOPUS:85183890715
SN - 2088-8708
VL - 14
SP - 532
EP - 546
JO - International Journal of Electrical and Computer Engineering
JF - International Journal of Electrical and Computer Engineering
IS - 1
ER -