Introduction

At the dawn of the twenty-first century, the agri-food sector is facing major challenges: first, providing the world's population with enough to eat (Food Security)1 and, second, ensuring that this food is safe to eat (Food Safety)1, while keeping the production process within environmental constraints. These objectives have to be met in a context of tremendous technological change, growing scarcity of natural resources, and continuously evolving consumer lifestyles and consumption habits across the globe1,2. The food industry is obliged to operate under seemingly contradictory expectations, i.e. consumers prefer foods that are (i) convenient and fresh (minimally processed and packaged); (ii) all "natural", with no preservatives; (iii) healthy and without adverse health effects (i.e., low in fat, salt, and sugar); and (iv) produced in an environmentally sustainable manner.

Regarding these issues, the Joint Research Centre (JRC) Science for Policy report3 investigated four scenarios for identifying future challenges in the global food system and indicated the need for increased reliance on Information and Communications Technologies (ICT) to ensure traceability in the food chain, as well as the associated risks of temporary system failure, fraud and terrorism.

To address this need, smart sensors have been designed to bridge the gap between available food information and consumers' needs. Similarly, the importance of ICT has been recognized as a means to enhance operational efficiency and productivity in the agricultural sector and food industry, in the context of the Implementation Action Plan proposed by the European Technology Platforms (ETPs), which are industry-led stakeholder fora recognized by the European Commission as key actors in driving innovation, knowledge transfer and European competitiveness4. The use of sensors is of vital importance in the food industry; their potential for taking non-invasive measurements on-, in- or at-line, without destroying the food product, is a prerequisite for the food industry of the future5.

Nowadays, many different sensors (e.g. NIR, FT-IR, Raman, multispectral or hyperspectral imaging, i.e. surface chemistry sensors) are employed by the food sector to evaluate freshness, microbial quality, adulteration, food origin, etc.6,7,8,9,10. However, due to the complexity of these measurements, data analytics (DA) should be considered an essential step in providing solid and valid information to stakeholders5,11,12 with regard to the quality characteristics mentioned above. Indeed, such measurements in tandem with DA have been found to tackle basic issues regarding the implementation of rapid methods in the food sector. Although a number of studies have shown that the combination of sensors and DA12 can provide accurate information regarding the freshness, safety and quality integrity of specific products, a clear limitation remains: these approaches cannot discern the product per se. A measurement from fish cannot be recognized as such and, if fed to a model built for meat products, it will fail to provide the correct answer. In other words, a region of the spectrum should be used as a key that characterizes the identity of the product. Such key region(s) are essential, as they allow the 'classification' of the food system, i.e. the 'discrimination' of different food commodities, prior to the evaluation of the requested parameters such as the quality status of the product. Intuitively, this can be thought of as a 'coding' region of the data, indicating the type of food and driving the selection of the suitable analysis pipeline from a library of pipelines resident on a local database or on the cloud.

At the same time, such a capability can facilitate the life of disabled persons (e.g., the blind) or serve the needs of a huge futuristic megastore that imports food products from different producers and distributes them to consumers using advanced communication technologies, i.e. the Internet of Foods. The basic idea underlying the aforementioned "discrimination of different food commodities" is that the spectrum acquired from a sensor for a specific raw material will exhibit some unique features/properties compared with the spectra of other food systems. This has to hold regardless of any inherent variation of the raw material itself arising from other sources, e.g. storage conditions, storage time, animal diet, etc.

Another possible application of the present work is food recognition as a step towards a fully automated food quality assessment system in an industry handling several raw food types. Such a system would serve as a submodule for switching to, and enabling, the execution of the appropriate food-type-specific algorithm/pipeline. Throughout the food science literature, it is apparent that the majority of algorithms and approaches developed for food quality assessment differ substantially between food types, in terms of spectra type, preprocessing (data normalization, filtering, etc.) and regression and/or classification approaches12. Thus, there is a need for automated selection, i.e. recognition of the food type directly from the spectra, so that the raw food can be redirected to the most suitable quality assessment approach. This need has in fact emerged from a running EU project in which our research group participates, named PhasmaFOOD13.
Finally, and perhaps the most promising application of the presented system, is the sorting of raw food materials in the food industry to prepare the production chain for the manufacturing of mixed goods and recipes. One of the most advanced AI applications in the food industry is TOMRA Sorting Food14, which uses sensor-based optical sorting solutions with machine learning functionalities. Herein, we reach a performance of 100% accuracy without the use of AI, relying instead on "traditional" machine learning methods which, in addition to high accuracy, also provide interpretable results.

At this point it should be underlined that the presented methodology, although closely related, has no application (at least in its current form) in adulteration detection. The datasets used do not come from adulteration experimental designs and contain only pure samples, not mixed adulterated ones. Throughout the literature there is considerable research15,16 on this subject with several instruments, including FT-IR. All the approaches for adulteration detection use data from both pure and mixed samples, and the corresponding chemometric methods are designed to identify adulteration by taking into account the information from adulterated samples. Intuitively, the algorithms detect adulteration by considering the variations/alterations of the spectra as adulteration occurs at several levels, with the spectrum of each sample corresponding to the whole surface of the sample. Thus, a dataset of only pure samples cannot support the development of a system for adulteration detection purposes. In this context, the purpose of this study was to investigate the potential use of data mining and data analysis on data acquired by a Fourier-transform infrared (FT-IR) spectroscopy sensor, a non-destructive/non-invasive instrument, to 'identify' food commodities from a single signal (spectrum) only.

Results and discussion

The developed workflow for raw food material recognition employing FT-IR spectra is implemented in Python 2.7. A brief outline is provided here; more details can be found in the Methods section. First, the raw sensor data are passed through a preprocessing and normalization step, Standard Normal Variate17 (SNV), specifically its robust version, RNV18. This step is crucial to enhance the quality of the data, reduce correlated information across the different wavelengths/wavenumbers, and eliminate the inherent multiplicative noise. Afterwards, a supervised dimensionality reduction is employed, based on Partial Least Squares Regression (PLSR)19. A supervised dimensionality reduction scheme was selected to guide the system towards a dimension estimation focused on the "target" of raw food classification rather than on irrelevant sample properties such as batch of origin, sample storage conditions, etc. PLSR can be thought of as a feature engineering method for the subsequent SVM classification. Finally, the classification model for the 7 raw food material types was built using an SVM classifier20, producing the final classification model. Figure 1 presents the data (training set) in the space of the first three principal components. The data were transformed via Principal Component Analysis after the normalization and feature selection by the PLS regression scheme described in the Methods section. In addition, the interested reader can also refer to SI1 for the PCA and PLS plots of the data prior to the feature selection step.
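For orientation, the sketch below traces these three steps (robust SNV normalization, PLS projection to 41 components, linear SVM) with scikit-learn. It is a minimal illustration only: the array names, the placeholder data and the per-spectrum reading of the RNV transform are assumptions, not the released scripts.

```python
import numpy as np
from sklearn.cross_decomposition import PLSRegression
from sklearn.svm import SVC

def rnv(spectra):
    """Robust SNV: centre each spectrum by its median and scale by its MAD."""
    med = np.median(spectra, axis=1, keepdims=True)
    mad = np.median(np.abs(spectra - med), axis=1, keepdims=True)
    return (spectra - med) / mad

# Placeholder data with the dimensions reported in Methods (replace with real spectra).
rng = np.random.RandomState(0)
X_train, y_train = rng.rand(1815, 1700), rng.randint(0, 7, 1815)
X_test = rng.rand(240, 1700)

# Step 1: normalization; step 2: supervised dimensionality reduction to 41 PLS components.
X_train_n = rnv(X_train)
pls = PLSRegression(n_components=41).fit(X_train_n, y_train)
T_train = pls.transform(X_train_n)

# Step 3: SVM classifier on the PLS scores (kernel and C as selected by the grid search).
clf = SVC(kernel='linear', C=100, probability=True).fit(T_train, y_train)

# New spectra follow the same path: normalize, project, classify.
y_pred = clf.predict(pls.transform(rnv(X_test)))
```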

Figure 1

PCA plot of the first three principal components of the normalized data after feature selection via PLS regression (the 41-dimensional training dataset); (A) PC1–PC2 plot, (B) PC1–PC3 plot, (C) PC2–PC3 plot and (D) 3-D PCA plot.

The developed classification model (please refer to Methods) results in an accuracy of 100%, with a lower 95% confidence interval bound at 98.5% accuracy (p < 0.0001). The accuracy corresponds to the prediction of the correct food type for all 240 independent test samples (Table 1 summarizes the confusion matrix). The significance of these results is enhanced if one takes into account the properties of the samples comprising the data. As discussed in the Materials section, the samples within each individual food subtype are highly variable (please refer to the SI1 figures showing the mean spectrum and the standard deviation of the samples for each food type). Samples originate from several different batches (which implies inherent variance due to different sampling). Furthermore, the samples were exposed to different storage conditions in terms of time, temperature and packaging (aerobic or modified atmosphere packaging, MAP). Table 2 presents an overview of the origin of the samples, their type and the experimental setup for which they were acquired. It should be mentioned that, as far as the storage conditions are concerned, the variance among the samples depends strongly on the storage temperature21,22,23. This is due to the different types of microorganisms24 (mesophiles or psychrotrophs) that can populate and predominate on the food samples, resulting in a large variety of byproducts and thus in different chemical compositions of the surface. It is therefore apparent that the acquired spectra for the same food type exhibit variations originating from the microorganisms' byproducts, i.e. the microenvironment they create on the food. Moreover, storage time is another significant source of variance in the data, mainly due to the different levels of byproduct abundance21 and, in general, the physicochemical alterations of the samples (dehydration, oxidation, etc.). Finally, another important source of variability in the acquired spectra of food of the same type is the packaging. Packaging plays a major role in microbial growth on the samples21, since aerobic and facultative anaerobic microorganisms prevail over one another depending on their surroundings25. In conclusion, taking into account the different initial microbiological loads along with all the aforementioned parameters (temperature, storage time and type of packaging) that affect the surface chemical composition of the samples, the acquired spectra exhibit large variations even within the same food type (please refer to SI1 for the standard deviation of spectra of the same food type).

Table 1 Confusion matrix presenting the classification results.
Table 2 Brief description of the used samples from the specific references.

Another argument supporting the above statements regarding the variance introduced by the microorganisms' contribution is that FTIR is used as a "gold standard" method for spoilage detection and estimation26,27,28; all the studies in this field on spoilage estimation and prediction rely on exactly these spectral variations. As described, the data used for training and testing the classifier show diversity even within the same class. In general, it is well known in data science that having diverse samples (such as the spectra in our case) within each class drives the classification system towards a lower generalization error. The final trained model becomes more general by virtue of being trained on more examples, incorporating the diversity inherent in each class. Intuitively, a dataset of very similar spectra would increase the generalization error much as a very small dataset would, since most of the measurements would essentially be repetitions of the same one. The data used for testing were extracted from the original pool of data with a random generator so as to be as unbiased as possible. It is therefore apparent that the data used in this research, and the way they were handled, enhance the efficiency of the developed model and thus the significance of the outcome.

From all the above it can be concluded that the developed classifier, apart from achieving ideal classification scores (accuracy = 1, F1-score = 1, sensitivity = 1, specificity = 1, precision = 1, MCC = 1, informedness = 1, markedness = 1), is also independent of the sample storage conditions in terms of time, temperature and packaging (please refer to Table SI1 for per-class statistics).

In order to further evaluate and explain the efficacy of the developed classifier, the classification probability statistics per class were calculated and are presented in Table 3. The approach for the computation of these probabilities is elaborated here. The prediction function of the SVM implementation in the scikit-learn library gives per-class scores for each sample. When the classifier's constructor option probability is set to True, class membership probability estimates (via the method predict_proba) are enabled. In the binary case, the probabilities are calibrated using Platt scaling, whereas in the multiclass case (as herein) the approach is extended as per Wu et al. (2004)29. Briefly, given the observation x and the class label y, Wu et al. (2004)29 assume that estimates rij of the pairwise class probabilities µij = P(y = i|y = i or j, x) are available. From the ith and jth classes of a training set, a model is obtained which, for any new x, calculates rij as an approximation of µij. Then, using all rij, the goal is to estimate pi = P(y = i|x), i = 1,…, k. In their work, a method for obtaining probability estimates via an approximate solution to an identity was proposed, the existence of a solution being guaranteed by the theory of finite Markov chains.
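For reference, the corresponding scikit-learn calls are sketched below; the array shapes and names are placeholders, while the probability option and predict_proba are the library interface described above.

```python
import numpy as np
from sklearn.svm import SVC

# Stand-ins for the 41-dimensional PLS scores and the 7-class labels.
rng = np.random.RandomState(0)
T_train, y_train = rng.rand(200, 41), rng.randint(0, 7, 200)
T_test = rng.rand(20, 41)

# probability=True enables Platt scaling for each class pair and the Wu et al. (2004)
# pairwise coupling for the multiclass case, fitted via an internal cross-validation.
clf = SVC(kernel='linear', C=100, probability=True).fit(T_train, y_train)

proba = clf.predict_proba(T_test)   # shape (20, 7); each row sums to 1
print(proba.max(axis=1))            # confidence assigned to the winning class per sample
# Note: the argmax of predict_proba can occasionally disagree with clf.predict,
# a documented side effect of the probability calibration.
```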

Table 3 Classification probabilities statistics per class.

For all classes the mean probability of the classified samples is high, greater than 0.84 (with a maximum standard deviation of 0.17 in the case of chicken), while the median value is > 0.91. Figure 2 shows the mean predicted probabilities for each class and the corresponding standard deviations. These high values corroborate the achieved performance, further supported by the high values of the minimum probabilities. In the case of chicken samples, the low minimum probability (0.46) corresponds to just one sample, the next lowest value being 0.67; these two values explain the relatively large standard deviation (0.17) of the probabilities. For the chicken samples, the second-highest class probability is assigned to the pork class (0.40), showing that pork and chicken are closely related in terms of the physicochemical properties captured by FT-IR. It should be mentioned that the pork/chicken combination is a common food fraud approach and considerable research is being conducted on it30,31,32. The mean posterior probability for predicting chicken is the lowest among all classes, as can be seen from the posterior probabilities provided in Table SI2, with the next highest probabilities assigned to the pork and beef classes, respectively (in all cases the highest probability is assigned to the chicken class). However, it must be underlined that in no case was the probability of the chicken class lower than that of any other class, so no mismatch occurred, as also supported by the results. Moreover, in the case of rocket and spinach classification, two types of food with high resemblance (please also refer to SI1), it can be observed that low probability values for one class result in higher values for the other, as expected. For instance, when the probability of a spinach sample being spinach is 0.67, its probability of being rocket is 0.33; for a rocket sample, the probability of being rocket is 0.85, while that of being spinach is 0.14 (please refer to Table SI1 for all per-class probabilities).

Figure 2

Mean class probabilities for the predictions for each class and the corresponding standard deviations.

In addition to the aforementioned results on the generalization and efficiency of the proposed pipeline and the developed classifier, the significance of the feature selection step, in tandem with the development of dedicated sensors, should be highlighted. As mentioned in the Methods section, the 41 selected wavenumbers proved the most suitable for classifying the 7 food types used here. Results such as those presented here, and others throughout the literature, can drive sensor manufacturers towards building dedicated sensors for specific applications, with lower cost and size, that can perform optimally.

Conclusions

Herein, a uniform and global pipeline for the analysis of FT-IR spectroscopic data towards raw food recognition has been developed and validated. As shown, the proposed workflow performed ideally on sample data exhibiting extensive variability in terms of batch, storage time, temperature, spoilage level and packaging. This variability is reflected in the acquired spectra and, given the flawless performance, demonstrates the robustness of the method. Furthermore, it can be safely concluded that FT-IR, being a "gold standard" approach for several food safety applications and research areas, provides information-rich data that allow the efficient monitoring of different properties of food samples such as spoilage, quality attributes (e.g. moisture), shelf life and raw food type (as shown herein). Apart from the efficiency shown in terms of food type classification, another critical property of the method is that its performance is invariant to the food storage conditions (temperature, storage time, packaging), which makes it suitable for broad application in the detection of the raw food type. A significant property of the presented approach is that, in theory, it should also be efficient on different parts of a specific animal (e.g. chicken breast or wing, different cuts of beef, etc.). This is supported by the presented results, especially in the case of minced pork and/or beef, where a variety of animal parts has been used. Based on the results, it can be concluded that the proposed workflow and the resulting classifier were able to discriminate 7 different raw food types (beef, pork, chicken, fish, rocket salad, spinach and pineapple) with an accuracy of 100%, with the data used in the training and testing phases exhibiting large variability in terms of batch origin and storage conditions (temperature, packaging and storage time). This makes the classifier robust and insensitive to random variations within the same food type. Thus, the resulting classifier, as judged by its performance on a large external validation dataset, could be employed on real-world samples towards the automation of the digital food industry (e.g., food sensing devices and applications). Further research also includes experimentation with alternative surface chemistry sensors, such as multispectral/hyperspectral imaging, that will allow us to expand towards applications such as food adulteration detection using the additional spatial information of the food samples.

Methods

Methodology

First, and prior to the supervised dimensionality reduction via Partial Least Squares (PLS) regression, the Standard Normal Variate (SNV) normalization scheme17, specifically its robust version RNV18, was employed to normalize the acquired spectra S, according to:

$$ {s_{i}^{snv}} = \frac{{{s_{i}} - median\left( S \right)}}{mad\left( S \right)} $$
(1)

where \(s_{i}\) is the ith spectrum and \(s_{i}^{snv}\) the ith normalized spectrum. MAD stands for Median Absolute Deviation (mad)33, a robust variability metric of a univariate sample of quantitative data \(s_{1}, s_{2}, \ldots, s_{n}\). The MAD is computed as:

$$ mad\left( S \right) = median\left( {\left| {s_{i} - median\left( S \right)} \right|} \right) $$
(2)

The above normalization scheme is used for data quality enhancement, reduction of the correlated information along the wavelengths of the spectra and elimination of the multiplicative noise inherent in the acquisition process, in order to improve the downstream analysis. The same data normalization scheme has been used in previous work by our laboratory34.
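For concreteness, a small numerical illustration of Eqs. (1) and (2) is given below, reading S as the set of absorbance values of the spectrum being normalized (i.e. a per-spectrum transform); the toy numbers are arbitrary.

```python
import numpy as np

def rnv(spectrum):
    """Robust SNV of one spectrum: centre by its median, scale by its MAD (Eqs. 1-2)."""
    med = np.median(spectrum)
    mad = np.median(np.abs(spectrum - med))
    return (spectrum - med) / mad

s = np.array([0.12, 0.15, 0.11, 0.40, 0.14])   # toy absorbance values
# median(s) = 0.14, MAD = median(|s - 0.14|) = 0.02
print(rnv(s))                                  # approx. [-1.0, 0.5, -1.5, 13.0, 0.0]
```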

The normalized data/spectra are then passed to the next processing step, i.e. dimensionality reduction, and more specifically PLS-based supervised dimensionality reduction. In general, PLS regression19 is used for finding relationships between two data matrices, X and Y, via a linear multivariate model. It is a popular method with extensive applications in numerous scientific fields and is well suited to applications where the number of variables (wavelengths/wavenumbers in our case) exceeds the number of samples. PLS resembles Principal Component Analysis35 (PCA) in many ways, as it transforms and maps a set of possibly correlated observations to a set of linearly uncorrelated values called components, and to the space those components define. A subset of these components can then be used as regression coefficients for the data to accomplish dimensionality reduction. In contrast to PCA, PLS takes into account the latent structure not only of the independent variables (predictors) but of the dependent variables (responses) as well. This is done by using a training set to find the direction in the predictors' space along which the maximum multidimensional variance is explained in the space of the responses. In this way, one can predict the response of new data based only on the predictors, using a model created in a supervised manner.

The number of components to "hold" in a PLS regression19 model is critical, since a wrong choice can lead to "over-fitting", i.e. a model with high accuracy on the training dataset but with little to no predictive power on new, independent samples. To overcome this issue, some kind of model validation technique is usually used in order to assess how the resulting model will generalize to new, unseen data, the most popular being cross-validation. During cross-validation the training data is split into training and validation sets based on a user-defined ratio. Then, recursively, several models are created and trained using the training set, each new one using one more component than the last. Each model is then evaluated on the validation set based on the mean square error (MSE) of the predicted versus the actual values (Fig. 3a). Intuitively, the smaller the MSE, the better the model will perform in predicting the response variable for new observations. Prediction error estimation with "unseen" data, i.e. data that have not been used to train the model, helps to avoid overfitting and also provides a fair comparison of different regressions with varying numbers of covariates36. In this way, the minimum number of components that best describe the data is included in the model. Herein, tenfold cross-validation was performed and the maximum number of components was set to 100. The MSE metric for the calibration keeps improving as more components are added to the model (data not shown); on the other hand, the more complex the model, the more biased towards overfitting it becomes. Thus, the cross-validation values of the MSE are important for selecting the simplest model with the best performance. Figure 3a presents exactly those values, exhibiting a minimum of the MSE at 41 components. When more components are added, the cross-validation MSE actually gets worse, meaning that the model starts to overfit and loses its generalization ability.
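A sketch of this component-selection loop is shown below. The tenfold cross-validation and the 1–100 component range follow the text; the placeholder data dimensions are reduced so the snippet runs quickly (the actual matrices are 1815 spectra × 1,700 wavenumbers).

```python
import numpy as np
from sklearn.cross_decomposition import PLSRegression
from sklearn.model_selection import cross_val_predict
from sklearn.metrics import mean_squared_error

# Small stand-in for the normalized training spectra and their numeric class labels.
rng = np.random.RandomState(0)
X, y = rng.rand(200, 300), rng.randint(0, 7, 200)

cv_mse = []
for n_comp in range(1, 101):                       # 1 to 100 components, as in the text
    pls = PLSRegression(n_components=n_comp)
    y_cv = cross_val_predict(pls, X, y, cv=10)     # tenfold cross-validated predictions
    cv_mse.append(mean_squared_error(y, y_cv))

best = int(np.argmin(cv_mse)) + 1                  # 41 for the FT-IR data of this study
print("optimal number of PLS components: %d" % best)
```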

Figure 3

Supervised PLS dimensionality reduction overview: (a) mean square error vs. number of components (minimum MSE @ 41 components) across tenfold cross-validation, (b) sample spectra for each class type, (c) weights from PLS for each coefficient, i.e. wavelength.

Next in the classification pipeline is Support Vector Machine (SVM) based modeling, using the data transformed into the PLS space at the selected number of components, i.e. the selected problem dimensionality. Support Vector Machines (SVMs) are models, created in a supervised manner, used for classification and regression analysis20. SVM is a well-suited classification technique when the training data consist of a large number of variables relative to the number of observations. In SVM, every sample x consisting of n variables is treated as an n-dimensional vector. The goal is to find a surface, a hyperplane, that is able to separate the vectors according to their corresponding class. There can be many hyperplanes that classify the data; among them, the ones with the lowest generalization error, i.e. the best predictive capability on new, unseen data, need to be chosen. One common way to achieve this is to choose the hyperplane that represents the largest "separation", or margin, between the classes, i.e. the hyperplane with the largest distance from the closest training data point of any class. In many cases the sets to be discriminated are not linearly separable. To overcome this, the sets can be mapped from the original finite-dimensional space to a higher-dimensional space, in an attempt to make the separation easier37. This is the "kernel trick", used for projecting and classifying the data in a higher-dimensional space without ever computing the corresponding vectors in that space, but rather by computing the inner products of every pair of vectors in the transformed space.

In particular, given a training set of data \(\left( {x_{i} ,y_{i} } \right), i = 1, \ldots , l\) with \(x_{i} \in R^{n}\) and \(y \in \left\{ { - 1, 1} \right\}^{l}\), SVM finds the solution to the following optimization problem:

$$ min_{w,b,\xi } \frac{1}{2}w^{T} w + C\mathop \sum \limits_{i = 1}^{l} \xi_{i} $$
$$ {\text{subject}}\,{\text{to:}}\,y_{i} \left( {w^{T} \varphi \left( {x_{i} } \right) + b} \right) \ge 1 - \xi_{i} , \xi_{i} \ge 0 $$
(3)

The function φ maps the vectors xi to the higher dimensional space, C is the penalty parameter of the error term and \(K\left( {x_{i} ,x_{j} } \right) \equiv \varphi \left( {x_{i} } \right)^{T} \varphi \left( {x_{j} } \right)\) is the kernel function. There are many kernel functions, where the three most commonly used are:

$$ {\text{linear:}}\,K\left( {x_{i} ,x_{j} } \right) = x_{i}^{T} x_{j} $$
(4)
$$ {\text{polynomial:}}\,K\left( {x_{i} ,x_{j} } \right) = \left( {\gamma x_{i}^{T} x_{j} + r} \right)^{d} , \gamma > 0 $$
(5)
$$ {\text{radial}}\,{\text{basis}}\,{\text{function}}\,( {{\text{RBF}}}){:}\,K\left( {x_{i} ,x_{j} } \right) = e^{{ - \gamma \left\| {x_{i} - x_{j} } \right\|^{2} }} $$
(6)
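Written out directly in NumPy, the three kernels of Eqs. (4)–(6) take the form below for a pair of vectors; the γ, r and d values are arbitrary examples, and in practice scikit-learn evaluates these functions internally over whole Gram matrices.

```python
import numpy as np

def linear_kernel(xi, xj):
    return np.dot(xi, xj)                               # Eq. (4)

def polynomial_kernel(xi, xj, gamma=0.1, r=0.0, d=3):
    return (gamma * np.dot(xi, xj) + r) ** d            # Eq. (5)

def rbf_kernel(xi, xj, gamma=0.1):
    return np.exp(-gamma * np.sum((xi - xj) ** 2))      # Eq. (6): exp(-gamma * ||xi - xj||^2)

xi, xj = np.array([1.0, 2.0, 0.5]), np.array([0.0, 1.5, 1.0])
print(linear_kernel(xi, xj))      # 3.5
print(polynomial_kernel(xi, xj))  # 0.35**3 = 0.042875
print(rbf_kernel(xi, xj))         # exp(-0.1 * 1.5) ~ 0.8607
```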

SVM multiclass classification is implemented using the One-vs-the-rest (OvR) strategy, the one most commonly used in multiclass problems, i.e. one classifier is fitted per class. As stated on the scikit-learn38 webpage on multiclass algorithms, each classifier fits its class against all the other classes; the final output is the class corresponding to the SVM with the largest margin. The OvR approach is computationally efficient, since only n_classes classifiers are needed, and it is also interpretable: each class is represented by exactly one classifier, so it is possible to gain knowledge about a class by inspecting its corresponding classifier. In our case, matrix X contains 1815 normalized FTIR training samples with 1,700 variables/wavelengths each, while Y is a single-column matrix containing the class (coded as a number from 0 to 6) of each corresponding sample. First, tenfold cross-validation was used to determine the minimum number of PLS components that minimizes the mean squared error (please refer to Fig. 3a). The mean square error is computed between the predicted class values (i.e. class numbers) and the real class numbers as defined by the matrix Y. In the tenfold scheme, the training set is shuffled and then split into ten subsets (folds). From these subsets, a single set is kept out of the training for validation and the model is trained using the remaining 9 sets; the process is repeated 10 times with a different validation set each time. This process was employed in order to evaluate 100 PLS models differing in the number of components (ranging from 1 to 100; please refer to Fig. 3a). The cross-validation results for our data indicated 41 components as the optimal number to be used. Thereafter, the model was trained using all the training data, taking into account only the optimal number of components. At this point, it should be mentioned that this procedure reduced the dimensionality from 1,700 to just 41 features. Afterwards, the training data are transformed using the PLS model to obtain the reduced training set, which is then used as the training set for the support vector machine. In order to tune the parameters, i.e. find the optimal hyper-parameter values for the SVM, a grid search approach39 was employed. Grid search is an exhaustive search through a manually specified subset of the parameter space, combined with cross-validation, in an attempt to determine the ideal kernel and parameters for the SVM classification. Herein the kernels tested were linear, radial basis function (RBF) and polynomial; the search range for the C parameter was set to [0.0001, 0.001, 0.01, 0.1, 1, 10, 100, 1000], for the γ parameter to [10−6, 10−1] on a logarithmic scale, and the degree to 2, 3, 4 and 5. The hyper-parameter grid search resulted in the linear kernel and C = 100 being chosen as the optimal classifier parameters for our data. The SVM was then trained using this optimal kernel and these parameters. Finally, the same procedure was applied to the test set (described previously), after applying the robust SNV normalization and the transformation into the same PLS space as derived from the training set.
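A rough reproduction of this hyper-parameter search with scikit-learn's GridSearchCV is sketched below. The kernel, C, γ and degree grids mirror the ranges stated above; the placeholder data, the accuracy scoring choice and the number of cross-validation folds inside the search are assumptions.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV

# Stand-ins for the PLS-reduced training data (41 features) and the 7-class labels.
rng = np.random.RandomState(0)
T_train, y_train = rng.rand(350, 41), rng.randint(0, 7, 350)

Cs = [0.0001, 0.001, 0.01, 0.1, 1, 10, 100, 1000]
gammas = np.logspace(-6, -1, 6).tolist()             # 1e-6 to 1e-1, logarithmic scale
param_grid = [
    {"kernel": ["linear"], "C": Cs},
    {"kernel": ["rbf"], "C": Cs, "gamma": gammas},
    {"kernel": ["poly"], "C": Cs, "gamma": gammas, "degree": [2, 3, 4, 5]},
]

search = GridSearchCV(SVC(), param_grid, scoring="accuracy", cv=10)
search.fit(T_train, y_train)
print(search.best_params_)   # the paper reports the linear kernel with C = 100 as optimal
```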

The SVM classification model was evaluated on the test data in terms of accuracy, F1-score, sensitivity, precision, specificity, Matthews correlation coefficient (MCC), informedness and markedness, both overall and per class (data shown in Table SI1). In addition, the class probabilities of the SVM classifier for each test sample were approximated according to Platt's scaling approach, in order to explain any misclassifications and to help interpret the results.
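All of the listed metrics are available in scikit-learn or can be derived from the confusion matrix; the sketch below illustrates the per-class computations on assumed prediction arrays (here a perfect prediction, mimicking the reported result).

```python
import numpy as np
from sklearn.metrics import (confusion_matrix, accuracy_score, f1_score,
                             recall_score, precision_score, matthews_corrcoef)

# Stand-ins for the 240 test labels and a perfect prediction, as reported.
rng = np.random.RandomState(0)
y_true = rng.randint(0, 7, 240)
y_pred = y_true.copy()

cm = confusion_matrix(y_true, y_pred)                  # counterpart of Table 1
acc = accuracy_score(y_true, y_pred)
mcc = matthews_corrcoef(y_true, y_pred)
f1 = f1_score(y_true, y_pred, average=None)            # per-class F1
sens = recall_score(y_true, y_pred, average=None)      # per-class sensitivity
prec = precision_score(y_true, y_pred, average=None)   # per-class precision

# Per-class specificity, informedness and markedness from the confusion matrix.
tp = np.diag(cm).astype(float)
fp = cm.sum(axis=0) - tp
fn = cm.sum(axis=1) - tp
tn = cm.sum() - tp - fp - fn
spec = tn / (tn + fp)
informedness = sens + spec - 1                         # sensitivity + specificity - 1
markedness = prec + tn / (tn + fn) - 1                 # precision + NPV - 1
print("accuracy=%.3f MCC=%.3f min specificity=%.3f" % (acc, mcc, spec.min()))
```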

Materials and samples

The food types under consideration come from 7 different classes, namely beef40,41, pork42,43, chicken44, farmed whole ungutted gilthead sea bream45, ready-to-eat rocket34, baby spinach34 and pineapple. The samples all came from previously published spoilage experiments and the corresponding experimental setups34,40,41,42,43,44,45; the reader can refer to Table 2 and the corresponding publications for more details. In general, each food class consisted of at least 2 independent batches of samples and 2 replicates per sample. From all these samples the corresponding spectral data were acquired with FTIR spectroscopy, as described below. More specifically, in the case of pork, a number of samples came from Tsakanikas et al.42 and additional samples came from a spoilage-related experimental setup of Fengou et al.43. Specifically, minced pork samples were prepared and packaged in food trays, placed in one styrofoam tray (duplicate samples) and wrapped with air-permeable polyethylene cling film46. Samples were stored at different isothermal conditions (4, 8, and 12 °C) and under dynamic conditions (periodic temperature change from 4 to 12 °C) in high-precision (± 0.5 °C) programmable incubators (MIR-153, Sanyo Electric Co., Osaka, Japan) for a maximum period of 14 days. Furthermore, two temporally independent batches were used in order to increase the samples' variability. The pineapple samples were stored in their original packages at three isothermal temperatures, i.e. 4, 8 and 12 °C, and under a dynamic profile (8 h at 4 °C, 8 h at 8 °C and 8 h at 12 °C). Sampling was performed at regular time intervals, depending on the storage temperature, for a maximum period of 10 days. In total, four independent experimental replicates were taken into consideration, resulting in 318 pineapple samples. In summary, the samples used in this study were 1815 training samples (400 beef, 380 pork, 300 fish, 280 pineapple, 190 rocket, 85 chicken, 180 spinach) and 240 test samples (44 beef, 52 pork, 26 fish, 38 pineapple, 29 rocket, 15 chicken, 36 spinach).

From the above description of the data used herein, it is clear that the high diversity of the samples' origin (different batches and, in some cases, different time periods and different people conducting the experiments) and state (sampling over the course of spoilage experiments, resulting in varying biochemical properties of the samples and thus diversity in their corresponding FTIR spectra) allowed this information to be incorporated into the predictive models so as to simulate real-life conditions, since the datasets were acquired under different conditions of temperature, packaging, storage time and degree of microbiological contamination, apart from coming from different batches. In this way it can be ensured that, whatever the classification result, the model will be sufficiently robust and generic with respect to the input, since under different conditions samples of the same type degrade differently and their chemical profiles differ accordingly. It is therefore apparent that the evaluation scheme followed herein and, more importantly, the data on which the classification models were trained are unbiased (even within the same sample type) and exhibit large variability, resulting in a classifier that is robust, generic and thus reliable.

Data acquisition—FTIR spectroscopy

The FTIR spectral data were collected using a ZnSe 45° HATR (Horizontal Attenuated Total Reflectance) crystal (PIKE Technologies, Madison, Wisconsin, USA) and an FTIR-6200 JASCO spectrometer (Jasco Corp., Tokyo, Japan). The spectra acquisition process consisted of cutting a small portion from each sample, placing it on the crystal plate and covering it with a small piece of aluminum foil. The crystal operates at a refractive index of 2.4 and a depth of penetration of 2.0 μm at 1,000 cm−1. The acquired spectra were processed and collected with the Spectra Manager™ CFR (Code of Federal Regulations) software version 2 (Jasco Corp.). The acquisition wavenumber range was 4,000–400 cm−1, and 100 scans with a resolution of 4 cm−1 and a total integration time of 2 min were accumulated. The FTIR spectra used in further analyses covered the approximate wavenumber range of 2,700–1,000 cm−1, i.e. 1,700 wavenumbers (sample features), obtained by removing the water peak starting at ~ 2,700 cm−1 and ignoring the range 400–1,000 cm−1, which mainly represents noise.
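The range selection described above can be expressed as a simple mask over the wavenumber axis; the sketch below uses a hypothetical 1 cm−1 axis and random absorbances purely for illustration, and the exact cut-offs of the released data may differ slightly.

```python
import numpy as np

wavenumbers = np.linspace(4000, 400, 3601)        # hypothetical acquisition axis, 4,000-400 cm-1
spectra = np.random.rand(5, wavenumbers.size)     # stand-in absorbance matrix (5 samples)

# Keep ~2,700-1,000 cm-1: drop the water-related region above 2,700 cm-1
# and the noisy 400-1,000 cm-1 region.
mask = (wavenumbers <= 2700) & (wavenumbers >= 1000)
spectra_used = spectra[:, mask]
print(spectra_used.shape)                         # roughly 1,700 retained wavenumbers
```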

Implementation and performance

The whole pipeline has been implemented in Python 2.7 employing the scikit-learn library39. The code is OS-independent and requires the libraries indicated in the source code at the import statements.

Software and data availability

The python scripts along with the data used for training and test of the system are available at zenodo, https://doi.org/10.5281/zenodo.3237542.