Abstract

The detection of authenticity is essential to the development and management of Thai jasmine rice industry. In this study, the multispectral imaging system (405–970 nm) was used for the detection of adulteration in Thai jasmine rice combined with chemometric methods including principal component analysis (PCA), partial least squares (PLS), least squares-support vector machines (LS-SVM), and backpropagation neural network (BPNN). Three varieties of rice that were similar to Thai jasmine rice in appearance were selected to perform the classification and quantitative prediction experiments by multispectral images. For the classification experiment, four varieties of rice samples could be easily classified with accuracy achieved to 92% by the BPNN model. For the quantitative prediction of adulteration proportion experiments, the results showed that, among the different chemometric methods, LS-SVM achieved the best prediction performance comparing the results of coefficient of determination, root-mean-square error (RMSEP), bias, and residual predictive deviation (RPD). It can be concluded that multispectral imaging technology with chemometric methods can be applied in the rapid and nondestructive detection of authenticity of Thai jasmine rice.

1. Introduction

Rice (Oryza sativa), as one of the most important food crops, supplies staple foods for more than half of the world’s population [13]. In the world rice market, Thailand is the largest rice exporting country over the past decade according to USDA [4], and Thai jasmine rice is one of the most popular varieties that command a premium price due to its unique flavor and delicious taste [5]. However, in recent years, Thai jasmine rice is vulnerable adulterated by mislabeling or part adulterated with other varieties of rice for higher commercial profits. Thus, how to detect authenticity of Thai jasmine rice fast and accurately is becoming a key issue to be solved in the rice industry to protect the interests of quality conscious consumers, stakeholders, and importing countries [6].

Till now, many methods were employed to detect the authenticity of rice, such as DNA fingerprinting [7, 8], fluorescence-based inter-simple sequence repeat-polymerase chain reaction (ISSR-PCR) and simple sequence repeats (SSR) markers [9], fluorescent simple sequence length polymorphisms [10], denaturing high-performance liquid chromatography [11], and high-resolution inductively coupled plasma mass spectrometry [12]. However, these traditional analytical methods are generally time-consuming, laborious, sophisticating, destructive, or toxic. Thus, it is necessary to develop a rapid and nondestructive method to ensure the quality of Thai jasmine rice in market.

With the improvement of the optical and spectroscopic techniques, some new methods were developed for rapid determination of food quality such as X-ray fluorescence [13], Raman spectroscopy [14], near infrared (NIR) spectroscopy [1517], hyperspectral imaging [1820], and THz spectroscopy [21]. Multispectral imaging, with the spectra range from visible light to near infrared, is regarded suitable for the rice detection because it can acquire both spatial and spectral information by combining the benefits of traditional imaging and spectroscopic techniques. Moreover, in multispectral system, some typical spectral bands were selected to avoid mass spectral imaging data which was caused by the high resolution of spectroscopy in NIR spectral and hyperspectral methods. Recently, this technique was applied in many agriculture fields such as discrimination of transgenic rice seeds and soybean seeds [22, 23], identification of tomato varieties [24], differentiation of soybean and weed species [25], detection of minced beef adulteration [26, 27], and detection of dicyandiamide in infant formula powder [28]. Meanwhile, to analyze the spectral data, some chemometric methods have been developed such as partial least square (PLS), least square-support vector machine (LS-SVM), and backpropagation neural network (BPNN), which are important for the application of optical and spectroscopic techniques in nondestructive detection [29, 30].

To study the authenticity of Thai jasmine rice, the classification and adulteration experiments were developed using multispectral imaging technique combined with chemometric methods. Three varieties of rice that were similar to Thai jasmine rice in appearance were selected to perform the classification and quantitative prediction experiments. Comparing the results from different chemometric methods, the best model for the detection of authenticity of Thai jasmine rice was obtained.

2. Materials and Methods

2.1. Samples Preparation

Four varieties of rice including Thai jasmine rice, jasmine sticky rice, Simiao rice, and Northeast Wuchang rice were purchased from Metro supermarket of Hefei City, China (Figure 1). These four varieties of rice were similar in shape and color with the naked eye and stored in sealed plastic bags at 4°C until use.

Firstly, in classification experiment, a total of 800 intact rice samples (200 samples in each variety) were collected. Among all samples, 600 rice samples (150 samples in each variety) were randomly allocated to the calibration set, and 200 rice samples (50 samples in each variety) were randomly allocated to the prediction set. Then in adulteration experiment, Thai jasmine rice was selected as unadulterated rice samples and parts of them were adulterated with different proportions of other three varieties of rice, respectively. Rice samples including eleven adulterated proportions ranging from 0% to 100% (w/w) (0%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, and 100%) were prepared and each adulterated sample totally contained about 2 g of rice. In each adulteration experiment, there were 20 samples for each adulterated proportion and all samples summed up to 220 in total (15 × 11 samples in calibration set and 5 × 11 samples in prediction set, respectively).

2.2. Multispectral Imaging System and Image Acquisition

VideometerLab instrument (Videometer A/S, Hørsholm, Denmark) with 19 different wavelengths ranging from 405 nm to 970 nm was used to capture multispectral images of rice samples. The detailed wavelengths were 405, 435, 450, 470, 505, 525, 570, 590, 630, 645, 660, 700, 780, 850, 870, 890, 910, 940, and 970 nm. The instrument provides a rapid and nondestructive assessment of uniform and nonhomogeneous samples without preparation. The multispectral imaging system integrates illumination, a monochrome gray scale CCD camera, and computer technology with advanced digital image analysis and statistics. Spatial resolution of the multispectral images was 2056 × 2056. The images of rice samples without the background can be used for the extraction of spectra data. Moreover, morphological features including area (mm2), length/width, roundness, and Hunter L, a, and b values of rice were all extracted from the image analysis and processing. Detail information of the setup can be found in previous report [31]. Figure 2 shows the procedures of the authenticity detection of Thai jasmine rice.

2.3. Chemometrics Methods

Principal component analysis (PCA) is a method known for reducing the dimensionality of the system which is widely used to deal with large datasets like spectral data. It is possible to demonstrate differences between the various objects with fewer variables. In this study, PCA was used to distinguish the Thai jasmine rice from the other varieties of rice. With the multispectral imaging data, PCA can provide the most important information about the potential capability of differentiating the samples.

PLS, as a linear regression method for multivariate calibration, is used to find the fundamental relations between the predictors (X) and the responses (Y). Thus, it can reduce the primordial predictors to a new variable set called latent variables (LVs), which have the best predictive ability. The number of LVs, as a critical parameter in the algorithm, was determined by minimizing the value of the prediction residual error sum of square (PRESS).

Support vector machine (SVM) is an important classification learning algorithm. To make SVM suitable for engineering applications, researchers developed LS-SVM by applying least squares errors as the training error function and made it powerful for solving problems in nonlinear classification, function estimation, and density estimation. This modification simplifies the training process of SVM and reduces computation time [32]. Radial basis function (RBF) was used as the kernel function of LS-SVM. The algorithm comprises two main parameters: one is a regularization parameter that determines the trade-off between structural and empirical risk minimization and affects the generalization performance of the LS-SVM model, and the other represents the radial width of the kernel function. The details of LS-SVM algorithm could be found in the literature [33].

BPNN is also a common model utilized in agricultural products detection studies, which can solve complex problems more accurately than linear techniques. In this study, BPNN has a three-layer network consisting of an input layer, a hidden layer, and an output layer. Leave-one-out cross-validation procedure was used during the calibration step. Several network architectures were tested by varying the number of neurons in the hidden layer with different initial weights. The optimal parameters such as hidden nodes, the goal error, and iteration times were determined by the least prediction error. The theory of BPNN has been described in detail in a previous publication [34].

2.4. Evaluation of Model Performance

For the analysis of spectral data in experiments, the multivariate data must be trained to build a calibration model and then evaluated for its performance with prediction set of samples. The analysis model was evaluated with the parameters of root mean square error of calibration (RMSEC), root mean square error of prediction (RMSEP), bias, and the coefficient of determination (R2) in calibration () and prediction (). Moreover, the residual predictive deviation (RPD) was used to evaluate how well a calibration model can predict in new prediction samples. High RPD value indicates very good prediction in samples outside the calibration set. Generally, an optimum model should have high values of , and RPD, and low values of RMSEC and RMSEP.

2.5. Statistical Analysis

All statistical analyses for multispectral imaging features data of the four different varieties of rice samples, including area (mm2), length/width, roundness, and Hunter L, a, and b were performed using SAS software. The data were expressed as mean ± standard deviation and analyzed with one-way analysis of variance (ANOVA). Duncan’s multiple range test was used for mean comparison.

3. Results and Discussion

3.1. Analysis of Multispectral Images

Table 1 shows the features of the four different varieties of rice samples, including area (mm2), length/width, roundness, and Hunter L, a, and b. In these parameters, the morphology of the rice could be seen from area, roundness, and length/width values and the color of the rice could be seen from the Hunter L, a, and b values. From the statistical analysis in Table 1, it can be seen that the Northeast Wuchang rice was the most different variety among the four varieties of rice samples in appearance. The Thai jasmine rice and Simiao rice were similar in length/width values, while there was no significant difference between Thai jasmine rice and jasmine sticky rice in CIE L, a, and b.

Average reflection spectra of rice samples in each variety are shown in Figure 3. It could be seen that the general trend of spectra of the four varieties of rice samples was very similar. Although the shape and color were similar among the four varieties of rice with the naked eyes, apparent difference could be observed around 450 nm that was related to blue-green color. In Table 1, CIE L, a, and b values of different varieties of rice were given. L is used to describe the lightness, a presents the redness or greenness, and b designates the blueness. It could be seen that there was significant difference in b values among the four varieties of rice, which was consistent with the result in reflection spectra in Figure 3. Meanwhile, there was no obvious difference in reflection intensity of NIR region especially between 940 and 970 nm that was high related to the water content of the detection object [35]. Comparing the results of L, a, and b in Table 1 and the reflection spectra values in Figure 3, it could be seen that the difference in color was more apparent than in water content among the four varieties of rice. This may be due to the similar water content in all varieties of rice.

3.2. Analysis of PCA

PCA was performed initially to examine the qualitative difference of Thai jasmine rice and other three varieties of rice in principal component (PC) space. Figure 4 shows the three-dimensional principal component score plot using the first three score vectors, and PC1, PC2, and PC3 were derived from spectral features, morphological features, and combination of spectral and morphological features of the rice samples, respectively. The first three factors accounted for all spectral data information of 99.18%, 99.63%, and 99.59% for three different features combination, which showed that PC1, PC2, and PC3 could present almost all the information of the spectral data.

The results in Figures 4(a) and 4(b) suggested that it could not separate the four varieties of rice clearly only with spectral or morphological features, while in Figure 4(c), the discrimination between Thai jasmine rice and other varieties of rice was more apparent than in Figures 4(a) and 4(b). The results suggested that both spectral features and morphological features were key factors for the discrimination of Thai jasmine rice and other varieties of rice. However, it could not be used to classify the four varieties of samples directly due to the overlaps among four varieties of rice. Thus, the advanced intelligent chemometric methods were used to obtain the discrimination model with both the spectral and morphological features in this study.

3.3. Classification of Different Varieties of Rice

Establishment of an accurate and reliable predictive model for determining rice varieties was necessary to discriminate the Thai jasmine rice. The LS-SVM and BPNN prediction models were built by using the spectral and morphological features data. Table 2 shows the discrimination results of four varieties of rice in the calibration set and prediction set. In the prediction set, the classification accuracies of LS-SVM and BPNN were 82% and 89% with the spectral data, respectively. Meanwhile, the classification accuracies of LS-SVM and BPNN were achieved in 84% and 92% with the combined spectral and morphological features, respectively. These results proved that the discrimination performance could be improved by using the combination of spectral and morphological features. Meanwhile, compared with LS-SVM, BPNN was a better chemometric method for discriminating Thai jasmine rice from other three varieties of rice with the accuracy being 100% using the combined spectral and morphological features in the calibration set, while in the prediction set, the discrimination performance was 92% with the combined spectral and morphological features data.

3.4. Quantitative Detection of Adulteration

Quantitative detection models were obtained using PLS, BPNN, and LS-SVM methods. The performances of these three models for predicting Thai jasmine rice proportions are shown in Table 3. The excellent prediction results for the three different varieties of rice were achieved in adulteration experiments. Most of the coefficients of determination of the results for each method were beyond 0.90. In general, the results of the Thai jasmine rice adulterated with Northeast Wuchang rice was the best in both determination and root mean square error, which showed that it was the most different variety from Thai jasmine rice, while the lowest values of and were found in the results of adulteration with the jasmine sticky rice which showed it may be the most similar variety to Thai jasmine rice. For the variety of Northeast Wuchang rice, the best detection model was obtained using LS-SVM with of 0.995, of 0.988, RMSEC of 1.398%, bias of 2.10%, RMSEP of 1.603%, and the RPD of 8.482. For the most similar variety of jasmine sticky rice, the best detection model was obtained using LS-SVM with of 0.990, of 0.935, RMSEC of 2.164%, bias of 0.382%, RMSEP of 2.582%, and the RPD of 9.674. Furthermore, the RPD value showed that the LS-SVM model was adequate for predicting the adulteration proportion of Thai jasmine rice.

For the chemometrics methods, there were apparently different effects for each method to Thai jasmine rice detection. Compared with BPNN and PLS models, the LS-SVM model had the best predictive results with all and being over 0.93 in three different adulterated varieties of rice. Although some values of and in PLS and BPNN models were similar with LS-SVM model, the RMSEP was higher than those in LS-SVM model. The results indicated that the LS-SVM model was more suitable for adulteration proportion prediction of Thai jasmine rice compared to PLS and BPNN.

Different varieties of rice may have different physical and chemical properties, which can be used to achieve the variety classification through chemometric methods and spectral imaging [36]. Thus, although different varieties of rice samples were similar in appearance to the human eye, there were some differences in the special spectral band which can be related in visible and NIR spectral reflectance [3]. Chemometric methods can highlight the differences among different varieties of samples and reduce the variation. In this study, the result was similar to a previous study [23] in that the spectral data and the morphological features data were both key factors for the rapid discrimination of different varieties of rice seeds. The prediction results confirmed the suitability of multispectral imaging for determining adulteration proportion in Thai jasmine rice in a rapid and nondestructive manner.

4. Conclusion

The detection of authenticity of Thai jasmine rice using multispectral imaging combined with chemometrics methods was investigated through the classification and adulteration experiments. The results showed that both spectral data and morphological data were key factors for the rapid screening of Thai jasmine rice. Thai jasmine rice could be discriminated from other varieties of rice even if they were similar to the human eyes. Furthermore, quantitative prediction of adulterated rice proportion in Thai jasmine rice could be achieved. The best results were obtained by LS-SVM combined with spectral and morphological features. In conclusion, the present study has shown that the use of multispectral imaging combined with chemometric methods for detection of authenticity of Thai jasmine rice was a very attractive platform and had the potential to be widely used in rapid and on-site screening because it is nondestructive, simple, and without pretreatments.

Data Availability

Data are available upon request to the corresponding author.

Conflicts of Interest

The authors have no conflicts of interest.

Authors’ Contributions

Wei Liu contributed to the study design, collected the data, interpreted the results, and reviewed the manuscript. Lei Zheng designed the study, interpreted the results, and drafted the manuscript. Changhong Liu analyzed the multispectral imaging data and reviewed the manuscript. Xue Xu collected rice samples and multispectral imaging data.

Acknowledgments

This work was supported by the National Key R&D Program of China (2020YFC1606802), the Key Science & Technology Specific Projects of Anhui Province (202003a06020022 and 18030701200), the Natural Science Foundation of Anhui Province of China (2008085MC96 and 1908085QF270), the Fundamental Research Funds for the Central Universities (JZ2019YYPY0027), and the Talent Research Foundation of Hefei University (18-19RC43).