Rapid and Nondestructive On-Site Classification Method for Consumer-Grade Plastics Based on Portable NIR Spectrometer and Machine Learning

Yang, Yinglin; Zhang, Xin; Yin, Jianwei; Yu, Xiangyang

doi:https://doi.org/10.1155/2020/6631234

Journal of Spectroscopy

On this page

Abstract Introduction Materials and Methods Results and Discussion Conclusions Data Availability Conflicts of Interest References Copyright Related Articles

Research Article | Open Access

Volume 2020 | Article ID 6631234 | https://doi.org/10.1155/2020/6631234

Rapid and Nondestructive On-Site Classification Method for Consumer-Grade Plastics Based on Portable NIR Spectrometer and Machine Learning

Yinglin Yang,¹Xin Zhang,¹Jianwei Yin,²and Xiangyang Yu¹

Academic Editor: Ana Domi nguez Vidal

Received17 Oct 2020

Revised23 Nov 2020

Accepted30 Nov 2020

Published10 Dec 2020

Abstract

The classification of plastic waste before recycling is of great significance to achieve effective recycling. In order to achieve rapid, nondestructive, and on-site detection, a portable near-infrared spectrometer was used in this study to obtain the diffuse reflectance spectrum for both standard and commercial plastics made by ABS, PC, PE, PET, PP, PS, and PVC. After applying a series of pretreatments, the principal component analysis (PCA) was used to analyze the cluster trend. K-nearest neighbor (KNN), support vector machine (SVM), and back propagation neural network (BPNN) classification models were developed and evaluated, respectively. The result showed that different plastics could be well separated in top three principal components space after pretreatment, and the classification models performed excellent classification results and high generalization capability. This study indicated that the portable NIR spectrometer, integrated with chemometrics, could achieve excellent performance and has great potential in the field of commercial plastic identification.

1. Introduction

Plastics are ubiquitous in all aspects of modern life. Global plastic production has grown from 2 million tons in 1950, when plastic was initially conducted to large-scale production and use, to 380 million tons in 2015, which has increased approximately 190 times [1, 2]. The Ellen MacArthur Foundation reported that plastic use had increased 20-fold over the past half-century and predicted that plastic production would double again in the next 20 years and nearly quadruple by 2050, leading to an increase in plastic-related waste [3]. Nearly 50% of plastic produced is intended for single-use disposable products [4], such as packaging bags, water bottles, and disposable consumer goods. The widespread use of plastics brings great challenge related to the waste flow disposal. With plastic solid waste (PSW) growing at an annual rate of 7-8 per cent, the widely used methods of dealing with plastic waste are landfilling, composting, or burning, which are neither economical nor environmentally friendly [5]. Driven by the concept of circular economy, the efficient recycling of plastics and polymers has become an emerging need in the past few years.

Plastic waste can be recycled through various methods, such as thermal depolymerization and thermal compression. However, the first and major problem in plastics recycling is the identification and separation of different types of polymers prior to the recycling process. Otherwise, due to high temperatures and the effect of oxygen, the plastic will oxidatively degrade during mechanical recycling, changing its thermal stability and mechanical property and limiting its further use [6, 7].

Conventional analytical techniques are widely used, among which chromatographic, thermal, and spectroscopic techniques are prominent. Spectroscopic methods, including infrared spectroscopy, ultraviolet-visible absorption spectroscopy, Raman spectroscopy, mass spectrometry, are regularly used to provide accurate information about the structure of the polymers and analyze the additives contained qualitatively and quantitatively. However, most of these methods require an immense amount of time and diverse careful sample pretreatment. Besides, they are either expensive or mostly applied in laboratory.

In recent years, China has started to popularize national garbage classification, which means sorting and recycling are not an activity only performed in factories. Individual plastic identification requires portable equipment, which generally results in decline of measurement accuracy. Hyperspectral imaging is one of the potential solutions, but it has difficulties in data storage and transmission due to the large data volume. Moreover, the imaging equipment is also too expensive to upgrade [8]. By contrast, near-infrared (NIR) spectroscopy is a viable option.

NIR analysis, one of the fastest growing analytical techniques in the world, has advantage in convenient, rapid, efficient, accurate, nondestructive, and environmentally friendly detection [4, 9, 10], already widely used in the plastic recycling workshops to reduce the time consumed by manual recognition. The wavelength of near-infrared spectrum ranges from 780 to 2526 nm, which is consistent with the absorption region of the combined frequency and multiple frequency of hydrogen-containing groups (O–H, N–H, and C–H) in organic molecules. Kaihara et al. [11] constructed a decision tree to classify 18 plastics based on NIR spectroscopy. Tachwali et al. [12] built a plastic bottle classification system based on NIR reflectance spectrum and Bayes rule. Martin De Biasio et al. [13] combined hyperspectral technology with chemometric analysis to identify PP and PE. Masoumi et al. [14] invented “two filter” methods to achieve a PSW sorting system based on hyperspectral technology.

In this study, a portable near-infrared spectrometer was applied to collect the spectral data for standard and commercial plastic samples, including ABS, PC, PE, PET, PP, PS, and PVC. PCA was used to assess the cluster trend, and K-nearest neighbor algorithm (KNN), support vector machine (SVM), and backpropagation neural network (BPNN) were applied separately to perform the identification between different plastics. The novelty of this paper is that it developed a reliable, rapid, and nondestructive sorting model for consumer plastic rather than standard samples, with only handheld equipment, compensating for its gaps in testing accuracy. Moreover, the targeted modeling method mentioned was extensible for continuously growing data, having the potential to be integrated with smart devices and cloud platform, to achieve intelligence and practical application of spectral on-site sorting.

2. Materials and Methods

2.1. Sample Preparation

In this study, two types of plastic samples were chosen for classification. The first type was the standard plastic samples, including ABS, PC, PE, PET, PP, PS, and PVC, to conduct preliminary experiment and abandon unsuitable pretreatment. Considering the limitation of the NIR spectrum, dark plastic is difficult to identify due to its strong absorption capacity [5]. White and transparent samples were used for this study consequently. Each type of plastic included 30 samples with the same size, 30 mm by 30 mm by 3 mm. The second type was the consumer-grade plastic products in circulation, which is common in our daily life, but impossible to distinguish their raw material exteriorly. Specific sample information and the corresponding labels in this study are shown in Tables 1 and 2.

2.2. Spectral Acquisition

In order to achieve on-site rapid detection, the Pynect NIR-S-G1 NIR handheld spectrometer was used in this study to obtain diffuse reflectance spectrum for two types of plastic samples. This spectrometer is highly integrated with the size of 82 mm 65 mm 42 mm with low pretreatment requirement for acquisition. Spectral data with a wavelength of 228 bands ranging from 900 nm to 1700 nm were measured. The spectrometer was fitted as closely as possible to each sample and scanned six times to output an average spectrum automatically. The same white background was used for all spectral measurements. The data acquisition was conducted without any pretreatment.

For the standard samples, each sample was scanned five times parallelly. 150 spectra were obtained for each type of plastic and 1800 spectra in total. For commercial products, the same operation was repeated. Limited by the shape of products, a contact location as close as possible for spectral scanning was selected. 200 spectra were obtained for each type of samples and 2000 spectra in total. Figure 1(a) shows the raw spectra of transparent commercial samples.

(a)

(b)

(c)

(d)

2.3. Spectral Preprocessing Techniques

Preprocessing for spectral data is an integral part of modeling, to remove background information and noise from the useful characteristics of the samples scanned [15, 16]. Reproducibility of specific absorption features helps [17] to classify different polymers, and spectral pretreatment is an effective way to improve the repeatability of spectral features.

Considering the performance of the classification for standard samples, preprocessing methods used in this study included multiplicative scatter correction (MSC), standard normal variate transformation (SNV), first-order derivative, and Savitzky–Golay (SG) smoothing. MSC and SNV are commonly used to remove unwanted scattering effect from the spectral data matrix [15, 18]. Derivative and SG smoothing can eliminate interference caused by baseline drift, resolve overlapping peaks, and improve resolution and sensitivity, but also introduce some errors.

Figures 1(b)–1(d) show the spectra of transparent commercial samples preprocessed by MSC, SNV, and first-order derivative methods. Considering the practical application scenario, the model was built with as little processing of the prediction data as possible, e.g., the MSC preprocessing was only performed for the calibration set.

2.4. Principle Component Analysis

Principle component analysis (PCA) is an unsupervised pattern recognition method that clusters samples in the absence of a priori knowledge, extracting information from correlation matrix and visualizing data trends in dimensional scatterplots [19]. Transforming high-dimensional spectral data into an array of related variables containing overlapping information makes extracting useful information from high-dimensional data possible [20]. In addition, PCA is one of the manifold learning algorithms to reduce the dimensionality of spectral data [21].

In this study, PCA was applied to evaluate the cluster trend of spectral data and was also combined with other pretreatment methods to reduce the dimensionality. Figure 2 shows the performance of the MSC-PCA-KNN model with different numbers of principal components. The accuracy tended to stabilize with no significant improvement after the PC number exceeding 10, which was selected for the subsequent comparison of model performance. The same selection method was also applied to other pretreatment methods.

2.5. Multivariate Analysis Methods

KNN is a nonparametric [22], instance-based, lazy learning method. It works on a distance function, which classifies different features by measuring the distance between them. KNN has advantages of mathematical simpleness, not requiring statistical assumptions, and independence on the spatial distribution of the classes. When the training dataset is large, KNN requires a lot of storage space and time to calculate distance repeatedly [23].

SVM is a representative supervised statistical learning theory suitable for classification and regression problems, which can prevent underfitting and overfitting very well [24]. The algorithm has good classification performance on high-dimensional data though the number of training samples is limited [25]. If classes are separated by nonlinear boundaries, SVM uses kernel functions to enable linear separation of classes [18]. Radial basis function (RBF) was used in this study, for its computational simplicity and speed [23].

BPNN is one of the most commonly used algorithms for training feedforward neural networks [26]. BPNN has the characteristic of forward signal propagation and error backward propagation, which can automatically adjust the weights and biases in the architecture [27]. In this study, after multiple repeated experiments, BPNN had three hidden layers, with 128, 64, and 32 neurons, and the rectified linear unit (ReLU) function was used as the activation function. The output layer used softmax to output a probability matrix. The Adam optimization algorithm was used to calculate the gradient of the loss function in each epoch and update the parameters to minimize the loss function.

2.6. Data Partition and Model Evaluation

The complete dataset was divided into calibration set (80%) and prediction set (20%). The calibration set, further divided into training and validation sets, was applied to train and evaluate the models with the same pretreatments, and the prediction set was applied to evaluate the generalization capability of the models. The prediction set was not involved in the modeling process to avoid data leakage. Cross-validation is one of the simplest and most commonly reproduced methods for estimating prediction accuracy [28]. To avoid the impact of random data partition, 10-fold cross-validation method was used for the calibration set, which was repeated 10 times to obtain average metrics.

The model evaluation applied general evaluation metrics for multiclassification, including accuracy, precision, sensitivity, and F1-score. This study considered macroaverage and microaverage methods to calculate evaluation metrics. Averaged precision is defined as follows:where TP_i and FP_i refer to true-positive and false-positive values for single species of plastic, respectively, n_class refers to number of categories, and n_total refers to number of all samples.

2.7. Software

All the spectral data processing and multivariate analysis algorithms were carried out using Python 3.7 and JetBrains PyCharm 2019.2.3 x64 software under Windows 10.

3. Results and Discussion

3.1. Principle Component Analysis (PCA)

The principal component score plots of the mixed dataset, transparent dataset, and white dataset with different pretreatments are given in Figures 3(a)–3(d). The mixed sample datasets showed a faint clustering compared with other two datasets. Considering white or transparent samples separately, all the samples clustered well among top three PCs space after the spectral pretreatment. The MSC method gave excellent performance, with the cumulative variance contribution exceeding 99%, and different categories were clearly separated. In order to achieve better performance and expansibility for future applications, targeted modeling according to colors was adopted in the subsequent modeling process.

(a)

(b)

(c)

(d)

3.2. Model Comparison and Discussion

KNN, SVM, and BPNN were applied for modeling. Figure 4 presents the confusion matrix of SVM model built for the transparent sample dataset. The observed values of the correct and incorrect classes of the prediction set could be visually obtained and can be used to calculate corresponding metrics. It could be seen that the misjudgment mainly occurs between PC and PET samples.

(a)

(b)

(c)

(d)

Table 3 and 4 give the performance of the classification models developed for transparent samples and white samples. Compared with the standard samples, the number of PCs decreased significantly, indicating that after separating the datasets by color, the data of the same category were more consistent and fewer PCs could explain the difference. For the transparent plastic samples using the MSC method, the accuracy of the prediction set was much lower than that of the calibration set, most likely because of the great difference between the raw prediction dataset and the preprocessed dataset for training.

As a lazy learning method, KNN needs to store all the training data, resulting in slower prediction process, more memory, and expensive overhead [29, 30], which is not an optimal solution in practical application. Accordingly, for transparent samples, SNV-PCA-SVM and SNV/D1-PCA-BPNN were selected as the optimal models, and for white samples, D1-PCA-SVM was selected, both achieving 100% accuracy. Both SVM and BPNN can achieve high accuracy, and BPNN has the potential to outperform SVM dealing with large volumes of spectral data.

4. Conclusions

NIR spectroscopy has been considered as an effective method for the identification of PSW. This study has revealed that the handheld NIR spectrometer, rather than laboratory equipment, coupling with appropriate pretreatment and multivariate analysis methods, could be used to identify not only standard plastic samples but also consumer-grade plastics for field application. Targeted modeling of commercially available consumer products by color was carried out, and each optimal model chosen could give a 100% correct recognition rate in both the calibration set and prediction set, entirely meeting the requirement for field application.

The methods of establishing, tuning, and evaluating different plastic classification models described in this paper are universally valid, and it is feasible for modeling the classification of consumer products according to different colors. Furthermore, the targeted modeling methods is extensible and plastic of other colors can be added easily, contributing to expand the range of application. In the future study, the models can be extended by following several aspects: addition of threshold criterion of additives in consumer products to identify plastics qualitatively and quantitatively and targeted modeling for roughness and transparency. Besides, considering the advantages of BPNN in large volume data modeling, portable spectrometer can be redeveloped to integrate with intelligent devices and build a spectral data cloud platform. Such platform can continuously optimize the model by collecting vast data in the process of large-scale application, contributing to achieve the miniaturization, automation, and intelligence of the rapid on-site spectral detection.

Data Availability

The spectral data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

References

R. Geyer, J. R. Jambeck, and K. L. Law, “Production, use, and fate of all plastics ever made,” Science Advances, vol. 3, 2017.
View at: Publisher Site | Google Scholar
L. M. Rios Mendoza and M. Balcer, “Microplastics in freshwater environments: a review of quantification assessment,” TrAC Trends in Analytical Chemistry, vol. 113, pp. 402–408, 2019.
View at: Publisher Site | Google Scholar
D. E. MacArthur, D. Waughray, and M. Stuchtey, “The new plastics economy, rethinking the future of plastics,” World Economic Forum, 2016, https://www.weforum.org/reports/the-new-plastics-economy-rethinking-the-future-of-plastics.
View at: Google Scholar
J. Hopewell, R. Dvorak, and E. Kosior, “Plastics recycling: challenges and opportunities,” Philosophical Transactions of the Royal Society B: Biological Sciences, vol. 364, no. 1526, pp. 2115–2126, 2009.
View at: Publisher Site | Google Scholar
S. Zhu, H. Chen, M. Wang et al., “Plastic solid waste identification system based on near infrared spectroscopy in combination with support vector machine,” Advanced Industrial and Engineering Polymer Research, vol. 2, no. 2, pp. 77–81, 2019.
View at: Publisher Site | Google Scholar
W. Camacho and S. Karlsson, “Assessment of thermal and thermo-oxidative stability of multi-extruded recycled PP, HDPE and a blend thereof,” Polymer Degradation and Stability, vol. 78, no. 2, pp. 385–391, 2002.
View at: Publisher Site | Google Scholar
A. Boldizar and K. Möller, “Degradation of ABS during repeated processing and accelerated ageing,” Polymer Degradation and Stability, vol. 81, no. 2, pp. 359–366, 2003.
View at: Publisher Site | Google Scholar
M. Kishore and S. Kulkarni, “Approches and challenges in classification for hyperspectral data: a review,” in Proceedings of the 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT), pp. 3418–3421, IEEE, Chennai, India, 2016.
View at: Google Scholar
M. Kumagai, H. Suyama, T. Sato, T. Amano, and N. Ogawa, “Discrimination of plastics using a portable near infrared spectrometer,” Journal of Near Infrared Spectroscopy, vol. 10, no. 4, pp. 247–255, 2002.
View at: Publisher Site | Google Scholar
T. Huth-Fehre, R. Feldhoff, T. Kantimm et al., “Nir - remote sensing and artificial neural networks for rapid identification of post consumer plastics,” Journal of Molecular Structure, vol. 348, pp. 143–146, 1995.
View at: Publisher Site | Google Scholar
M. Kaihara, M. Satoh, and M. Satoh, “Systematization method for distinguishing plastic groups by using NIR spectroscopy,” Analytical Sciences, vol. 23, no. 7, pp. 921–924, 2007.
View at: Publisher Site | Google Scholar
Y. Tachwali, Y. Al-Assaf, and A. R. Al-Ali, “Conservation, and recycling: automatic multistage classification system for plastic bottles recycling,” Resources, Conservation and Recycling, vol. 52, no. 2, pp. 266–285, 2007.
View at: Publisher Site | Google Scholar
M. De Biasio, T. Arnold, G. McGunnigle, R. Leitner, D. Balthasar, and V. Rehrmann, “Detecting and discriminating PE and PP polymers for plastics recycling using NIR imaging spectroscopy,” in Proceedings of the Thermosense XXXII, p. 76610V, 2010.
View at: Publisher Site | Google Scholar
H. Masoumi, S. M. Safavi, and Z. J. I. J. Khani, “Engineering, “Identification and classification of plastic resins using near infrared reflectance,” International Journal of Mechanical and Industrial Engineering, vol. 6, pp. 213–220, 2012.
View at: Google Scholar
J. Coronel-Reyes, I. Ramirez-Morales, E. Fernandez-Blanco, D. Rivero, A. Pazos, and E. I. Agriculture, “Determination of egg storage time at room temperature using a low-cost NIR spectrometer and machine learning techniques,” Computers and Electronics in Agriculture, vol. 145, pp. 1–10, 2018.
View at: Publisher Site | Google Scholar
M. Blanco and I. Villarroya, “NIR spectroscopy: a rapid-response analytical tool,” TrAC Trends in Analytical Chemistry, vol. 21, no. 4, pp. 240–250, 2002.
View at: Publisher Site | Google Scholar
O. Rozenstein, E. Puckrin, and J. Adamowski, “Development of a new approach based on midwave infrared spectroscopy for post-consumer black plastic waste sorting in the recycling industry,” Waste Management, vol. 68, pp. 38–44, 2017.
View at: Publisher Site | Google Scholar
E. Teye, C. L. Y. Amuah, T. McGrath, C. Elliott, and B. Spectroscopy, “Innovative and rapid analysis for rice authenticity using hand-held NIR spectrometry and chemometrics,” Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, vol. 217, pp. 147–154, 2019.
View at: Publisher Site | Google Scholar
B. K. Lavine and N. J. E. Mirjankar, “Theory, and instrumentation, clustering and classification of analytical data,” Theory and Instrumentation, 2006.
View at: Publisher Site | Google Scholar
S. A. Mehdizadeh, S. Minaei, N. H. Hancock, and M. A. K. J. Torshizi, “An intelligent system for egg quality classification based on visible-infrared transmittance spectroscopy,” Information Processing in Agriculture, vol. 1, pp. 105–114, 2014.
View at: Google Scholar
A. L. J. Levada, “Parametric PCA for unsupervised metric learning,” Pattern Recognition Letters, vol. 135, pp. 425–430, 2020.
View at: Publisher Site | Google Scholar
R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, John Wiley & Sons, Hoboke, NJ, USA, 2012.
L. A. Berrueta, R. M. Alonso-Salces, and K. Héberger, “Supervised pattern recognition in food analysis,” Journal of Chromatography A, vol. 1158, no. 1-2, pp. 196–214, 2007.
View at: Publisher Site | Google Scholar
H. Yu and S. Kim, “SVM tutorial-classification, regression and ranking,” Handbook of Natural Computing, vol. 1, pp. 479–506, 2012.
View at: Publisher Site | Google Scholar
Y. Tarabalka, M. Fauvel, J. Chanussot, J. A. Benediktsson, and R. S. Letters, “SVM- and MRF-based method for accurate classification of hyperspectral images,” IEEE Geoscience and Remote Sensing Letters, vol. 7, no. 4, pp. 736–740, 2010.
View at: Publisher Site | Google Scholar
H. R. Maier and G. C. Dandy, “Neural networks for the prediction and forecasting of water resources variables: a review of modelling issues and applications,” Environmental Modelling & Software, vol. 15, no. 1, pp. 101–124, 2000.
View at: Publisher Site | Google Scholar
Y. Liu, L. Nie, L. Han, L. Zhang, and D. S. Rosenblum, “Action2Activity: recognizing complex activities from sensor data,” in Proceedings of the Twenty-fourth International Joint Conference on Artificial Intelligence, Buenos Aires, Argentina, 2015.
View at: Google Scholar
T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Science & Business Media, Berlin, Germany, 2009.
L. Jiang, Z. Cai, D. Wang, and S. Jiang, “Survey of improving k-nearest-neighbor for classification,” in Proceedings of the Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007), pp. 679–683, IEEE, Haikou, China, 2007.
View at: Google Scholar
S. J. R. M. Thirumuruganathan, “A detailed introduction to K-nearest neighbor (KNN) algorithm,” vol. 20, p. 2012, 2010.
View at: Google Scholar

Copyright

Copyright © 2020 Yinglin Yang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1491

Downloads

1058

Citations