Abstract

Discrimination and identification of melanoma (a kind of skin cancer) by using laser-induced breakdown spectroscopy (LIBS) combined with chemometrics methods are reported. The human melanoma and normal tissues are used in the form of formalin-fixed paraffin-embedded (FFPE) blocks as samples. The results demonstrated higher LIBS signal intensities of phosphorus (P), potassium (K), sodium (Na), magnesium (Mg), and calcium (Ca) in melanoma FFPE samples while lower signal intensities in normal FFPE tissue samples. Chemometric methods, artificial neural network (ANN), linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and partial least square discriminant analysis (PLS-DA) are used to build the classification models. Different preprocessing methods, standard normal variate (SNV), mean-centering, normalization by total area, and autoscaling, were compared. A good performance of the model (sensitivity, specificity, and accuracy) for melanoma and normal FFPE tissues has been achieved by the ANN and PLS-DA models (all were 100%). The results revealed that LIBS combined with chemometric methods for detection and discrimination of human malignancies is a reliable, accurate, and precise technique.

1. Introduction

Cancer is an increasing concern worldwide and has become a major threat to human health. The leading causes of death are still due to various types of cancers. Skin cancer is one of the most frequently diagnosed types of cancer worldwide. For the identification and diagnosis of melanoma skin cancer analytical methods such as synchrotron radiation microanalysis (SXRF) [13], or transmission electron microscopy combined with energy dispersive X-ray analysis (TEM-EDX) [4, 5], laser ablation inductively coupled plasma spectrometry, mass spectrometry (LA-ICP-MS) [58], and Raman Spectroscopy are already used [9]. In terms of LIBS sensitivity and spatial resolution with other techniques such as LA-ICP-MS, TEM, Nano-SIM (nanosecondary ion mass spectrometry), EPMA (electron probe micro-analysis), TEM, and SEM (scanning electron microscope), as indicated in [1], which illustrated that LIBS achieves micrometer scale resolution with part-per-million range sensitivities. The LIBS detection range is much more than other techniques. In brief, comparison of LIBS feature that LIBS is better than XRF is as follows: for example, elemental detection range, isotope detection, spectral line interference, lateral spatial resolution, spatial resolution of depth, surface contamination sensitivity, rapidity of analysis, and safety during use [3]. Micro-SXRF is ideal for elemental mapping and chemical speciation with a spatial resolution down to less than 100 nm and a detection limit in the order of less than parts per million because of its high brightness and linear polarized existence [5].The sensitivity and resolution given by SXFR or microXRF is very high and is primarily used for Cu, Fe, and Zn imaging. Quantification with SXRF, however, remains difficult and appropriate calibration requirements should be achieved in an appropriate matrix [6, 7]. LIBS has been the only all-optical technique that is entirely consistent with optical microscopy, providing part-per-million-range sensitivity (accessible for most metals) and micrometer-scale resolution [8]. The acquisition rate can be very high in LIBS since it is constrained only by the speed of the detector and the frequency rate of the laser. A pace of acquisition up to kilohertz has already been seen [9]. Although these analytical approaches showed high performance in terms of spatial resolution and sensitivity, their relatively slow analysis and the complexity of the required equipment make their use rather restrictive and difficult for routine medical diagnosis. Therefore, it is important to develop a precise and accurate method to identify and differentiate the melanoma samples from the normal.

In recent years, laser-induced breakdown spectroscopy (LIBS) has become a well-recognized and valuable analytical spectroscopic technique for the identification and analyses of different tissues with the ability to detect all the elements on the periodic table, including low- and high-Z elements [1012]. Some additional advantages of LIBS are acknowledged such as its ability to work at room temperature and atmospheric pressure, ease of use, simplicity in apparatus, full compatibility with optical microscopy, and the prospect of standoff and on-site operations, and there is no sample size or shape restrictions. In LIBS, the laser pulse is focused on the sample surface. Plasma can be formed if the irradiance at the focal spot overreach a specific threshold valued. Excitation and deexcitation of ions and atoms in the plasma of the ablated sample will give out fingerprint emission of the different elements. The chemical components of the samples can be firmed by analysing atomic and ionic emissions, and the sample can then be discriminated and classified [13].

In the literature, LIBS has been widely used to investigate various types of malignancies discrimination. Kumar et al. used LIBS spectra intensities difference ratios of various elements of the histological sections of dog liver, to distinguish normal liver and hemangiosarcoma [14]. Han et al. used LIBS spectra in the presence of argon flow rate = 15 L/min of mice sample (in the form of tissue and pellets) for a distinction of surrounding dermis and melanoma via LDA model for quantitative analysis difference of magnesium and calcium [13]. Gaudiuso et al. used mice blood and tissues as a sample, for the early diagnosis of melanoma; they obtained an accuracy of up to 96% [15]. To the best of our knowledge, here we, for the first time, evaluated the use of LIBS to obtain elemental analyses, the classification and discrimination of paraffin-embedded human biopsies of healthy and malignant skin tissues combined with chemometric methods.

In the present work, LIBS combined with chemometrics approaches has been used for investigations and discriminations of human melanoma FFPE tissue samples instead of animal samples. We performed our experiment in an open environment while using no inert gasses during sample ablation in experiments. We also investigate the performance of the model for different types of input data (10 lines intensities, 27 lines intensities, and all lines intensities). For precise and quantified results, data preprocessing plays a vital role. The analyst uses different procedures for data analysis. The main difference in the results can come from the use of different algorithms to perform deionizing and/or baseline subtraction [16, 17], to extract the emission signals for the selection of spectral line analysis. We also proposed different preprocessing methods like standard normal variate (SNV), autoscaling (auto), mean-centering (mean), and normalization by area (norm). We investigated which preprocessing method performed better and which data set line intensity is suitable in ANN, QDA, LDA, and PLS-DA models by comparing their impacts on the final classification accuracies, respectively. The performances of the models are determined in terms of accuracy, sensitivity, and specificity along with suitable preprocessing methods, and also reported P (phosphorus) line of the FFPE tissue samples, while the above-reported results about melanoma references [13, 15] did not report any P lines in their spectra because of mice samples (tissue, pellets, and blood) and early melanoma diagnosis. Based on these results, we analysed distinguishable elements from melanoma FFPE tissues, reflecting the clinical situation.

2. Experimental Setup and Samples

2.1. LIBS Experimental Setup

As shown in Figure 1, a flash- pumped Q-switched Nd: YAG laser at 1064 nm with pulse duration 5 ns, repetition frequency 1 Hz, energy 64 mJ/pulse, and beam diameter ∅6 mm was used as the excitation source. The laser pulse was focused on the sample surface by three mirrors and a convex lens with a 100 mm focal length. The fiber with a diameter ∅ 600 μm was used to collect plasma emission by another convex lens with a 36 mm focal length. Two-channel spectrometer (AvaSpec 2048-2-USB2 and Avantes) was connected to the outlet of optical fiber. The samples were placed on a 3D translation stage in order to ablate the fresh tissue samples spots. The coverage range of the spectrometer was 190 nm–1100 nm with 0.2∼0.3 nm resolution. A digital delayer (SRS-DG535, Stanford Research System) and a photodetector were also used in the system. When the laser signal is detected by a photodetector, the spectrometer was triggered by DG535 after a preset delay time, 1.28 μs in our case, to reduce the bremsstrahlung radiation. The time of integration of CCD detector was set to 2 ms. The experiment was performed in an open environment without any buffer gasses.

2.2. Samples

The melanoma and normal FFPE tissues were used as the samples in the study; some images of samples are shown in Figure 2 before and after laser ablation. The samples were collected from the Origene Technologies Inc. of America with ethical permission from the Chinese custom Authority.

Based on the anatomic level of invasion of the layers of skin and vertical thickness of the lesion in millimetres, two classification schemes have been developed for skin cancer. For accurate prediction of future tumour behaviour, the most exclusive classification, the Breslow classification scheme is used recently. The stage of thin (T1) melanoma was classified by Clark level from the last decades [18]. In our research, the normal FFPE samples were collected from stage I melanoma patients and the melanoma FFPE samples were from stage III C patients. The number of the normal and the melanoma spectra was 111, respectively.

2.3. Data Preprocessing

Usually, due to interference of surrounding air, fluctuation of laser energy, and inhomogeneity of sample surface, spectral fluctuations are produced between the measurements of each pulse. To reduce data fluctuations, suitable preprocessing methods are needed [19]. For obtaining a reliable data matrix, proper preprocessing of the raw data is very important, to perform the actual statistical analysis. In addition, proper data preprocessing appreciably upgrade and clarify the data analysis [2022]. Four types of preprocessing methods are used in the present study, that is, autoscaling, normalization by total area, mean-centering, and SNV. The scaling process gives equal importance to all data sets and should not be suitable for noisy (i.e., poor signal-to-noise ratio) data [22].

2.4. Classification Models

In the present study, four types of classification models (ANN, PLS-DA, LDA, and QDA) are used on the four types of different preprocessing methods (autoscaling, mean-centering, normalization by total area, and SNV). For ANN networks, feedforward networks are used because of their excellent ability of self-adapting and self-learning. The two common and principal network types of feedforward networks are radial basis function (RBF) and multilayer perceptron (MLP). For data analysis when ANN is used, it is very essential to determine between the ANN model (the network’s arrangement) and ANN algorithms (computation that eventually produces the network output). The network that has been constructed for an application is ready to be trained. There are two types of training approaches, which are supervised and unsupervised. A fully connected ANN is used most frequently, with a backpropagation learning rule supervised network. ANN model of this type is better for discrimination purposes [23].

PLS-DA is one of the supervised methods and needs a learning step prior to its application to unknown samples. The interclass difference increases while the intraclass difference decreases in the PLS-DA model, which revealed the model recognition ability. PLS-DA model is very suitable for binary classification, while complexity will increase in the case of multiclassification PLS-DA model [24]. PLS-DA has also been applied to the classification of LIBS data and discriminates the tissue samples [25].

For classification purposes, both LDA and QDA are also used and achieved good accuracy results [26, 27]. LDA and QDA models are derived from simple probabilistic models, which model the class conditional distribution of the data P (X|y = k) for each k class. Bayes rules are used for predicting the class as shown inwhere X and y are events, P (y = k|X) is a conditional probability, and k is the class which maximizes the conditional probability.

Sensitivity is the ability of a test which decides the disease cases correctly, while specificity is defined as the ability of a test which decides and determines the healthy cases. Similarly, accuracy is defined as the ability of a test to demarcate the disease and healthy cases precisely. Sensitivity, specificity, and accuracy are calculated by the following equations, respectively:where (TP) is true-positive, the number of cases precisely recognized as a disease, (FP) is false-positive, the number of cases incorrectly identified as a disease, (TN) is true-negative, the number of cases precisely recognized as healthy, and (FN) is false-negative, the number of cases incorrectly identified as healthy.

3. Results and Discussion

3.1. LIBS Spectra Measurement Results

The typical LIBS spectra of human melanoma and normal FFPE tissue samples are illustrated in Figure 3(a), which is the average of 111 spectra in the spectral range of 200–900 nm of each sample. The LIBS spectra are usually normalized for discrimination purposes to reduce the spectral fluctuations that are produced due to the matrix effects and experimental variations. The normalization is operated comparatively to a specific line. The Carbon C 247.482 nm line is used for normalization purposes because the C emission is a specific line [13]. Carbon C 247.482 nm is used for normalization purposes because the C emission is firmed among all the measurement of LIBS spectra. The normalized LIBS spectra of the samples in the spectral region of 200 to 900 nm are also illustrated in Figure 3(b). Emission lines of magnesium Mg II, Mg I, calcium Ca I, Ca II, iron Fe I, phosphorus PII, sodium Na I, potassium K I, hydrogen H, oxygen OI, and CN molecular bands (386–389 nm) are observed in the melanoma and normal FFPE tissue samples spectra. The spectral intensities of the melanoma and normal FFPE tissue samples are different. For example, the intensities of Ca II (393.366 nm and 396.847 nm), Ca I (422.67 nm), P I (531.607 nm), and K I (766.5 nm and 769.9 nm) in melanoma FFPE tissue sample are stronger as compared to those in normal FFPE tissue sample. It is notable that the spectral data intensities and ratio between them are very important for the classification purpose of FFPE tissue samples.

For further classification analysis, these 27 element lines are selected as features variables of the LIBS spectra. The main spectral lines can be seen, which is located at the ionic and atomic lines from Ca, Fe, H, K, Mg, Na, O, and P. These atomic and ionic emission lines of melanoma are more vigorous than those of the normal FFPE tissue samples. Other emission lines like Fe and H are frail and weaker. Therefore, these emission lines are listed in Table 1 for the classification of FFPE melanoma and normal tissue sample.

We detected significant changes in the LIBS spectra between melanoma and normal samples. In melanoma FFPE tissue samples, the peak intensities of Ca, K, Mg, and P increase relative to the normal FFPE tissue peak intensity. Magnesium plays an important role in regulating cell division as well as Ca. The melanin-containing pigment granules have an enormous amount in melanoma cells, which are wealthy in Ca [28]. The high levels of Ca are also reported in previous literature in different cancer tissues, like breast cancer, colorectal cancer [13], uterine cancer [29], and canine hemangiosarcoma [14]. Magnesium has also an important role in cell proliferation and the biosynthesis of proteins [30]. In the LIBS imaging technique, the elemental images of the tumour reported a gradient concentration for Ca, Mg, and P in metastatic melanoma cases, and also reported the wide dynamic ranges of concentration for elements, such as Fe and Na from low to high concentration [31]. A bioelemental study of melanoma lymph nodes also showed a clear distinction of P from the normal tissue [32].

The normalized intensities of 27 emission lines of the melanoma and normal FFPE tissue samples are shown in the bar graph in Figure 4. These 27 emission lines are selected as features emission lines of normal and melanoma spectra. Then, the area average was taken for these 27 elemental lines and selected only 10 lines on the basis of their normalized spectral intensities to draw a threshold line, which is also shown in Figure 4. For discrimination purposes, three types of input variables are used as input data for all the classification models (10 lines, 27 lines, and all lines intensities, resp.).

3.2. Classification Results

Classification accuracy is defined as the division of the number of correct predictions by predictions number, multiplied by 100% to gain the percentage accuracy. We performed all the models with the training set including 83 spectra of melanoma and normal sample (166 spectra in all) and a test set including 27 spectra of melanoma and normal sample (54 spectra in all).

For the ANN model, the number of hidden layers of neurons l, which needs to be optimal, is calculated according to the formula

The accuracy of the ANN model for different input variable data sets (10 lines, 27 lines, and all lines) is illustrated in Figure 5. In 10-line input variable data set, the highest accuracy is 100% for an autoscaling method in all hidden layers of a neuron, while the mean-centering, normalization by total area, and SNV preprocessed method showed 98% and 97%, respectively, which is shown in Figure 5(a). Similarly, for 27-line intensity, input variables data set mean-centering showed the highest accuracy 100%, while the autoscaling and SNV showed 98% accuracy for different hidden neurons’ layers as shown in Figure 5(b). Furthermore, for the whole input spectra variable data set, the ANN accuracy is 98% in autoscaling and 99% for normalization in all hidden layers, while the mean-centering and SNV showed very low accuracy for all the hidden layers as shown in Figure 5(c). Overall, the normalization data set revealed the best sensitivity, specificity, and accuracy results, for the ANN model. So, for tissue analysis, the normalization by area is the best and suitable compared to other preprocessing methods in the ANN model; the accuracy % result of the ANN model is illustrated in Figure 6(a).

PLS-DA is also used to discriminate against the tissue samples. The 10-line intensity data set showed the highest accuracy result for PLS-DA model classification. Both mean-centering and SNV data sets showed good classification accuracy as compared to other preprocessing data sets as revealed in Table 2. The classification result of the PLS-DA model for a different preprocessing method is shown in Figure 6(b).

For classification purposes, both LDA and QDA are also used and achieved good accuracy results. In the LDA model, the accuracy of classification for all the cases (10 lines, 27 lines, and all lines) cannot achieve the 100% for FFPE tissues differentiation. The highest and lowest accuracy values for LDA classification model are 98% and 77%, respectively. While in QDA model classification, normalization by total area preprocessed method achieved 98% accuracy in the 27-line input variable data set. The minimum accuracy for QDA model classification is calculated for the whole spectra input variable data set in SNV preprocessed method, which is 77%. The percentage accuracy rate of LDA and QDA models are shown in Figures 6(c), and 6(d), respectively.

The accuracy results of ANN, PLS-DA, LDA, and QDA models along with suitable and appropriate preprocessing methods are illustrated in Table 2, which showed the highest accuracy for different input variable data sets and suitable preprocessed method. In the ANN model for the 10-line input variable data set, only the autoscaling preprocessed method showed the highest 100% accuracy, while for the 27-line input variable data set only the mean-centering showed the highest 100% accuracy. For the whole spectral input data set, only the norm-area preprocessing method showed the highest 98% accuracy, while the other preprocessed methods showed lower accuracy results. Similarly, in the PLS-DA model for 10-line intensity input variable data set, three preprocessing methods (autoscaling, SNV, and mean-centering) showed the highest 100% accuracy; on the other hand, for 27-line intensity and whole spectral data set, the preprocessed methods (mean-centering, normalized by the area, and SNV) showed the highest 100% accuracy, respectively. Furthermore, the LDA model classification for 10-line input variable data set showed the highest 98% accuracy for the autoscaling preprocessed method, while for 27-line intensity, the only normalization by area (norm-area) revealed 98% accuracy result and for the whole input variable data set and also only the autoscaling preprocessed method showed 98% accuracy result. In the QDA model classification for 10 lines, input variable data set, only autoscaling, and normalization preprocessed method showed the highest 94% accuracy; similarly, for 27-line input variable data set, normalization by area showed 100% accuracy as well, and for whole data input variable, only autoscaling revealed the highest 98% accuracy result for melanoma FFPE and normal tissue differentiation. In comparison, ANN and PLS-DA is the best classification model instead of QDA and LDA models for all types of input variable data sets.

Receiving operative curves (ROC) is a graphical plot that shows the diagnostic capability of a binary classifier system as its discrimination threshold is varied. The ROC curves for ANN, PLS-DA, LDA, and QDA are revealed in Figures 7(a)7(d) for a different input variable data set of 10 lines, 27 lines, and whole spectral data along with different preprocessing methods, respectively. The best ROC result is shown by the PLS-DA model as shown in Figure 7(b)

The ROC curves of the ANN, PLS-DA, LDA, and QDA model showed the lowest accuracy of 98% for the 10-line input variable dataset. Similarly, for the ROC curve for 27-line input variable data set, the highest sensitivity 100%, specificity 100%, and accuracy 100% for ANN, PLS-DA, and QDA model are shown, while LDA showed the same result as for 10-line input variable data set. On the other hand, for the whole input variable data set, the highest sensitivity 100%, specificity 100%, and accuracy 100% are revealed for PLS-DA models while for LDA and QDA classification models, the whole input variable data set showed the highest sensitivity 96%, specificity 100%, and accuracy 98%. It is concluded that the PLS-DA models are the best and useful for FFPE tissue classification as compared to LDA and QDA models.

The classification compact results of all the preprocessing methods, input variable data sets, and all four types of models ANN, PLS-DA, LDA, and QDA are shown in Figure 8. For ANN model classification, the highest accuracy is 100%, shown by the autoscaling preprocessed method in 10-line input variable data set, and mean-centering in 27-line input variable data set, respectively, while the lowest (minimum) accuracy percentage is shown 52% by SNV preprocessed data in whole spectral input variable data set. Similarly, for PLS-DA model classification, the highest accuracy is 100%, while the lowest is 94% for normalization by area (norm-area) in 10-line input variable data set. Furthermore, for the LDA model classification, the highest accuracy is 98% while the lowest (minimum) accuracy is 77% for SNV preprocessed method in the whole spectral input variable data set. On the other hand, for QDA classification model, the highest accuracy is 100%, while the lowest is 77% for SNV preprocessed method in the whole spectral input variable data set, the same as the LDA model result for the whole input variable data set. All model results revealed that the 27-line intensity was the average highest accuracy rate of 97.6% as compared to others; autoscaling data set showed the highest average accuracy 97.8% of all the models, and PLS-DA showed the averaged best performance model in all of them; average accuracy is 99.16%.

4. Conclusion

The purpose of the research was to discriminate melanoma skin cancer and normal skin by using laser-induced breakdown spectroscopy (LIBS) combined with the chemometric methods. The melanoma and normal FFPE tissues were used as samples. In the LIBS spectra, the intensities of the lines of several elements showed a significant difference between the melanoma and normal FFPE samples and were regarded as feature variables. The lines intensities of calcium (Ca), magnesium (Mg), phosphorus (P), potassium (K), sodium (Na), and oxygen (O) in melanoma samples were higher than the normal samples. Phosphorus (P) line, especially, showed the highest intensity for malignancies. Therefore, phosphorus (P), calcium (Ca), magnesium (Mg), and potassium (K) were defined as biomarkers for discrimination in this study. The chemometric models, such as ANN, PLS-DA, LDA, and QDA were used to analyse the spectral data from melanoma and normal tissue in FFPE. The best performance of the model (sensitivity, specificity, and accuracy) has been achieved by the ANN and PLS-DA models (all were 100%). The results indicated that LIBS combined with the chemometric models could be used as a quick discrimination method for human malignancies.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest related to this article.

Acknowledgments

The research was supported by the National Natural Science Foundation of China (NSFC) under Grant 61775017 and Graduate Technological Innovation Project of Beijing Institute of Technology (2019CX20026).