Mathematical Problems in Engineering

Mathematical Problems in Engineering / 2015 / Article
Special Issue

Extreme Learning Machine on High Dimensional and Large Data Applications

View this Special Issue

Research Article | Open Access

Volume 2015 |Article ID 268132 |

Sen Zhang, Xi Chen, Yixin Yin, "An ELM Based Online Soft Sensing Approach for Alumina Concentration Detection", Mathematical Problems in Engineering, vol. 2015, Article ID 268132, 8 pages, 2015.

An ELM Based Online Soft Sensing Approach for Alumina Concentration Detection

Academic Editor: Yi Jin
Received19 Aug 2014
Accepted30 Oct 2014
Published27 May 2015


The concentration of alumina in the electrolyte is of great significance during the production of aluminum; it may affect the stability of aluminum reduction cell and the current efficiency. However, the concentration of alumina is hard to be detected online because of the special circumstance in the aluminum reduction cell. At present, there is lack of fast and accurate soft sensing methods for alumina concentration and existing methods can not meet the needs for online measurement. In this paper, a novel soft sensing method based on a modified extreme learning machine (MELM) for online measurement of the alumina concentration is proposed. The modified ELM algorithm is based on the enhanced random search which is called incremental extreme learning machine in some references. It randomly chooses the input weights and analytically determines the output weights without manual intervention. The simulation results show that the approach can give more accurate estimations of alumina concentration with faster learning speed compared with other methods such as BP and SVM.

1. Introduction

In the industrial aluminum reduction cells, the stability of the alumina concentration is the key issue to maintain high efficiency during the production of aluminum. It is easy to lead to the occurrence of the so-called “anode effect” if the alumina content in the electrolyte becomes too low (e.g., 1%–1.5%). When it occurs, cell voltage rises abruptly to 30v–50v, which directly affects the energy balance of aluminum reduction cells. It is important to avoid the high alumina concentration because alumina-rich sludge will accumulate at the base of the cell and the cell operation can be seriously disrupted [1]. Therefore, how to detect the distribution of alumina concentration in the cell in real time is the key problem to control the production of aluminum. The site environment of aluminum reduction cell, which has the characteristics of large current, strong magnetic field, high temperature, high humidity, and a lot of dust, is very severe and complex. Aluminum reduction cell is a severe nonlinear, multi-input multioutput, slow time-variety, long time-delay, and coupled process. Therefore, it is a challenge to seek an online soft sensing method for alumina concentration. Numerous investigations have been carried out on the soft measurement method for alumina concentration by the researchers. A prediction model based on wavelet neural network was proposed by Li et al. [2]. The prediction method based on linear regression and orthogonal transform is applied to improve the accuracy of the alumina concentration forecast by Lin et al. [3]. Yan and Liang proposed a predictive model of aluminum reduction cell based on LS-SVM [4]. Li et al. [5] proposed a new fuzzy expert control method based on smart identification, multicontrol mode, and decision-making mechanism to achieve the alumina concentration prediction and real time control. The model is introduced into the aluminum concentration estimate by Zhang et al. [6]. However, the computational burden of the above nonlinear predictive models is still large when the dimension of input variable increases; the learning speed and accuracy of networks are in general far slower and can not meet the requirement of real time detection.

Figure 1 shows the experimental environment in the ZunYi aluminium electrolysis factory in China. In industries, there often exist some crucial variables that can not be directly measured online due to the fact that no sensor is available in the certain complex environment or due to its high cost. Soft sensing technology is a way to solve the problems. The soft sensing technology is widely used and becomes one of the important developing directions of surveying and processing control area. In practice, the application and research of this technology have gotten more extensions and many related technologies based on it have emerged [710].

The main thought of the soft sensing technology is to build one model which uses variables that could be directly online measured as input and thus the variables to be estimated as output, to use many kinds of complex calculating and evaluation and to get the values of detecting variables by computer software [1114]. So the main problem of soft sensing technology is how to build the relation model between detecting variables and other easy getting variables. By now, there are many methods to build the models, and many methods are in the trend of intersecting and mixing together. In all the methods, the artificial intelligence method is used often, such as the method based on model identification, the method based on artificial neural network, or the method based on fuzzy set theory. Some professors and scholars have done some research about the technology, such as global asymptotic stability of neural networks with multiple time-varying delays and fuzzy model-based robust networked control for a class of nonlinear systems proposed by Zhang et al. [15, 16], weighted least squares support vector machines, robustness and sparse approximation, proposed by Suykens et al. [17], Artificial Neural Network in process engineering proposed by Willis et al. [18], and the modified extreme learning machine method by Cao et al. [19]. Soft sensing technology is applied to various fields; Wang and Ren proposed soft sensing method for wastewater treatment based on BP neural network [20]. Rolling bearing fault detection based on the Teager Energy Operator and Elman Neural Network was proposed by Liu et al. [21].

So far, the widely used method of determining alumina concentration in the industrial factory is to use the spectrometer to analyze the sampled electrolyte. It is an offline method. The improved ELM method is applied to build up the soft sensing online detection approach of alumina concentration in this paper. This ELM algorithm tends to provide the best generalization performance at extremely fast learning speed. The experimental result shows that the new method can produce the best generalization performance and learn much faster than the traditional prediction models.

The following sections are organized as follows. Section 2 shows the theory of the extreme learning machine. Section 3 proposes the modified ELM algorithm. Data selection and data preprocessing are shown in Section 4. Section 5 gives the simulation results. Section 6 summarizes the conclusions.

2. The Theory of Extreme Learning Machine

Huang proposed extreme learning machine (ELM) algorithm; ELM was originally proposed for the single-hidden-layer feedforward neural networks (SLFNs) and then it extends to the generalized SLFNs. ELM can randomly generate the input weights and the bias of hidden nodes. It uses the theory of least squares to get the output weights. The learning speed of ELM can be thousands of times faster than traditional feedforward network learning algorithms like backpropagation (BP) algorithm while obtaining better generalization performance and smaller training error [22, 23]. Figure 2 shows the single-hidden-layer feedforward network (SLFN) architecture.

Considering there are arbitrary distinct samples , when and , standard SLFNs with hidden neurons and activation function are mathematically modeled aswhere is the weight vector connecting the th hidden neuron and the input neurons, is the weight vector connecting the th hidden neuron and the output neurons, and is the threshold of the th hidden neuron.

The fact that standard SLFNs with hidden neurons with activation function can approximate these samples with zero error means that ; there exist , , and such that

The above equations can be written compactly as where is called the hidden layer output matrix of the neural network. ELM is to minimize the training error as well as the norm of the output weight. Minimize and .

The minimal norm least squares method instead of the standard optimization method was used in the original implementation of ELM: where is the Moore-Penrose generalized inverse of . Several methods can be used to calculate the ; these methods may include orthogonal projection, orthogonalization method, iterative method, and singular value decomposition (SVD), and ELM algorithm makes use of SVD, where , so The algorithm ELM can be summarized as Three-Step Learning Model. Given a training set , an activation function , and the hidden neuron number , we have the following steps.

Step 1. Assign arbitrary input weighs and bias of hidden layer nodes .

Step 2. Calculate the hidden layer output matrix .

Step 3. Calculate the output weights .

3. The Modified ELM Algorithm

3.1. Ridge Regression Based ELM Algorithm

Ridge Regression, which was proposed by Horel and Kennard in , proposes the idea to adding a small positive number on the main diagonal of the design matrix. Considering ELM algorithm, the estimation of can be obtained by employing the following formula:where is the ridge parameter. Superficially, it is possible to get the inversion of design matrix . The following is the new results of (8) and (9):which denotes that Ridge Regression is biased.

Considerwhich denotes that Ridge Regression makes the estimation more stable.


Hoerl and Kennard had proved that Ridge Regression has less mean square error than the ordinary regression under the proper ridge parameter. It is as follows:

So, when ,

From the above equations, is an increasing function of when , where . Therefore, the selection of parameter is essential to the performance of Ridge Regression. In our ER-ELM algorithm, a method to determine the ridge parameter proposed by Huang [24] is used:where is the ordinary ELM algorithm estimation and .

3.2. To Select the Number of Hidden Nodes Based on the Improved ELM

The Error Minimized Extreme Learning Machine algorithm starts from a small size of ELM hidden layer and adds random hidden node (nodes) to the hidden layer, while the output weights are updated incrementally.

Suppose a SLFN, , denotes the hidden layer output matrix with hidden nodes and ridge parameter calculated by (19). Considering the poor performance due to lower number of hidden nodes, additional hidden nodes are added to the SLFN. A new hidden layer output matrix is composed of and other extra hidden nodes aswhere , and is ridge parameter of .

Huang had proved , where denotes the output error function of SLFNs. We set a stopping criterion for the iterative algorithm as follows:where is called the target error.

Consider where .

In order to facilitate the calculation, the inversion is substituted as follows:where

Sowhere .

Similarly, we get .

Now, a new hidden layer output matrix is obtained which has less output error. Then, we can update the output weight matrix based on the new hidden layer output matrix.

4. Data Selection and Preprocessing

In the aluminum production, through referring to relative documents and soliciting experts opinion, there are many factors that affect the alumina concentration, such as alumina feeding speed, cell voltage, series current, current of anode rod, voltage between anode rod and cathode bar, and bath temperature [25]. Existing predictive method of alumina concentration is to get an average alumina content in the aluminum reduction cell; the distribution of alumina concentration in the cell is not known yet. At present, people want to know the distribution situation of alumina content in the cell in order to know the process of production better and control the production of aluminum more accurately. Our experimental aluminum reduction cell has 24 anode rods; we can get the current signal of 24 anode rods, voltage signal between anode rods and cathode bars, and alumina concentration signal below the 24 anode rods. Through experiment and analysis, we find that the voltage signal between anode rod and cathode bar can reflect the alumina concentration below the anode rod more accurately. So we can use the voltage signal between anode rod and cathode bar in the aluminum reduction cell to reflect the distribution of alumina concentration. To use the ELM algorithm, we decide to select voltage between anode rod and cathode bar variable as model input according to experiments, and output variable is alumina concentration in the electrolyte below the anode rod.

It is necessary to do the data denoising preprocessing before using the algorithm, because the process of data collection may introduce noise, where data collection is affected by interference and influence of all kinds of noise signal. At present, there are two denoising methods, the traditional filtering method and the wavelet denoising method. Traditional denoising method is based on Fourier analysis and can always be used in the environment where signal and noise are very small. While wavelet analysis is known as the “microscope” of mathematical analysis, it is a time-frequency analysis method of signal, with the characteristics of multiresolution analysis [26]. Different signals may choose the different denoising methods. Through experiment and comparision, we decide to use the wavelet denoising method in view of the voltage signal between anode rod and cathode bar.

5. Simulative Results of Alumina Concentration

In this paper, we select voltage between anode rod and cathode bar as model input and alumina concentration in the electrolyte below the anode rod as model output.

First, ELM algorithm is applied to build soft sensing method model of alumina concentration. In general, active functions play an important role in computing of neural networks. Widely used active functions in the ELM algorithm are , , , and . Through comparison, we find that the function has more outstanding performance than , , and . So we apply function as active function in the ELM algorithm. In order to make comparison, we also use the BP and SVM algorithm to build soft sensing models of alumina concentration [27, 28]. The BP parameters are chosen as 17 hidden layer neurons and 1 output layer neuron and transfer function of hidden layer is tan-sigmoid and transfer function of output layer is .

Data in this paper came from a 350 kA prebaked aluminum reduction cell in the ZunYi aluminium electrolysis factory; Figure 3 shows experimental aluminum reduction cell and anode rod. We got the data of voltage between anode rod and cathode bar through the voltage measurement instruments. At the same time, we obtained the electrolyte below the experimental anode rod in the experimental aluminum reduction cell, and then we used the fluorescence spectrometer in the laboratory to analyze the alumina concentration. Figure 4 shows the sampled electrolyte. We got the experimental data through this method. We select 100 pairs as samples to construct sample set (), where denotes voltage between anode rod and cathode bar and denotes alumina concentration obtained at the same time with the voltage siginal.

The ELM, BP, and SVM soft sensing models are trained by the same sample set. All the data are preprocessed before training and simulating in the algorithm. For the purpose of comparing the three models quantitatively, we substitute ten of new sample (out of the sample set) into ELM, BP, and SVM models, respectively, to calculate corresponding ; simulation results of ELM model are shown in Figure 5; the alumina concentration results of three models are shown in Table 1. To see the results more clearly, the actual alumina concentration values and simulation results are displayed under the same axis which is shown in Figure 6. There are five performance indicators to measure the quality of algorithm: Training Time, Testing Time, Training RMSE (root mean square error), Testing RMSE, and Average Relative Error (ARE), where The simulation results of five performance indicators are shown in Table 2.

1 2 3 4 5 6 7 8 9 10

Actual value (%) 1.94 2.37 2.03 1.95 2.16 2.02 1.99 1.94 2.64 2.94
Output of ELM (%)1.8164 2.0641 2.0739 2.1544 1.9423 1.9065 1.9166 2.047 2.6761 2.6443
Output of BP (%) 1.7551 2.015 1.9953 2.4558 1.9594 1.9047 1.9510 2.0183 3.0956 2.9993
Output of SVM (%)1.9176 2.003 2.006 2.0917 1.8828 1.8834 1.8979 1.8833 2.301 2.3042

Training Time (s) Testing Time (s) Training RMSE Testing RMSE ARE (%)

ELM 0 0 0.0926 0.1784 6.78
BP 28.3281 0.0156 0.1698 0.2626 9.17
SVM 0.0156 0 0.2635 0.2797 8.83

From Figure 7, the horizontal axis denotes the values of voltage between anode rod and cathode bar and the vertical axis denotes the alumina concentration below the experimental anode rode. The simulation results of ELM approach the sample data in a certain range. As is shown in Table 1 and Figure 8, the values of output of ELM model are closer to the actual values of alumina concentration. From Table 2, Training Time of ELM model is 0 s whose Training RMSE is and Testing RMSE is , while Training Time of BP model is 28.3281 s whose Training RMSE is and Testing RMSE is . In the SVM model, Training Time is 0.0156 s and Training RMSE and Testing RMSE are and . In terms of Average Relative Error (ARE) performance indicator, ELM has the smallest ARE () in all of the three algorithms. It is clear that SVM model has faster learning speed and less Average Relative Error than BP model, but SVM model is slower and less accurate than ELM model. So the soft sensing measurement model based on ELM algorithm has better performance than BP model and SVM model.

6. Conclusions

Alumina concentration is very important in the aluminum electrolysis, which may affect the performance of aluminum reduction cell. It is difficult to measure the alumina content due to the complicated environment of aluminum reduction cell. This paper proposes a novel soft sensing method of alumina concentration in the electrolyte based on extreme learning machine (ELM) and builds the relationship between alumina concentration and voltage between anode rod and cathode bar. Through the simulation results and comparison of BP model and SVM model, we can see the validity and advantage of the proposed method. This method is able to effectively achieve rapid and reliable estimation of alumina concentration in a relatively short time.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.


This work has been supported by the National High Technology Research and Development Program (“863” Program) of China (Grant no. 2013AA040705).


  1. H. Vogt, “Effect of alumina concentration on the incipience of the anode effect in aluminum electrolysis,” Journal of Applied Electrochemistry, vol. 29, no. 7, pp. 779–788, 1999. View at: Publisher Site | Google Scholar
  2. J. J. Li, C. D. Wang, and L. Ying, “Application research on neural network predictive control technology in Aluminum electrolysis process,” Instrument Technique and Sensor, vol. 8, pp. 91–93, 2011. View at: Google Scholar
  3. J. D. Lin, L. Li, and P. Zhang, “Research of predicting alumina concentration based on orthogonal transformation,” Journal of Wuhan Institute of Technology, vol. 32, pp. 9–13, 2010. View at: Google Scholar
  4. G. Yan and X. Liang, “Predictive models of aluminum reduction cell based on LS-SVM,” in Proceedings of the International Conference on Digital Manufacturing and Automation (ICDMA '10), pp. 99–102, Changsha, China, December 2010. View at: Publisher Site | Google Scholar
  5. J. Li, W.-G. Zhang, F.-Q. Ding, and Y.-X. Liu, “Fuzzy expert control method based on on-line intelligent identification and its application,” Journal of Central South University of Technology, vol. 35, no. 6, pp. 911–914, 2004. View at: Google Scholar
  6. H. Zhang, J. Li, W. Zhang, X. Chen, and Z. Zou, “Application of gray GM (1, 1) model to alumina concentration estimation in aluminum electrolysis,” Chinese Journal of Scientific Instrument, vol. 29, no. 4, pp. 883–887, 2008. View at: Google Scholar
  7. J. W. Cao, T. Chen, and J. Fan, “Fast online learning algorithm for landmark recognition based on BoW framework,” in Proceedings of the 9th IEEE Conference on Industrial Electronics and Applications, Hangzhou, China, June 2014. View at: Google Scholar
  8. W. Xiao, P. Liu, W.-S. Soh, and G.-B. Huang, “Large scale wireless indoor localization by clustering and extreme learning machine,” in Proceedings of the 15th International Conference on Information Fusion (FUSION '12), pp. 1609–1614, Singapore, September 2012. View at: Google Scholar
  9. W. Xiao, P. Liu, W. S. Soh, and Y. Jin, “Extreme learning machine for wireless indoor localization (poster),” in Proceedings of the 11th International Conference on Information Processing in Sensor Networks (IPSN '12), pp. 101–102, Beijing, China, April 2012, (ISTP, EI). View at: Google Scholar
  10. W. Xiao, Y. Lu, J. Cui, and L. Ji, “Recognition of human stair ascent and descent activities based on extreme learning machine,” in Proceedings of the 5th International Conference on Extreme Learning Machines (ELM '14), Marina Bay Sands, Singapore, 2014. View at: Google Scholar
  11. M. J. Arauzo-Bravo, J. M. Cano-Izquierdo, E. Gomez-Sanchez et al., “Automatization of a penicillin production process with soft sensors and an adaptive controller based on neuro fuzzy systems,” Control Engineering Practice, vol. 12, pp. 1073–1090, 2004. View at: Google Scholar
  12. A. J. de Assis and R. M. Filho, “Soft sensors development for on-line bioreactor state estimation,” Computers and Chemical Engineering, vol. 24, pp. 1099–1103, 2000. View at: Publisher Site | Google Scholar
  13. F. L. Huang, “Thought of soft sensing and technology of soft Sensing,” Journal of Metrology, vol. 7, 2004. View at: Google Scholar
  14. J. Yu, “Soft sensing technology and its application,” Journal of Automatic Instrument, vol. 1, pp. 71–78, 2008. View at: Google Scholar
  15. H. Zhang, Z. Wang, and D. Liu, “Global asymptotic stability of recurrent neural networks with multiple time-varying delays,” IEEE Transactions on Neural Networks, vol. 19, no. 5, pp. 855–873, 2008. View at: Publisher Site | Google Scholar
  16. H. G. Zhang, M. Li, J. Yang, and D. D. Yang, “Fuzzy model-based robust networked control for a class of nonlinear systems,” IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans, vol. 39, no. 2, pp. 437–447, 2009. View at: Publisher Site | Google Scholar
  17. J. A. K. Suykens, J. de Brabanter, L. Lukas, and J. Vandewalle, “Weighted least squares support vector machines: robustness and sparce approximation,” Neurocomputing, vol. 48, pp. 85–105, 2002. View at: Publisher Site | Google Scholar
  18. M. J. Willis, C. Di Massimo, G. A. Montague, M. T. Tham, and A. J. Morris, “Artificial neural networks in process engineering,” IEE Proceedings D, vol. 138, no. 3, pp. 256–266, 1991. View at: Publisher Site | Google Scholar
  19. J. W. Cao, Z. Lin, G.-B. Huang, and N. Liu, “Voting based extreme learning machine,” Information Sciences, vol. 185, no. 1, pp. 66–77, 2012. View at: Publisher Site | Google Scholar | MathSciNet
  20. W.-L. Wang and M. Ren, “Soft-sensing method for wastewater treatment based on BP neural network,” in Proceedings of the 4th World Congress on Intelligent Control and Automation, pp. 2330–2332, Shanghai, China, June 2002. View at: Google Scholar
  21. H. Liu, J. Wang, and C. Lu, “Rolling bearing fault detection based on the teager energy operator and elman neural network,” Mathematical Problems in Engineering, vol. 2013, Article ID 498385, 10 pages, 2013. View at: Publisher Site | Google Scholar
  22. G.-B. Huang and L. Chen, “Enhanced random search based incremental extreme learning machine,” Neurocomputing, vol. 71, no. 16–18, pp. 3460–3468, 2008. View at: Publisher Site | Google Scholar
  23. G.-B. Huang, L. Chen, and C.-K. Siew, “Universal approximation using incremental constructive feedforward networks with random hidden nodes,” IEEE Transactions on Neural Networks, vol. 17, no. 4, pp. 879–892, 2006. View at: Publisher Site | Google Scholar
  24. J. C. Huang, “Improving the estimation precision for a selected parameter in multiple regression analysis: an algebraic approach,” Economics Letters, vol. 62, no. 3, pp. 261–264, 1999. View at: Publisher Site | Google Scholar
  25. J. Li, Y. Huang, H. Wang, and Y. Liu, “Estimation model of alumina concentration for point-feeding aluminum reduction cells,” in Proceedings of the 123rd TMS Annual Meeting on Light Metals, pp. 441–447, March 1994. View at: Google Scholar
  26. X. Zhang, D. Xu, and Z. Qi, “Study of wavelet denoising method Based on the modulus maxima domain,” Data Acquisition and Processing, no. 9, pp. 315–318, 2003. View at: Google Scholar
  27. A. Meghlaoui, J. Thibault, R. T. Bui, L. Tikasz, and R. Santerre, “Neural networks for the identification of the aluminium electrolysis process,” Computers & Chemical Engineering, vol. 22, no. 10, pp. 1419–1428, 1998. View at: Publisher Site | Google Scholar
  28. J. A. K. Suykens, J. Vandewalle, and B. de Moor, “Optimal control by least squares support vector machines,” Neural Networks, vol. 14, no. 1, pp. 23–35, 2001. View at: Publisher Site | Google Scholar

Copyright © 2015 Sen Zhang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

More related articles

 PDF Download Citation Citation
 Download other formatsMore
 Order printed copiesOrder

Related articles

We are committed to sharing findings related to COVID-19 as quickly as possible. We will be providing unlimited waivers of publication charges for accepted research articles as well as case reports and case series related to COVID-19. Review articles are excluded from this waiver policy. Sign up here as a reviewer to help fast-track new submissions.