Equivalent Neural Network Optimal Coefficients Using Forgetting Factor with Sliding Modes

Aguilar Cruz, Karen Alicia; Medel Juárez, José de Jesús; Urbieta Parrazales, Romeo

doi:https://doi.org/10.1155/2016/4642052

Computational Intelligence and Neuroscience

On this page

Abstract Introduction Results Conclusions Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2016 | Article ID 4642052 | https://doi.org/10.1155/2016/4642052

Equivalent Neural Network Optimal Coefficients Using Forgetting Factor with Sliding Modes

Karen Alicia Aguilar Cruz,¹José de Jesús Medel Juárez,¹and Romeo Urbieta Parrazales¹

Academic Editor: Leonardo Franco

Received29 Jun 2016

Revised14 Nov 2016

Accepted20 Nov 2016

Published13 Dec 2016

Abstract

The Artificial Neural Network (ANN) concept is familiar in methods whose task is, for example, the identification or approximation of the outputs of complex systems difficult to model. In general, the objective is to determine online the adequate parameters to reach a better point-to-point convergence rate, so that this paper presents the parameter estimation for an equivalent ANN (EANN), obtaining a recursive identification for a stochastic system, firstly, with constant parameters and, secondly, with nonstationary output system conditions. Therefore, in the last estimation, the parameters also have stochastic properties, making the traditional approximation methods not adequate due to their losing of convergence rate. In order to give a solution to this problematic, we propose a nonconstant exponential forgetting factor (NCEFF) with sliding modes, obtaining in almost all points an exponential convergence rate decreasing. Theoretical results of both identification stages are performed using MATLAB® and compared, observing improvement when the new proposal for nonstationary output conditions is applied.

1. Introduction

Artificial Neural Networks (ANNs) are computational models based on Biological Neural Networks (BNN) synapses description. Biological neurons include a fire function after concatenating the neuron inputs with innovative conditions, given by external stimuli and excitation signals. The answer of this function is transmitted and manipulated, obtaining interconnections with other neurons and integrating a complete network. On the other hand, ANN models are famous because of having the ability to learn and adjust their parameters dynamically, adding factors compound by corrections, or different combining techniques including expert systems [1].

Instead of investing in high computational resources to represent ANN by adding factors to adjust its hidden gains, in [2] are proposed three model approximations allowing the identification selecting, in some sense the gains that the neural net requires, and showing three different representations of ANNs, considering characteristics that make them ideal for modelling and identification and also indicating that nonlinear models could be interpreted as an ANN with specific properties. Unfortunately, the ANNs by themselves have poor performance in identification and estimation tasks when considering nonlinear systems and are not adequate to accomplish online requirements due to their complex algorithms, which are usually based on stable and invariant conditions [3].

A solution for these restrictive conditions is the combination of ANNs with traditional methods and others, such as sliding modes (SM), probability, or fuzzy logic (FL), improving the ANNs performance in convergence sense [4–6]. Nevertheless, the combination of several algorithms also increases its complexity, giving place to a higher computational cost.

The Equivalent Artificial Neural Network (EANN), developed in [7], is a representation that considers the linearization of a multiple input-single output (MISO) system and is useful for cases where time is an important factor, applying different techniques to adjust their gains according to a reference, even with difficulty in modelling external perturbations. In general, any ANN could be reduced to a simpler equivalent model (EANN) integrated by multiple inputs that interact with a set of weights combined in some sense giving a final output [7, 8].

In [9], different adaptive algorithms are presented, where the desired signal is compared to an output having a correction error. The diagram representations are accomplished with the EANN description. Therefore, it is possible to consider the EANN through the Black Box (BB) concept as a MISO system by having the ANN inputs as the BB input vector and the output signal as the one to be compared to the nonstationary reference [1, 8, 10]. Then, the dynamic weights or parameters are calculated, using them instead of those that are random traditionally assigned, whose unique restriction is a preestablished range. These considerations generate a fewer calculation cycles obtaining the desired nonstationary output and reducing the computational cost. Figure 1 shows the analogy between the EANN and the BB and compares both answer signals through the error.

Recursive and traditional identification, for example, through the Least Square Method (LSM) or Kalman Filter (KF), has good results on average as long as the assigning of the initial conditions is adequate. Nevertheless, their performance is reduced when a point-to-point approximation is developed. As the initial weights assignation is not enough, including a forgetting factor (FF) in the estimation process enables the minimization of the convergence error in almost all points [11].

FFs have been used as constants or linear functions, improving the parameter estimation; now the question is what would happen if the system conditions vary significantly from the point where a constant or linear FF was designed? Does it stop being useful? In this sense, different approaches use individual coefficients for particular system evolution times, changing along intervals. Others apply the exponential function to the evolution or sampling time as the argument, presenting a faster exponential convergence to the reference, but only good for smooth changes [12].

The equivalent EAAN model requires the weights estimation with time-varying conditions and considering smooth movements, as a first application. Nevertheless, in many cases, the BBs have nonstationary conditions with ranges that exceed the smooth ideal conditions; that is, the first two probability moments are bounded by distribution functions, respectively, without solving movement tracking tasks, so that there are researchers continuing proposing different adaptive techniques.

In spite of all combinations developed, fortunately in this paper we propose a novel estimation technique combining three traditional tools: (a) the estimation using LSM with instrumental variable, applying the reference signal and the convergence error (built by the difference between the reference and the EAAN output answer) and its sign, (b) the sliding surface based on error properties that allow developing a new evaluation strategy, minimizing the convergence error in less time than the traditional LSM [13, 14], and (c) an innovative exponential FF (EFF) applying traditional SM. The strategy considered is over the traditional estimation because both the calculation of EFF and the construction of the new estimation are made by using SM, a combination that allows for tracking nonstationary weights or parameters.

2. Optimum Weight Values

For making an analytical analysis of the EANN, its elements can be considered as vectors, so that the number of variables is conserved and applicable for any proposition. From Figure 1, the input signals will be represented by the vector , the weights are calculated as the vector , and the output signal will be seen as [15]. These considerations agree with a first-order MISO system defined as which could be solved by using the expected value in a probabilistic sense. From (1), where the input vector and the desired signal are valid for a specific instant of time , we clear . Thus, we obtain and its discrete form:

The results of are applied into the output identification and compared to the reference signal , obtaining the error , whose functional would tend to zero when is optimum.

The description is for the optimum vector coefficients on average so any correction through feedback is not necessary; nevertheless, there is a need to express the solution recursively and to apply it to the equivalent model system, observing the answer evolution through time.

First, we define and with their recursive forms represented by the following, respectively:

Substituting (3) and (4) into (2) and then expressing it in terms of , whose block diagram is shown in Figure 2, we determine the vector, the term whose description is in the following and Figure 3:

3. Exponential Forgetting Factor (EFF)

The system response when using (5) is adequate for cases that need an average approximation, meaning the system has constant parameters. Nevertheless, as we do not know what the weights are or if they suffer any change through the system evolution, it is necessary to improve the estimation technique by creating a more robust procedure.

In [16], we proposed to use an FF for nonnormalized Least Mean Square Algorithm (NLMSA) to improve it. Meanwhile, in [17] is used a deterministic FF to achieve an Optimized Convergence Rate (OCR).

The FF is used to reduce the influence of past information for the calculation of new parameters, obtaining a renewed approximation and braking with the convergence to the media value of the complete process. In [18] it was suggested to use an equivalent ANN model and the sliding modes (SM) in combination with an FF, which gradually reduces the influence of past data. On the other hand, in [19] it is indicated that the FF value should be between zero and one, so that when it is closer to zero, it discards the old data faster, making the response more sensitive to the new data; the opposite happens when the FF is closer to one.

To obtain an optimum coefficient which allows a better response, where the output follows the variations generated in the reference, we propose a nonconstant exponential FF (NCEFF), as the following indicates: whose properties depend on the error , considered as an innovation process, and are based on the calculation of and its sign [13, 14]. The sign function gives the SM sense and allows for converging not only to positive parameters, so that it is necessary to use it as an additional tool.

Applying (6) in combination with (5), and its sign function, we obtain (7), which is an innovative way to calculate a new weight parameter , through an improved technique.

4. Simulation and Results

We developed the simulations in MATLAB integrating both analysing its performances with respect to a reference signal and substituting (5) and (7) into (1). The comparisons give us the idea of how the new estimation technique improves the original estimation presented in [7] for the EANN. In graphics, the estimation named as “optimum” is made using (5) while the “EFF” estimation includes the implementation of (7).

Beginning with the simplest reference, a signal with invariant parameters and no external noise is possibly seen as a constant function , where is constant. Figure 4 compares the reference with the responses that use the estimated parameters through (5) and (7).

As a second test, Figure 5 presents the response when the system has variable parameters without external noise, describing how both approaches behave and their scopes; in this case, the reference comes from , where modifies the vector parameters.

The stationary system response gives an idea of how the estimation could be made and is included in the proposed model. Comparing both, the difference generated affects the evolution model in some sense because the parameters are function of the convergence error.

The main objective of the present paper is to build a special stochastic estimation, which affects the output model system, seeking a better convergence rate, where , where is a random variable .

Now, for the third simulation, we will suppose being able to measure the internal parameters; nevertheless, they are only as an illustrative consideration. Then, Figure 6 compares the unknown system parameters to the estimated ones, and Figure 7 represents the approaches to reach a reference signal, viewed as a polar trajectory.

5. Application Example

The previous considerations could be applied, improving, for example, the resource distribution of a touristic place according to the season current situation, calculating the incomes, the tourism charge, and economic events [20, 21] oscillating in a period of time > 10 years as shown in Figure 8.

These parameters help to determine the passenger numbers going to a touristic place, according to the information from previous years [22]. Figure 9 shows the behaviour of this phenomenon and its identification, describing the nonstationary tendency through a stochastic estimation with sliding modes, describing a nondeterministic situation, point-to-point.

Then, Figure 10 presents the entropy rises of incoming passengers process viewed in polar form, in the same period of years.

6. Conclusions

The present paper demonstrates the advantages and limits of a classic approximation through stochastic system when using the expected value, which, as shown in Figures 3–6, has a good performance in a distribution sense.

Nevertheless, when more precise results are needed, it is necessary to add a correction factor in order to track the parameters that are far from the average, to give a better output response for systems with stochastic characteristics (Figures 7 and 9), such as the proposed dynamic exponential forgetting factor (EFF).

The use the Euler number and the sign function (SF) improved the complex trajectory, as seen in Figure 7, because the implementation of error properties affects the exponential forgetting factor (NCEFF) and sliding modes.

The SF allows the convergence towards the parameters that are not only above the reference but also below this, while the absolute function allows the conversion to only one side of them. In addition, the error used determines the EFF actualized point-to-point as the systems evolves. It was not necessary to create a new recursive approximation for the last estimation stage; nevertheless, as future work, it would be interesting to analyse this second recursion and test if there is some improvement.

Finally, we considered that the proposed method is adequate to estimate coefficients by a complex system, affecting, positively, the Equivalent Artificial Neural Network (EANN) or, in other words, a MISO system.

Competing Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The authors wish to thank the Instituto Politécnico Nacional (IPN) for its support while carrying out our research work through Projects SIP20160382 and SIP20161922.

References

J. Godjevac, Neuro-Fuzzy Controllers: Design and Application, Presses Polytechniques et Universitaires Romandes, Lausanne, Switzerland, 1997.
S. Chen and S. A. Billings, “Neural networks for nonlinear dynamic system modelling and identification,” International Journal of Control, vol. 56, no. 2, pp. 319–346, 1992.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
P. M. Mills, A. Y. Zomaya, and M. O. Tade, “Adaptive model-based control using neural networks,” International Journal of Control, vol. 60, no. 6, pp. 1163–1192, 1994.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
B. Subudhi and D. Jena, “A differential evolution based neural network approach to nonlinear system identification,” Applied Soft Computing, vol. 11, no. 1, pp. 861–871, 2011.
View at: Publisher Site | Google Scholar
W. Wei, E. Barnard, and M. Fanty, “Improved probability estimation with neural network models,” in Proceedings of the International Conference on Spoken Language Processing (ICSLP '96), pp. 502–505, October 1996.
View at: Google Scholar
M. Witters and J. Swevers, “Black-box model identification for a continuously variable, electro-hydraulic semi-active damper,” Mechanical Systems and Signal Processing, vol. 24, no. 1, pp. 4–18, 2010.
View at: Publisher Site | Google Scholar
K. A. Aguilar, J. J. Medel, J. L. Fernández, and M. E. Vigueras, “Neural net gains estimation based on an equivalent model,” Computational Intelligence and Neuroscience, vol. 2016, Article ID 1690924, 10 pages, 2016.
View at: Publisher Site | Google Scholar
J. Wang, “Adaptive control for nonlinear systems using decomposed neural network,” Cybernetics and Systems, vol. 29, no. 1, pp. 73–83, 1998.
View at: Publisher Site | Google Scholar
B. Widrow, M. Lehr, F. Beaufays, E. Wan, and M. Bilello, “Learning algorithms for adaptive signal processing and control,” in Proceedings of the IEEE International Conference on Neural Networks, pp. 1–8, IEEE, San Francisco, Calif, USA, April 1993.
View at: Google Scholar
F. Qiu and J. R. Jensen, “Opening the black box of neural networks for remote sensing image classification,” International Journal of Remote Sensing, vol. 25, no. 9, pp. 1749–1768, 2004.
View at: Publisher Site | Google Scholar
A. Vahidi, A. Stefanopoulou, and H. Peng, “Recursive least squares with forgetting for online estimation of vehicle mass and road grade: theory and experiments,” Vehicle System Dynamics, vol. 43, no. 1, pp. 31–55, 2005.
View at: Publisher Site | Google Scholar
J. E. Parkum, N. K. Poulsen, and J. Holst, “Recursive forgetting algorithms,” International Journal of Control, vol. 55, no. 1, pp. 109–128, 1992.
View at: Publisher Site | Google Scholar | Zentralblatt MATH | MathSciNet
J. W. Harris and H. Stocker, Handbook of mathematics and computational science, Springer-Verlag, New York, NY, USA, 1998.
View at: Publisher Site | MathSciNet
W. Gellert, H. Küstner, M. Hellwich, and H. Kästner, The VNR Concise Encyclopedia of Mathematics, Springer Science & Business Media, 2012.
K. Warwick, “Self-tuning regulators—a state space approach,” International Journal of Control, vol. 33, no. 5, pp. 839–858, 1981.
View at: Publisher Site | Google Scholar | MathSciNet
K. Nishiyama, “A new approach to introducing a forgetting factor into the normalized least mean squares algorithm,” Signal Processing, vol. 114, pp. 19–23, 2015.
View at: Publisher Site | Google Scholar
M. Badoni, A. Singh, and B. Singh, “Variable forgetting factor recursive least square control algorithm for DSTATCOM,” IEEE Transactions on Power Delivery, vol. 30, no. 5, pp. 2353–2361, 2015.
View at: Publisher Site | Google Scholar
J. Yinling and J. Beiyan, “Multi-model neural network sliding mode control for robotic manipulators,” in Proceedings of the International Conference on Mechatronics and Control (ICMC '14), pp. 2431–2435, IEEE, Jinzhou, China, July 2014.
View at: Publisher Site | Google Scholar
A. N. Chand, M. Kawanishi, and T. Narikiyo, “Parameter estimation for the pitching dynamics of a flapping-wing flying robot,” in Proceedings of the IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM '15), pp. 1552–1558, IEEE, Busan, Korea, July 2015.
View at: Publisher Site | Google Scholar
C. Goh and R. Law, “Modeling and forecasting tourism demand for arrivals with stochastic nonstationary seasonality and intervention,” Tourism Management, vol. 23, no. 5, pp. 499–510, 2002.
View at: Publisher Site | Google Scholar
V. Cho, “A comparison of three different approaches to tourist arrival forecasting,” Tourism Management, vol. 24, no. 3, pp. 323–330, 2003.
View at: Publisher Site | Google Scholar
Grupo Aeroportuario Centro Norte, “Historical passenger data base,” June 2016, http://www.oma.aero/en/airports/mazatlan/airlines/passenger-traffic/.
View at: Google Scholar

Copyright

Copyright © 2016 Karen Alicia Aguilar Cruz et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

2033

Downloads

1191

Citations