Abstract

This paper presents an artificial intelligence application using a nonconventional mathematical tool: the radial basis function (RBF) networks, aiming to identify the current plant of an induction motor or other nonlinear systems. Here, the objective is to present the RBF response to different nonlinear systems and analyze the obtained results. A RBF network is trained and simulated in order to obtain the dynamical solution with basin of attraction and equilibrium point for known and unknown system and establish a relationship between these dynamical systems and the RBF response. On the basis of several examples, the results indicating the effectiveness of this approach are demonstrated.

1. Introduction

System identification is the process of deriving a mathematical model of a system using observed data [1]. Once we have the mathematical model of a system we can analyze it (plant or process); however the representation of a physical system is a difficult task. To mitigate this difficulty, an alternative approach is to build models from measured or observed data.

It is important to understand that the model developed on the identification of a certain system represents its approximation; in this paper a mathematical model that represents the real physical system as close as possible is presented. The achievement of such a model is needed for the majority of industrial processes, since knowing the system allows us to control it, therefore the necessity of the identification process.

If the identification is based exclusively on the group of data obtained through measurements and assuming that there is no knowledge of the system dynamics, the identification process is known as the black-box modeling; however, if the laws of physics that conduct the process are used to build a phenomenological model of the system, we have the white-box modeling. Thus certain knowledge about the system is used to improve the empirical modeling; we have a gray-box modeling. Thereby some understanding of the system can facilitate the identification, such as the order of the system, response, sampling time, stability, range, delay time, freedom degree, and basic characteristics of nonlinearity [1]. In many cases we use identification techniques to obtain models that describe the behavior of a system. With such models we intend to describe the relation cause/effect between the input/output variables.

The motivation for the study of system identification techniques comes with the fact that usually we do not know the equations that describe the dynamics of such system and to obtain these equations, depending on the complexity of the system, would be impractical due to time and resources limitations. System identification consists in choosing an appropriate model and adjusting the parameters of the model according to some adaptive law such that the model’s response to an input signal approximates the response of the real system to the same input. System identification is important not only for understanding and predicting the behavior of the process, but also for designing effective control laws [2].

In this study we intend to investigate and analyze the identification of a nonlinear system using a RBF network. The RBF networks are a particular class of function approximators which can be trained, using a set of samples. RBFs have been receiving a growing amount of attention since their initial proposal by Broomhead and Lowe [3].

Other major contributions to the theory, design, and applications of RBF networks can be found, for instance, in papers by Moody and Darken and Poggio and Girosi [46]. Poggio and Girosi explain the use of regularization theory applied to this class of neural networks as a method for improved generalization to new data. The applications of RBF networks in identification and control have been demonstrated, through simulation, by several studies, including [79].

We present a design procedure, for identification of a nonlinear dynamical system using RBF networks. This study also serves to unify kernel methods and system identification and control theory. In Section 2 the motivation for use the Radial Basis Function networks in approximation theory and data modeling is presented. In Section 3 our problem formulation leads to a system identification based on the use of Radial Basis Function networks and the results are used to show that a proposed combination of static RBF networks with dynamical components forms a recurrent network capable of approximating a large class of dynamical systems. The recurrent network is used for describing the dynamics of a nonlinear system (induction machine) or an autonomous dynamic system. This model has sufficient information to predict the system evolution for many inputs. Finally, in Section 4 we discuss the contribution of the paper and draw some final conclusions.

2. Radial Basis Function Neural Networks

Methods of regression and function approximation have gained more attention recently. Not only in math, but also in different areas of knowledge such as engineering, geophysics, signals and images processing, graphic computing, theory of machine learning, neural networks, and data mining, among others.

The function approximation theory consists in approximating a multivariate function by an approximation function , given a fixed number of parameters , where and are vectors. Thus, there are two important aspects of the model to be defined: the function and the parameters of the vector . By choosing the specific , the problem is reduced to determiningthe vector , which offers a better function approximation for a group of data .

The identification strategy used in RBF networks consists in approximating an unknown function with a linear combination of a group of nonlinear functions, known as base functions, in order to predict the value of one or more objective variables, given a set of input variables [10].

The radial basis functions can be seen as a mathematical structure whose objective is to approximate disperse data in multiple dimensions with precision; this approach is frequently applied in the theory of modern approximation [11]. This approach is very useful in the modeling and identification of nonlinear systems. There are many types of basis functions, such as multiquadrics, Cauchy, and Gaussians, among others [10].

The basis functions used in this work are radials; then we can define a function if a univariate function exists, such that where is the Euclidean norm. In other words, we can say that the value of at any point has a fixed distance to the constant origin (or another fixed point named center ). Therefore, is radially (or spherically) symmetric around the center. In definition (1) we used a function based on the Euclidean distance to approach another case of radial basis function, which will be shown next. The radial basis functions networks can be represented in Figure 1.

In our analysis we consider RBF networks with Gaussian activation functions.

2.1. Gaussian Radial Basis Functions

The Gaussian functions are local functions and are widely used in the theory of multivariate functions approximation, usually expressed as

In this type of function the parameter corresponds to the width of the Gaussian function; in other words, it defines the average Euclidean distance (average radius) which measures the dispersion of the data represented by the function of radial basis around its center [12]. The parameter represents how quickly the value of the radial basis function falls to zero, as the input data departs from the center.

The networks using Gaussian radial basis functions produce a nonlinear mapping of the input space into a space of characteristics, usually of high dimensionality, followed by a linear mapping of the hidden space for the output space. This nonlinear transformation is defined by the radial basis functions and the group of weights defines the linear transformation. In Figure 2 we can observe that the Gaussian basis functions are crescent localized functions at and decrescent at .

Another important concept related to the Gaussian radial basis functions is that they are defined positive for each parameter . A function is defined positive if the quadratic form for distinct pairs of different points and [12].

We can observe that the design of a network using radial basis functions can be seen as a problem of curve adjustment, where the learning is equivalent to find a surface in a multidimensional space, mapping the results in the better adjustment to a group of training data.

2.2. Learning Algorithms

Assuming a group of training data, we need to estimate a function as close as possible to the objective function using a RBF network. The universal approximation property states that an optimal solution to the approximation problem exists: finding the corresponding cost function minimum is the goal of the learning algorithms. There are different learning strategies that we can follow in the design of a RBF network, depending on how the centers of the RBF network are specified.

The simplest approach is to assume fixed Gaussian functions defining the activation functions of the hidden units. In particular, the centers locations may be chosen randomly from the training data set and kernel size is fixed according to the spread of the centers. The kernel size of all the Gaussian radial basis functions is fixed at where is the maximum distance between the chosen centers and the number of centers [13].

Another method is the Gradient Descent Optimization used in [14, 15]. For the supervising learning strategies, the RBF network optimization is usually based on a gradient descent learning process that is generally suffered from the local minima problem and slow convergence speed. In this paper, a modified learning scheme is used for computing the adjustable (weight) parameters [13]; the cost functional of this problem is convex with respect to the linear parameters . The optimum values of these parameters are determined by the linear least squares method directly with regularization parameter so that the algorithm convergence is speeded up. The training structure is not initially given, so in the regularization theory the function approximation problem is to find the function that minimizes where and are called the regularization parameter, is the Tikhonov functional, and is differential operator. Differentiating (5) we obtain the Euler-Lagrange equation for : Since is the Dirac delta function using its shifting property, the solution in (6) is

Equation (7) states that minimizing the solution to the regularization problem is a linear superposition of Green functions. The shape of these functions depends only on the differential operator , that is, on the assumptions concerning the mapping characteristics between input and output space. Thus the choice of determines the approximator basis functions. In particular if is invariant for rotation and translation the Green function is [13] represents the expansion center and the term represents the coefficient expansion. So the minimized solution of (7) evaluated in the point is Evaluating (9), we get a system of simultaneous equations: where represents the vector of centers chosen from the group of input data; the equations above can be written in a shorter way as Where is the Gaussians radial basis functions, is the parameters of the model, the equations system above can be written in matrix form as That is, In the compact form, we have where the system matrix has inputs and represents the unknown coefficients vector (weights) defined by Substituting (14) in (15), where is the vector of desired values, we have

In 1986, Micchelli established a relation between disperses data interpolation and positive defined functions. Micchelli demonstrated that, for a class of radial basis functions, the interpolation matrix is nonsingular as long as the data are distinct. This is all that is demanded for the nonsingularity of the interpolation matrix [13].

There are many radial basis functions that are covered by the studies presented by Micchelli, the Gaussian among them, which lead us to conclude that, if we have Gaussian basis functionsgenerating an interpolation matrix , it will be defined as positive matrix, consequently having a nonsingular matrix. It may occur that the interpolation matrix is not square; in this case, it is necessary to use the pseudoinverse matrix in the adjustment problem solution through the least squares. Then we have where represents the pseudoinverse of the matrix .

We can still add a parameter to avoid conditioning problems in the interpolation matrix, in other words so that

When the parameter of regularization is close to zero, the weights vector converges to a solution of minimum norm for the vector error [13].

Using nonlinear basis functions, we allow the function to be a nonlinear function from the input data. The great advantage of functions (5) or (6), which are linear in , is the simplicity of the model, that is, the possibility of obtaining a linear combination in a closed form, through the application of solution techniques based on the least square method. The main problem in a RBF network is the choice of the centers and other parameters, inherent to the structure, for obtaining the best weights group. In the construction of a RBF network, we must take into account some parameters: the radius of the Gaussians , the number of centers, the input vector , and the value of the weights .

The three first requirements, the parameter determination, the number of centers, and the determination of the inputs group, are based on heuristics, while the determination of is done through the minimization of a cost functional. All the parameters must be chosen to minimize the training error through the adjustment of [16].

In this study, the choice of the centers was made from a group of data extracted from the group of input using means algorithm for finding clusters and cluster centers in a set of unlabeled data and is computationally efficient [17] and the parameter σ was adjusted using the Silverman rule [18]. The centers and the parameter of the radial basis function are the parameters of the model, and the parameter controls the smooth of the interpolation. All tests have been made without the use of the regularization parameter defined in (5). The advances in RBF network training during the last couple decades can be seen in many studies [1921].

2.3. The Induction Machine

The induction machine is the most common alternate current (AC) motor, mostly due to its simplicity, robust construction, and low cost. The name induction machine is derived from the fact that the alternating currents are induced in the rotor’s circuit, by the stator magnetic field.

For the identification of an induction machine current plant, we will use a frequency inverter as an element for exciting the motor; in other words, the signal that feeds the machine is modulated in pulses width (PWM: Pulse Width Modulation) [22].

The drive module can be divided in two parts: induction machine and frequency converter. The platform induction machine has the following attributes:(i)three-phase induction machine;(ii)power: 1,5 CV;(iii)number of poles: 4;(iv)speed: 1720 rpm;(v)voltage: 220 V/380 V;(vi)current: 4,43 A/1,56 A.

The frequency converter is made of five parts: rectifier, capacitive filter, preload circuit, drivers, and inverter. The rectifier transforms the three-phased alternate voltage (which feeds the module) into a continuous voltage, which is filtered by the capacitors. The preload circuit allows the currents peak reduction in the initial load of the filter capacitors. The platform is configured to work at 220 V, operating with the motor in triangle.

The PWM inverter is composed of a module containing a IGBTs (Insulated Gate Bipolar Transistor), which receive the actuation command through the PWM modulation generated by the interface. The measure acquisition system is composed of an analog-digital signal conversion board, which allows the acquisition of signals necessary to the proposed identification. The measured signals are currents and three-phase voltages.

3. Results and Discussion

In this section, we present the application of the proposed identification technique (RBF) in the dynamic system analysis (the current plant of the induction motor) based on observed data. Initially, the obtained data from the plant were used for validating and analyzing the model. The data used in the identification process were collected at LACI, Laboratory of Actuation Control and Instrumentation (UFRN), using current sensors.

The current plant of the induction machine is a nonlinear system of discrete time with discrete data; we assume that the nonlinearity of the system is unknown.

The tool used in the identification process is a neural network that uses a Gaussian radial basis function that gradually grows to encompass the adequate and sufficient space in a way that provides an approximation of the system. The RBF is an alternate tool for the solving problems in identification and control, especially for those involving nonlinearities.

The system to be identified has the following attributes.(i)It is a discrete time system.(ii)It is primarily meant for the black box identification.(iii)They are monovariables or SISO, Single Input Single Output, although all cases can be extended to more than one input and/or output.

The dynamic and static characteristics of a system will come out from its excitation signal; in this case the frequencies range of a signal must be carefully chosen when the objective is to excite the linear or nonlinear dynamics of a determined system. However, the amplitude or excursion of a signal is what takes a nonlinear system to achieve different points of operation revealing its nonlinear attributes. We have used as input a sinusoidal signal as shown in Figure 3.

And we have obtained the response signal shown in Figure 4.

In this work, the sampling period of 250 ms (milliseconds) was chosen to generate a good response.

3.1. Determination of the Structure Models

If we consider the plant as a nonlinear function defined by where represents the pair of inputs-output of the SISO plant in the moment . This is a general model in which the output for any moment is the nonlinear function of the values givenby the inputs and outputs. The discrete representation of the system states can be defined as where is an unknown function and is nonlinear in the parameters and . If the function has a derivate around an arbitrary equilibrium point, then the nonlinear system defined in (20) can be approximated by a RBF network.

Results obtained by Powell [23] and Wendland [24] demonstrate that the RBF networks have the property to be a universal function approximators. More precisely, for a determined tolerance admitted for an estimator and an arbitrary nonlinear function , there will be a parameters vector and a matrix , so that

This result is the base to justify the use of RBFs in nonlinear systems identification problems. The RBF universal approximation property guarantees the existence of these weights vector (parameters) which minimize the average error between the system response and the RBF response, so the system can be represented by where is the RBF input vector, the centers vector chosen from the input group using means algorithm [17], and the width of the basis functions that is adjusted using the Silverman rule [18].

The main problem of the identification algorithm is to find the parameters vector w and the function that will build the approximate model for (20). In this paper we will work the identification problem in two different tasks.

First we will find the vector that promotes the best approximation of the desired function through the RBF training using least squares and then we will analyze its dynamic behavior.

Based on some previous knowledge about the plant a model is chosen to identify it. In this study we will be considering this as being a second-order system. The goal is to determine the RBF weights using the least squares method. Figure 5 was implemented.

Analyzing the structure above it can be seen that the data presented to the RBF network is the data from the plant, and the estimated outputs of the RBF are compared with the plant outputs for calculating the average error used in the RBF network free parameters adjustment. In the training 40% of samples were used. This training process is offline and is based on the least squares method presented in the regularization theory.

3.2. Simulations

Figure 6 is a comparison between the outputs obtained in the plant with the outputs obtained from the RBF network from the structure shown in Figures 6 and 7.

The RBF network training process was based in the adjustment of the basis functions parameters from the input data and the centers group; the weights vector was found using the least squares method. After training the network we obtained the best results with the following parameters:

From now on we must be careful not to use the same data used in the training where 40% of samples were used for validation. Since the identified system is nonlinear, both tests have been executed with the system operating in similar conditions, that is, with the same weights group obtained during the training phase.

Once the system is identified and the model validated, the data obtained from the plant can be ignored; we begin to analyze its dynamic testing the structure shown in Figure 8.

The model main attributes are as follows:(i)possibility of representing the nonlinear behavior of the phenomena;(ii)approximate an input/output mapping;(iii)capacity of learning, which was obtained through a training session with input/output representative examples;(iv)capacity of generalization, which allows the RBF to have a satisfactory performance in response to data that do not belong to the training group.(v)tolerance to errors, which allows the RBF to continue presenting acceptable results in case of data noise.

The model presented in Figure 8 is generally referred to as a model of recursive prediction, in other words, neurodynamics model with associative memories (present and past inputs) and input/output mapping. We can describe the model as

The notions of states perform an essential role in the mathematical formulation of dynamic systems; that is, the notions of states perform an essential role in the mathematical formulation of dynamic systems. A state represents the actual quantity of information about the effect of the past inputs applied to the system, where this information is enough to describe its future evolution. The RBF viewed as a nonlinear dynamic system, with particular emphasis on the stability problem, is referred to as a neurodynamics model [13]. If we consider the model described in (25) as a model of states and we add and subtract a constant and positive parameter . We have where so we can say that the RBF network estimated states can be described as

Since are the states of the RBF, , and the adjustment parameters and considering the function as assuming we know the ideal weights, we have if we consider = −1, we have considering the Gaussian radial basis function, we will have the following representation:

In the previous equation the values x are called estimated states and its equations differences the quantities that each moment characterizes the phases or the system evolution. Now we can extend the mentioned concepts to a more interesting geometrical representation, the vector field.

A vector field in is a function , defined in a subset , where every point in is associated with a vector . The most interesting geometrical representation of a vector field is the one that represents as a vector with initial extremity in for each point in [25]. To construct the system’s vector field we can consider the following system: where is the directional variation of the point in a plane, and the plane determined by these two variables is called phase portrait. For the RBF, we can define it as

The main goal is to present the RBF response to different systems (nonlinear) and analyze the graphics. First, the RBF network was trained and simulated in order to obtain the dynamical solution with basin of attraction and equilibrium point for known system. To analyze its behavior and response the Van der Pol oscillator (Nonlinear System with Limit Cycle) was used:

Identifying the nonlinear system above and considering , we obtained the following response using a RBF.

Figure 9 shows the system dynamic along the estimated orbit of the RBF approximations in the dynamic system that can be seen as a stable limit cycle of periodic orbit, stable in the sense that nearby trajectories approach it asymptotically. The attractors represent only the equilibrium states of a dynamic system that was observed experimentally. In the Following, we have the phase diagram plot (the real dynamic system, the current plant of the induction motor).

In Figure 11 we can observe the system nonlinearity region. For better visualization a straight line intercepting the phase diagram (Figure 10 rotated) has been used to put in evidence the nonlinearity of the system.

Once the RBF was trained and the model validated for the unknown system (real dynamic system, the current plant of the induction machine), the parameters (weights) were obtained and the structure shown in Figure 8 was used to analyze all the process dynamic. The RBF networks response shown below is presented along with the current plant phase diagram (in blue) in Figure 10.

In Figure 14, a vector sequence in space can be represented as a trajectory of the system or the model immersed in the space of states created by the RBF network. In Figures, 12, 13, and 14, we show that a RBF network take the form of a nonlinear dynamic equation that manipulates the location of the attractors in a structure of interest. It is possible to establish a relationship between the physics of the electrical current of the induction machine and the algorithms of the computation (RBF) [13].

4. Conclusions

In this paper, we presented a mathematical tool and implemented it to effectively resolve a problem in modern engineering: the plant modeling, supposedly unknown and with significant nonlinear attributes. The RBF networks dynamic behavior was tested and analyzed from data obtained from the induction motor current plant, without provoking the system instability at the same time optimizing some formulated performance standards using mathematical tools that can be algebraically manipulated.

This paper describes the Gaussian radial basis function analysis and synthesis in a typical problem of functions approximation. A peculiar characteristic of these RBFs is the fact that they are universal approximators with a simple architecture and capable of representing complex nonlinear mappings using nonlinear basis functions and adjustment parameters. The RBF networks can be seen as a mathematical structure of multivariate approximation that are frequently applied in modern approximation theory; what makes them a tool that can be used in the modeling and identification of nonlinear systems.

The contributions of this paper are primarily focused in the result presented and in the RBF network training and simulation for obtaining the dynamical solution with basin of attraction and equilibrium point for known and unknown system. Also a relationship between the physics of the machine, the Van der Pol equations, and the RBF response was established. This technique can be used in other related areas such as dynamic fault diagnosis and dynamical pattern recognition. The observation errors are taken as the measure of similarity between the test and training dynamical patterns.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.