Abstract

A recursive gradient identification algorithm based on the bundle method for sandwich systems with backlash-like hysteresis is presented in this paper. In this method, a dynamic parameter estimation scheme based on a subgradient is developed to handle the nonsmooth problem caused by the backlash embedded in the system. The search direction of the algorithm is estimated based on the so-called bundle method. Then, the convergence of the algorithm is discussed. After that, simulation results on a nonsmooth sandwich system are presented to validate the proposed estimation algorithm. Finally, the application of the proposed method to an X-Y moving positioning stage is illustrated.

1. Introduction

Usually, a sandwich system with backlash-like hysteresis is defined as the system that a backlash-like hysteresis is sandwiched between two linear dynamic subsystems. In engineering applications, many mechanical systems such as mechanical transmission systems, servo control systems, and hydraulic valve systems can be described by the so-called sandwich systems with backlash-like hysteresis. The reason to cause the backlash-like hysteresis phenomenon is mainly due to the gaps existing in transmission mechanism systems such as gearbox and ball screw.

Recently, identification of sandwich systems has become one of the interesting issues in the domain of modeling and control for complex systems. References [1โ€“3] proposed the recursive identification methods for the sandwich system with smooth nonlinearities. The main ideas of those approaches are to extend the linear system identification methods to smooth nonlinear cases. Moreover, there have been some methods for the identification of Hammerstein or Wiener systems with backlash-like hysteresis [4โ€“8], most of which are the modified linear system identification methods.

However, until today, there have been very few publications concerning the identification of the sandwich systems with backlash-like hysteresis. Reference [9] proposed a method to identify the sandwich systems with backlash-like hysteresis, but the approach is still based on idea to extend the linear system identification method to nonlinear cases. On the other hand, the switching functions in that method have significant influence on the convergence speed of the algorithm.

In this paper, a recursive gradient algorithm based on the bundle method is proposed to identify parameters of the sandwich model. In this algorithm, the effect of the nonsmoothness caused by the backlash-like hysteresis in sandwich system is considered. In order to obtain the optimizing search direction at the nonsmooth points of the system, the Clarke subgradient technique is utilized based on the idea of the bundle method [10โ€“12]. By comparing with the above-mentioned available methods, the proposed method employs the nonsmooth optimization technique to identify the nonsmooth sandwich systems with backlash-like hysteresis. Thus, it will provide us with a new approach for dealing with on-line modeling of nonsmooth dynamic systems. A numerical example will be presented to evaluate the performance of the proposed approach. Finally, experimental results on an ๐‘‹-๐‘Œ moving positioning stage are illustrated.

2. Brief Description of Sandwich Systems with Backlash

The structure of a sandwich system with backlash-like hysteresis is shown in Figure 1, in which a backlash-like hysteresis is embedded between the input and output linear subsystems, that is, ๐ฟ1(ยท) and ๐ฟ2(ยท). It is assumed that input ๐‘ข(๐‘˜) and output ๐‘ฆ(๐‘˜) can be measured directly, but the internal variables ๐‘ฅ(๐‘˜) and ๐‘ฃ(๐‘˜) are not measurable.

Suppose that both linear subsystems are stable, and the time delays ๐‘ž1 and ๐‘ž2 in ๐ฟ1(ยท) and ๐ฟ2(ยท) are known, respectively. The corresponding discrete-time models of ๐ฟ1(ยท) and ๐ฟ2(ยท) are, respectively, written as ๐‘ฅ(๐‘˜)=โˆ’๐‘›๐‘Ž๎“๐‘–2=1๐‘Ž๐‘–2๐‘ฅ๎€ท๐‘˜โˆ’๐‘–2๎€ธ+๐‘›๐‘๎“๐‘—2=0๐‘๐‘—2๐‘ข๎€ท๐‘˜โˆ’๐‘ž1โˆ’๐‘—2๎€ธ,๐‘ฆ(๐‘˜)=โˆ’๐‘›๐‘๎“๐‘–1=1๐‘๐‘–1๐‘ฆ๎€ท๐‘˜โˆ’๐‘–1๎€ธ+๐‘›๐‘‘๎“๐‘—1=0๐‘‘๐‘—1๐‘ฃ๎€ท๐‘˜โˆ’๐‘ž2โˆ’๐‘—1๎€ธ,(2.1) where ๐‘›๐‘Ž and ๐‘›๐‘ are the orders of ๐ฟ1(ยท), ๐‘ž1 is the time delay, and ๐‘Ž๐‘–2 as well as ๐‘๐‘—2 are the coefficients of ๐ฟ1(ยท); ๐‘›๐‘ and ๐‘›๐‘‘ are the orders of ๐ฟ2(ยท), ๐‘ž2 is the time delay, and ๐‘๐‘–1 and ๐‘‘๐‘—1 are the coefficients of ๐ฟ2(ยท). Let both ๐‘0 and ๐‘‘0 be equal to unity for unique representation.

Note that the backlash-like hysteresis shown in Figure 1 is specified by the slopes ๐‘š1 and ๐‘š2 as well as the absolute thresholds, ๐ท1 and ๐ท2, where 0<๐‘š1<โˆž,0<๐‘š2<โˆž,0<๐ท1<โˆž, and 0<๐ท2<โˆž. Hence, the discrete-time model of the backlash-like hysteresis is described as โŽงโŽชโŽชโŽจโŽชโŽชโŽฉ๐‘š๐‘ฃ(๐‘˜)=1๎€ท๐‘ฅ(๐‘˜)โˆ’๐ท1๎€ธ,๐‘ฅ(๐‘˜)>๐‘ฃ(๐‘˜โˆ’1)๐‘š1+๐ท1๐‘ฃ๐‘ฃ,๐‘ฅ(๐‘˜)>๐‘ฅ(๐‘˜โˆ’1),increasezone,(๐‘˜โˆ’1),(๐‘˜โˆ’1)๐‘š2โˆ’๐ท2๐‘ฃโ‰ค๐‘ฅ(๐‘˜)โ‰ค(๐‘˜โˆ’1)๐‘š1+๐ท1๐‘š,memoryzone,2๎€ท๐‘ฅ(๐‘˜)+๐ท2๎€ธ,๐‘ฅ(๐‘˜)<๐‘ฃ(๐‘˜โˆ’1)๐‘š2โˆ’๐ท2,๐‘ฅ(๐‘˜)<๐‘ฅ(๐‘˜โˆ’1),decreasezone.(2.2)

For the convenience to describe the system, the discrete-time model of the backlash-like hysteresis can be rewritten as ๐‘š(๐‘˜)=๐‘š1+๎€ท๐‘š2โˆ’๐‘š1๎€ธ๐‘”๎€ท(๐‘˜),๐‘ฃ1(๐‘˜)=๐‘š(๐‘˜)๐‘ฅ(๐‘˜)+๐‘”(๐‘˜)๐‘ฅ(๐‘˜)โˆ’๐ท1๐‘”1(๐‘˜)+๐ท2๐‘”2(๎€ธ,๐‘ฃ[๐‘ฃ]๎€ท๐‘”๐‘˜)(๐‘˜)=๐‘ฃ1(๐‘˜)+(๐‘˜โˆ’1)โˆ’๐‘ฃ1(๐‘˜)1๐‘”(๐‘˜)โˆ’1๎€ธ๎€ท2๎€ธ,(๐‘˜)โˆ’1(2.3) where the switching functions๐‘”(๐‘˜), ๐‘”1(๐‘˜), and ๐‘”2(๐‘˜) are, respectively, defined as ๎‚ป๐‘”๐‘”(๐‘˜)=0,ฮ”๐‘ฅ(๐‘˜)>01,ฮ”๐‘ฅ(๐‘˜)โ‰ค0,1(โŽงโŽชโŽจโŽชโŽฉ๐‘˜)=1,๐‘ฅ(๐‘˜)>๐‘ฃ(๐‘˜โˆ’1)๐‘š1+๐ท1๐‘”,๐‘ฅ(๐‘˜)>๐‘ฅ(๐‘˜โˆ’1),0,else,2โŽงโŽชโŽจโŽชโŽฉ(๐‘˜)=1,๐‘ฅ(๐‘˜)<๐‘ฃ(๐‘˜โˆ’1)๐‘š2โˆ’๐ท2,๐‘ฅ(๐‘˜)<๐‘ฅ(๐‘˜โˆ’1),0,else,(2.4) where ฮ”๐‘ฅ(๐‘˜)=๐‘ฅ(๐‘˜)โˆ’๐‘ฅ(๐‘˜โˆ’1).

Thus, (2.1)โ€“(2.3) present the model to describe the sandwich system with backlash-like hysteresis. Hence, the unknown parameter vector of the model can be written as ๐œฝโˆˆ๐‘…๐‘›๐‘Ž+๐‘›๐‘+๐‘›๐‘+๐‘›๐‘‘+4, where ๎€บ๐‘๐œฝ=1,โ€ฆ,๐‘๐‘›๐‘,๐‘Ž1,โ€ฆ,๐‘Ž๐‘›๐‘Ž๐‘š1,๐‘š2,๐ท1,๐ท2,๐‘1,โ€ฆ,๐‘๐‘›๐‘,๐‘‘1,โ€ฆ,๐‘‘๐‘›๐‘‘๎€ป๐‘‡.(2.5) According to concept of the gradient algorithm, define the objective function as ๐‘„๎‚€๎๎‚=๐‘˜,๐œฝ(๐‘˜)๐‘›๎“๐‘˜=1๎‚ƒ๎‚€๎๐‘ฆ(๐‘˜)โˆ’ฬ‚๐‘ฆ๐‘˜,๐œฝ(๐‘˜)๎‚๎‚„22=12๐‘›๎“๐‘˜=1๐‘“๎‚€๎๎‚,๐‘˜,๐œฝ(๐‘˜)(2.6) where ๎๐œฝ is the estimate of ๐œฝ, and ๎ฬ‚๐‘ฆ(๐‘˜,๐œฝ(๐‘˜)) is the output of system model. The optimal estimate of ๎๐œฝ can be obtained by minimizing the above-mentioned criterion.

3. The Nonsmooth Estimation of the Sandwich Model with Backlash-Like Hysteresis

In this section, a gradient-based identification algorithm is proposed for identification of the sandwich system with backlash-like hysteresis. Due to the nonsmoothness of the backlash, the gradients of the system output with respect to the parameters of the backlash at nonsmooth points will not exist. The smooth gradient-based methods directly applied to nonsmooth systems may fail in convergence [13]. On the other hand, the genetic algorithms [14] or Powellโ€™s method [15], which are based on derivative-free techniques, may be unreliable and become inefficient when the system structure is complicated. Thus, we should find a special way for solving this problem. The simplest way to solve the problem is to apply the Clarke subgradients [11] to the approximation of the gradients at the nonsmooth points.

The basic idea of the bundle method is to approximate the subdifferential of ๎๐‘„(๐‘˜,๐œฝ(๐‘˜)) with respect to ๎๐œฝ(๐‘˜) by gathering the subgradients from previous iterations into a bundle for the nonsmooth objective function ๎๐‘„(๐‘˜,๐œฝ(๐‘˜)). The gradient ๎โˆ‡๐‘„(๐‘˜,๐œฝ(๐‘˜)) can change discontinuously, and some change of the gradient may not be small in the neighborhood of the minimum of the function. So the values of ๎๐‘„(๐‘˜,๐œฝ(๐‘˜)) and ๎๐œ•๐‘„(๐‘˜,๐œฝ(๐‘˜)) at a single point ๎๐œฝ(๐‘˜) do not offer sufficient information of the local behavior of ๎๐‘„(๐‘˜,๐œฝ(๐‘˜)). The detail of the bundle method can be found in [10โ€“12] and reference therein.

Considering that the sandwich system with backlash-like hysteresis is locally Lipschitz continuous, we have the following definition.

Definition 3.1 (see [11]). Let ๐น: ๐‘…๐‘›ร—๐‘…โ†’๐‘… be locally Lipschitz continuous. This allows one to define a Clarke subgradient of ๐น at ๐œ‰ as ๐‘‘๐น(๐œ‰): ๎€ฝ๎€ท๐œ‰๐‘‘๐น(๐œ‰)โˆˆ๐œ•๐น(๐œ‰),subjectto๐œ•๐น(๐œ‰)=convโˆ‡๐น๐‘–๎€ธโˆฃ๐œ‰๐‘–๎€ท๐œ‰โŸถ๐œ‰,โˆ‡๐น๐‘–๎€ธ๎€พexists,(3.1) where โ€œconvโ€ denotes the convex hull of a set.
The set of all the Clarke subgradients is the Clarke subdifferential of ๐นat ๐œ‰ which is denoted by ๐œ•๐น(๐œ‰) [11].
Considering that backlash-like hysteresis is a nonsmooth mapping, the gradients of parameters in ๐ฟ1(ยท) with respect to ๐‘ฃ(๐‘˜) do not exist at a nonsmooth point. Hence, we define the parameters of the backlash-like hysteresis and ๐ฟ1(ยท) as ๐ˆ={๐‘š1,๐‘š2,๐ท1,๐ท2,๐‘Ž1โ‹ฏ๐‘Ž๐‘›๐‘Ž,๐‘1โ‹ฏ๐‘๐‘›๐‘}โˆˆ๐‘…4+๐‘›๐‘Ž+๐‘›๐‘. Considering the cost function described by (2.6), the gradients of ๐‘“(โ‹…) with respect to ๐ˆ will not exist at the nonsmooth points. Hence, at the nonsmooth points of ๐‘„(โ‹…), the Clarke subdifferential of ๐‘“(โ‹…) with respect to ๐ˆ, that is, ๐œ•๐‘“(๐ˆ), can be obtained by โŽงโŽชโŽจโŽชโŽฉ๎‚ƒ๐‘ฆ๎‚€๎๐œฝโŽ›โŽœโŽœโŽ๐œ•๐‘“(๐ˆ)=โˆ’conv(๐‘˜)โˆ’ฬ‚๐‘ฆ๐‘˜,(๐‘˜)๎‚๎‚„๐‘›๐‘๎“๐‘—1=0๐‘‘๐‘—1๐œ•ฬ‚๐‘ฃ(๐‘˜โˆ’๐‘—1โˆ’๐‘ž2)โŽžโŽŸโŽŸโŽ โŽซโŽชโŽฌโŽชโŽญ(๐ˆ),(3.2) where ๐œ•ฬ‚๐‘ฃ(๐‘˜โˆ’๐‘—1โˆ’๐‘ž2)ฬ‚๐‘ฃ(๐ˆ)=conv{โˆ‡(๐‘˜โˆ’๐‘—1โˆ’๐‘ž2)(๐ˆ)}, and โˆ‡ฬ‚๐‘ฃ(๐‘˜โˆ’๐‘—1โˆ’๐‘ž2)(๐ˆ) is the gradient of ฬ‚๐‘ฃ, the output of backlash-like hysteresis, with respect to ๐œŽ at the smooth points. Thus, the corresponding gradients of ฬ‚๐‘ฃ(๐‘˜โˆ’๐‘—1โˆ’๐‘ž2) with respect to ๐ˆ at the smooth points are โˆ‡ฬ‚๐‘ฃ(๐‘˜โˆ’๐‘—1โˆ’๐‘ž2)โŽงโŽชโŽชโŽชโŽชโŽชโŽชโŽชโŽจโŽชโŽชโŽชโŽชโŽชโŽชโŽชโŽฉ๎‚ƒ๐‘ข๎€ท(๐ˆ)=๐‘˜โˆ’๐‘—1โˆ’๐‘ž2โˆ’๐‘ž1๎€ธโˆ’๎๐ท1(๐‘˜โˆ’1),0,โˆ’๎๐‘š1(๐‘˜โˆ’1),0,โˆ’๎๐‘š1๎€ท(๐‘˜โˆ’1)ฬ‚๐‘ฅ๐‘˜โˆ’1โˆ’๐‘—1โˆ’๐‘ž2๎€ธ,โ€ฆ,โˆ’๎๐‘š1๎€ท(๐‘˜โˆ’1)ฬ‚๐‘ฅ๐‘˜โˆ’๐‘›๐‘Žโˆ’๐‘—1โˆ’๐‘ž2๎€ธ,๎๐‘š1(๎€ท๐‘˜โˆ’1)๐‘ข๐‘˜โˆ’1โˆ’๐‘—1โˆ’๐‘ž2โˆ’๐‘ž1๎€ธ,โ€ฆ,๎๐‘š1๎€ท(๐‘˜โˆ’1)๐‘ข๐‘˜โˆ’๐‘›๐‘โˆ’๐‘—1โˆ’๐‘ž2โˆ’๐‘ž1๎€ธ๎€ป๐‘‡[],inincreasezones,0,0,0,0,0,โ€ฆ,0,0,โ€ฆ,0๐‘‡๎‚ƒ๎€ท,inmemoryzones,0,๐‘ข๐‘˜โˆ’๐‘—1โˆ’๐‘ž2โˆ’๐‘ž1๎€ธ+๎๐ท2(๐‘˜โˆ’1),0,๎๐‘š2(๐‘˜โˆ’1),โˆ’๎๐‘š2๎€ท(๐‘˜โˆ’1)ฬ‚๐‘ฅ๐‘˜โˆ’1โˆ’๐‘—1โˆ’๐‘ž2๎€ธ,โ€ฆ,โˆ’๎๐‘š2๎€ท(๐‘˜โˆ’1)ฬ‚๐‘ฅ๐‘˜โˆ’๐‘›๐‘Žโˆ’๐‘—1โˆ’๐‘ž2๎€ธ,๎๐‘š2๎€ท(๐‘˜โˆ’1)๐‘ข๐‘˜โˆ’1โˆ’๐‘—1โˆ’๐‘ž2โˆ’๐‘ž1๎€ธ,โ€ฆ,๎๐‘š2๐‘ข๎€ท(๐‘˜โˆ’1)๐‘˜โˆ’๐‘›๐‘โˆ’๐‘—1โˆ’๐‘ž2โˆ’๐‘ž1๎€ธ๎€ป๐‘‡,indecreasezones,(3.3) where โˆ‘ฬ‚๐‘ฅ(๐‘˜)=โˆ’๐‘›๐‘Ž๐‘–2=1ฬ‚๐‘Ž๐‘–2ฬ‚๐‘ฅ(๐‘˜โˆ’๐‘–2โˆ‘)+๐‘›๐‘๐‘—2=0ฬ‚๐‘๐‘—2๐‘ข(๐‘˜โˆ’๐‘ž1โˆ’๐‘—2), and the coefficients ฬ‚๐‘Ž๐‘–2 and ฬ‚๐‘๐‘—2 are the corresponding estimated values at the previous step.
Hence, based on (3.2) and (3.3), the Clarke subdifferential of ๐‘“(โ‹…) with respect to ๐ˆ can be obtained at nonsmooth points of the system. Besides, as ๐ฟ2(ยท) is a smooth function, the gradients of ๐‘“(โ‹…) with respect to the parameters of the linear subsystems ๐ฟ2(ยท) always exist. So, the Clarke subdifferential of ๐‘“(โ‹…) with respect to all the unknown parameters of the sandwich system can be determined.
The proper Clarke subgradient direction t๎(๐‘˜,๐œฝ(๐‘˜)) of ๐‘“(โ‹…) with respect to the parameters to be estimated at nonsmooth points can be derived based on min๐œ‘,๐๎‚ต1๐œ‘(๐‘˜)+2โ€–โ€–๐ญ๎‚€๎๎‚โ€–โ€–๐‘˜,๐œฝ(๐‘˜)2๎‚ถs.t.โˆ’๐›ฝ๐‘—๎‚ฌ๐ก(๐‘˜)+๐‘—๎‚€๎(๐‘˜),๐ญ๐‘˜,๐œฝ(๐‘˜)๎‚๎‚ญโ‰ค๐œ‘(๐‘˜),โˆ€๐‘—โˆˆ๐ฝ๐‘˜,(3.4) where โ€–โ‹…โ€– denotes the Euclidean norm; ๐ฝ๐‘˜ is a nonempty subset of {1,โ€ฆ,๐‘˜}; set ๐œ‘(๐‘˜) is the predicted amount of descent; ๐ก๐‘—๎๐œฝ(๐‘˜)โˆˆ๐œ•๐‘“(๐‘˜,โˆ—๐‘—(๐‘˜)) for ๐‘—โˆˆ๐ฝ๐‘˜, and ๎๐œฝโˆ—๐‘—(๐‘˜) are some trail points (from the past iterations); ๐›ฝ๐‘—(๐‘˜)=max{|๐›ผ๐‘—(๐‘˜)|,๐›พ(๐‘ ๐‘—(๐‘˜))2} is the locality measure of subgradient; ๐›พโ‰ฅ0 is the distance measure parameter (๐›พ=0 if ๎๐‘“(๐‘˜,๐œฝ(๐‘˜)) is convex), ๐›ผ๐‘—๎๎๐œฝ(๐‘˜)=๐‘“(๐‘˜,๐œฝ(๐‘˜))โˆ’๐‘“(๐‘˜,โˆ—๐‘—(๐‘˜))โˆ’๐ก๐‘—๎๎๐œฝ(๐‘˜)(๐œฝ(๐‘˜)โˆ’โˆ—๐‘—(๐‘˜)) is the linearization error; ๐‘ ๐‘—๎๐œฝ(๐‘˜)=โ€–๐‘—๎๐œฝ(๐‘˜)โˆ’โˆ—๐‘—โˆ‘(๐‘˜)โ€–+๐‘˜โˆ’1๐‘–=๐‘—โ€–๎๐œฝ๐‘–+1๎๐œฝ(๐‘˜)โˆ’๐‘–(๐‘˜)โ€– is the distance measure to estimate โ€–๎๎๐œฝ๐œฝ(๐‘˜)โˆ’โˆ—๐‘—(๐‘˜)โ€– without the requirement to store the trial point ๎๐œฝโˆ—(๐‘˜).
According to formula (3.4), ๐ญ(๐‘˜) and ๐œ‘(๐‘˜) are obtained, that is, ๐ญ๎‚€๎๎‚๎“๐‘˜,๐œฝ(๐‘˜)=โˆ’๐‘—โˆˆ๐ฝ๐‘˜๐œ†๐‘˜๐‘—๐ก๐‘—(๎“๐‘˜)=๐‘—โˆˆ๐ฝ๐‘˜๎‚ƒ๎‚€๎๐œฝ๐‘ฆ(๐‘˜)โˆ’ฬ‚๐‘ฆ๐‘˜,๐‘—(๐œ†๐‘˜)๎‚๎‚„๐‘˜๐‘—๐ฐ๐‘—(๎‚€๎๎‚๐‘˜)=๐‘’๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก๐‘—(โ€–โ€–๎โ€–โ€–๐‘˜),(3.5)๐œ‘(๐‘˜)=โˆ’๐‘ก(๐‘˜,๐œฝ(๐‘˜))2โˆ’๎“๐‘—โˆˆ๐ฝ๐‘˜๐œ†๐‘˜๐‘—๐›ฝ๐‘—(๐‘˜),(3.6) where ๐ฐ๐‘—๎๎๐œฝ(๐‘˜)=๐œ•ฬ‚๐‘ฆ(๐‘˜,๐œฝ)/๐œ•๐œฝ|=๎๐œฝ๐‘—(๐‘˜), ๎๐‘’(๐‘˜,๐œฝ(๐‘˜))=๎๐œฝ๐‘ฆ(๐‘˜)โˆ’ฬ‚๐‘ฆ(๐‘˜,๐‘—(๐‘˜)), โˆ’๐ก๐‘—โˆ‘(๐‘˜)=๐‘—โˆˆ๐ฝ๐‘˜๐œ†๐‘˜๐‘—๐ฐ๐‘—(๐‘˜),๐œ†๐‘˜๐‘—โ‰ฅ0, and โˆ‘๐‘—โˆˆ๐ฝ๐‘˜๐œ†๐‘˜๐‘—=1.

Remark 3.2. If ๐‘“(โ‹…) is convex, the model ๎๐‘“(๐‘˜,๐œฝ(๐‘˜)) is an underestimate for ๐‘“(โ‹…), and the nonnegative linearization error ๐›ผ๐‘—(๐‘˜) measures the performance of an approximation of the model to the original cost function. If ๐‘“(โ‹…) is nonconvex, these facts are not valid anymore because ๐›ผ๐‘—(๐‘˜) may have a small or even negative value, although the trial point ๎๐œฝโˆ—๐‘—(๐‘˜) locates far away from the current iteration point ๎๐œฝ(๐‘˜), and thus, the corresponding subgradient ๐ก๐‘—(๐‘˜) is worthless. For these reasons, the locality measure of subgradient ๐›ฝ๐‘—(๐‘˜) is introduced.
Therefore, the proposed recursive gradient estimation algorithm based on bundle method for the sandwich model with backlash-like hysteresis is shown as follows.

Step 1. Select starting point ๐œฝ0โˆˆ๐‘…๐‘›๐‘Ž+๐‘›๐‘+๐‘›๐‘+๐‘›๐‘‘+4 and stopping parameter ๐›ฟ>0. Calculate ๐‘“(๐‘˜,๐œฝ0) and vector ๐ก๐‘—(๐‘˜)โˆˆ๐œ•๐‘“(๐‘˜,๐œฝ0), where ๐‘—โˆˆ๐ฝ๐‘˜, ๐ฝ๐‘˜={๐‘˜0}, |๐ฝ๐‘˜|โ‰ค๐‘˜1, |๐ฝ๐‘˜| is the element number of ๐ฝ๐‘˜, and ๐‘˜1 is a given positive number. Set ๐›ฝ๐‘—(๐‘˜)=0, ๐‘˜=๐‘˜0 and the line search parameters ].๐‘žโˆˆ(0,0.5),๐‘žโˆ—โˆˆ(๐‘ž,1),๐œ‚(0)โˆˆ(0,1(3.7)

Step 2. Calculate optimal solution ๎(๐œ‘(๐‘˜),๐ญ(๐‘˜,๐œฝ(๐‘˜))) based on formulas (3.2)โ€“(3.6). If ๐œ‘(๐‘˜)โ‰ฅโˆ’๐›ฟ, then stop.

Step 3. Search for the largest step size ๐œ‚(๐‘˜)โˆˆ[0,1] such that ๐œ‚(๐‘˜)โ‰ฅ๐œ‚(0) and if ๐‘“๎‚€๎๎‚๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)+๐œ‚(๐‘˜)๐ญ(๐‘˜)โ‰ค๐‘“๐‘˜,๐œฝ(๐‘˜)+๐‘ž๐œ‚(๐‘˜)๐œ‘(๐‘˜),(3.8) it holds ๎‚€๎๎‚๎‚€๎๎‚๐œ‘(๐‘˜)=๐‘“๐‘˜,๐œฝ(๐‘˜)+๐ญ(๐‘˜)โˆ’๐‘“๐‘˜,๐œฝ(๐‘˜)<0.(3.9) Then, we take a long step and set ๎๎๐œฝ(๐‘˜+1)=๐œฝ(๐‘˜)+๐œ‚(๐‘˜)๐ญ(๐‘˜) and ๎๐œฝโˆ—๎(๐‘˜+1)=๐œฝ(๐‘˜+1); go to Step 4.
Otherwise, if 0<๐œ‚(๐‘˜)<๐œ‚(0), and formula (3.8) holds, then we take a short step and set ๎๎๐œฝ(๐‘˜+1)=๐œฝ(๐‘˜)+๐œ‚(๐‘˜)๐ญ(๐‘˜), and ๎๐œฝโˆ—๎(๐‘˜+1)=๐œฝ(๐‘˜)+๐œ‚โˆ—(๐‘˜)๐ญ(๐‘˜) where ๐œ‚โˆ—(๐‘˜)>๐œ‚(๐‘˜). Go to Step 5.
If ๐œ‚(๐‘˜)=0, and formula (3.8) holds, we take a null step, and namely set ๎๎๐œฝ(๐‘˜+1)=๐œฝ(๐‘˜) and ๎๐œฝโˆ—๎(๐‘˜+1)=๐œฝ(๐‘˜)+๐œ‚โˆ—(๐‘˜)๐ญ(๐‘˜); go to Step 5.

Step 4. Let ๐ฝ๐‘˜=๐ฝ๐‘˜โˆช{๐‘˜+1}, ๐‘˜=๐‘˜+1; if ๐‘˜โ‰ค๐‘˜1, then ๐ฝ๐‘˜={1,โ€ฆ,๐‘˜}, and if ๐‘˜>๐‘˜1, then ๐ฝ๐‘˜=๐ฝ๐‘˜โˆ’1โˆช{๐‘˜}โงต{๐‘˜โˆ’๐‘˜1}, then go to Step 2.

Step 5. ๐ฝ๐‘˜=๐ฝ๐‘˜โˆช{๐‘˜+1}, ๐‘˜=๐‘˜+1; if ๐‘˜โ‰ค๐‘˜1, then ๐ฝ๐‘˜={1,โ‹ฏ,๐‘˜}, and if ๐‘˜>๐‘˜1, then ๐ฝ๐‘˜=๐ฝ๐‘˜โˆ’1โˆช{๐‘˜}โงต{๐‘˜โˆ’๐‘˜1}, and the proper Clarke subgradient ๐ก๐‘—(๐‘˜) satisfies โˆ’๐›ฝ๐‘—๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)+๐ก๐‘‡๐‘—๎‚€๎๎‚๐ญ๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)๐‘˜,๐œฝ(๐‘˜โˆ’1)โ‰ฅ๐‘žโˆ—๐œ‘๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜โˆ’1),(3.10) then go to Step 2.

Remark 3.3. In long step, there is an obvious decrease in the value of the objective function. Hence, it is unnecessary to detect discontinuities in the gradient of ๐‘“(โ‹…). Thus, we just set ๐ก๐‘—๎(๐‘˜)โˆˆ๐œ•๐‘“(๐‘˜,๐œฝ(๐‘˜)). On the other hand, in short steps and null steps, the gradient of ๐‘“(โ‹…) is discontinuous. Then, based on (3.10), both ๎๐œฝ(๐‘˜) and ๎๐œฝโˆ—(๐‘˜) located on the opposite sides of this discontinuity are guaranteed, and the new subgradient ๐ก๐‘—(๐‘˜)โˆˆ๐œ•๐‘“(๐‘˜,๐œฝโˆ—(๐‘˜)) will force an obvious modification of the next search direction. Hence, the algorithm approximates the effectively searching direction at nonsmooth points based on the bundle method, which cannot be realized by the smooth optimization techniques.

Remark 3.4. If the value of ๐œ‚(0) is too small, the convergence speed will be very sluggish, while ๐œ‚(0) is too large, and the algorithm may not be convergent. Hence, it is important for ๐œ‚(0) to be chosen properly. Usually, ๐œ‚(0) is chosen based on an empirical method.

Remark 3.5. If all the Clarke subgradients are included in ๐ฝ๐‘˜, the corresponding storage capacity is infinite. Hence, the number of the subgradients in ๐ฝ๐‘˜ must be constrained. In the proposed algorithm, we give the upper bound of |๐ฝ๐‘˜|โ‰ค๐‘˜1, and the upper bound ๐‘˜1 is specified by empirical method.

4. Convergence of the Estimation

For the convergence of the above-mentioned estimation algorithm, we have the following

Theorem 4.1. Suppose that ๐œ‚(๐‘˜) and ๐›ฝ๐‘—(๐‘˜) satisfy 0โ‰ค๐œ‚(๐‘˜)โ‰ค2๐‘’2(๐‘˜)โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)โˆ’๐›ฝ๐‘—(๐‘˜)๐‘’2(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)โˆ’๐ก๎‚ต๎‚ธ(๐‘˜)1+โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡,๐›ฝ(๐‘˜)๎‚น๎‚ถ(4.1)๐‘—(๐‘˜)โ‰ค2๐‘’2(๎‚ธ๐‘˜)โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๎‚น๐‘˜),(4.2) respectively, then the parameters ๐œฝ can be convergent to a local optimal value.

Proof. The proof of this theorem can be found in Appendix.

5. Simulation

The proposed approach is used to identify a numerical sandwich system with backlash-like hysteresis based on the measured system input and output. Suppose that the parameters of the backlash-like hysteresis in the system are ๐‘š1=1, ๐‘š2=1.2, ๐ท1=0.5, and ๐ท2=0.6. The linear subsystems ๐ฟ1(ยท) and ๐ฟ2(ยท) are ๐‘ฅ(๐‘˜)=โˆ’0.1๐‘ฅ(๐‘˜โˆ’1)โˆ’0.2๐‘ฅ(๐‘˜โˆ’2)+1.5๐‘ข(๐‘˜โˆ’1),(5.1)๐‘ฆ(๐‘˜)=โˆ’1.2๐‘ฆ(๐‘˜โˆ’1)โˆ’0.32๐‘ฆ(๐‘˜โˆ’2)+2๐‘ฃ(๐‘˜โˆ’1)โˆ’0.1๐‘ฃ(๐‘˜โˆ’2),(5.2) respectively.

That implies ๐‘Ž1=0.1,๐‘Ž2=0.2, ๐‘0=1.5, ๐‘1=1.2, ๐‘2=0.32,๐‘‘0=2, and ๐‘‘1=โˆ’0.1. In the simulation, both ๐‘0 and ๐‘‘0 are assumed to be equal to unity for model uniqueness, which implies that the corresponding equivalent true values of the coefficients are ฬƒ๐‘Ž1=0.1, ฬƒ๐‘Ž2=0.2, ฬƒ๐‘1=1.2, ฬƒ๐‘2=0.32, ๎‚๐‘‘1=โˆ’0.05, ๎‚๐‘š1=3, ๎‚๐‘š2=3.6, ๎‚๐ท1=0.33, and ๎‚๐ท2=0.4, respectively, but this does not affect the properties of the whole system.

In the simulation, the signal to excite the system is a random sequence with variance ๐œŽ2=0.49. Choose ๐›ฟ=1.0ร—10โˆ’4. In the proposed algorithm, based on Remark 3.4, select ๐œ‚(0)=0.015,๐‘˜1=6, ๐œฝ0=[0,0,0,0,0.1,0.1,0.1,0.1,0]๐‘‡, and ๐›ฝ1(๐‘˜0)=0, respectively. For comparison, the traditional gradient method is also used to estimate the parameters of the system. In this method, the nonsmooth points of the system are omitted for the gradients of the system do not exist at nonsmooth points. The initialized values of the parameters are the same as those used in the proposed method. The optimizing step is chosen as 0.009.

Figure 2 illustrates the comparison of the estimated parameter convergence procedures between the proposed method and the traditional gradient method. In Figure 2, blue and solid lines denote the convergence procedures of the parameters estimated by the proposed method, while red and dotted lines show the convergence procedures of the parameters determined by the traditional gradient method. From Figure 2, we note that the parameters of the backlash-like hysteresis converge slower than those of the linear submodels especially the input linear submodel. Moreover, the proposed method has achieved faster convergence than that of the traditional gradient method. It is noticed that the oscillation and sharp jumps happened in the estimation procedure of the traditional gradient approach.

In the case that the system is affected by random noise, the proposed strategy can still obtain better convergence of parameter estimation. In the simulation with noise, the signal to noise rate (SNR) is equal to 46.5. All the initial values of the parameters are the same as those in the noise-free case.

Figure 3 shows the comparison of the convergence procedures of the estimated parameters in the case with noise between the proposed method and the traditional gradient approach. Similar to the noise-free case, the blue and solid lines denote the convergence procedures of the parameters estimated by the proposed method, while the red and dotted lines show the convergence procedures of the parameters estimated by the traditional gradient method. Obviously, the proposed method has obtained faster convergent results than the traditional gradient method.

6. Application to an ๐‘‹-๐‘Œ Moving Positioning Stage

The proposed identification approach is also applied to the modeling of an ๐‘‹-๐‘Œ moving positioning stage with the architecture shown in Figure 4. In this equipment, the movement of the work platform of each axis is driven by a DC servomotor through a ball-screw-nut mechanism which transforms the rotational shaft movement into linear displacement. The servomotor is controlled by a digital signal processor (TMS320LF-2407A). The displacement of each axis is measured by a linear encoder (RGF2000H125B). The signals of both phase A and phase B encoders are decoded by a quadrature decoding circuit which is based on the decoding chip (Agilent HCTL-2020).

In this system, the servomotor can be considered as a second-order linear dynamic subsystem. The movement of the work platform is also described by a linear second-order dynamic model. Due to the inherent characteristic, both dead zone and backlash-like hysteresis exist in this system. In order to simplify the identification procedure, the dead zone is compensated by a dead zone inverse model-based compensator. Thus, in the identification, only the effect of backlash-like hysteresis existing in the ball-screw-nut mechanism will be considered. Therefore, the identified system is actually a typical sandwich system with backlash-like hysteresis. In this section, only the identification procedure of axis A will be presented due to the limited space. The corresponding models used to describe the behavior of axis A are shown as follows:(1)the input linear model (๐ฟ1): ๐‘ฅ(๐‘˜)=โˆ’๐‘Ž1๐‘ฅ(๐‘˜โˆ’1)โˆ’๐‘Ž2๐‘ฅ(๐‘˜โˆ’2)+๐‘0๐‘ข(๐‘˜โˆ’1),(6.1)(2)the model of the backlash-like hysteresis: โŽงโŽชโŽชโŽจโŽชโŽชโŽฉ๐‘š๐‘ฃ(๐‘˜)=1๎€ท๐‘ฅ(๐‘˜)โˆ’๐ท1๎€ธ,๐‘ฅ(๐‘˜)>๐‘ฃ(๐‘˜โˆ’1)๐‘š1+๐ท1๐‘ฃ๐‘ฃ,๐‘ฅ(๐‘˜)>๐‘ฅ(๐‘˜โˆ’1),(๐‘˜โˆ’1),(๐‘˜โˆ’1)๐‘š2โˆ’๐ท2๐‘ฃโ‰ค๐‘ฅ(๐‘˜)โ‰ค(๐‘˜โˆ’1)๐‘š1+๐ท1,๐‘š2๎€ท๐‘ฅ(๐‘˜)+๐ท2๎€ธ,๐‘ฅ(๐‘˜)<๐‘ฃ(๐‘˜โˆ’1)๐‘š2โˆ’๐ท2,๐‘ฅ(๐‘˜)<๐‘ฅ(๐‘˜โˆ’1),(6.2)(3)the output linear model (๐ฟ2): ๐‘ฆ(๐‘˜)=โˆ’๐‘1๐‘ฆ(๐‘˜โˆ’1)โˆ’๐‘2๐‘ฆ(๐‘˜โˆ’2)+๐‘‘0๐‘ฃ(๐‘˜)+๐‘‘1๐‘ฃ(๐‘˜โˆ’1),(6.3)where ๐‘ฆ(๐‘˜) is the moving speed of the work platform.

Based on the operating requirement, a sequence of square wave plus sinusoidal wave is used to excite the system within the operating range. The corresponding amplitude of the input varies in the range between โˆ’1.09โ€‰V and 1.05โ€‰V, and the sample period is 0.5โ€‰ms.

In this model, both ๐‘0 and ๐‘‘0 are set to one. The initial values of the other parameters are chosen as ๐œ‚(0)=0.00116, ๐œ‡=1, ๐œฝ0=[0,0,0,0,1,1,0.001,0.001,0]๐‘‡, and ๐›ฝ(๐‘˜0)=0. After 6700 steps, the convergence of the estimation is achieved. Figure 5 illustrates the corresponding procedure of the parameter estimation. It shows that the estimate procedure converges quickly. Figure 6 shows the corresponding mean square error (MSE) of the parameter estimation. We can see that the MSE is decreased sharply in the beginning, at the 180th step, and a local minimum can be found. After that, the algorithm jumps out of the local minimum, and the corresponding MSE gradually converges to a constant of about 0.4.

Then, the corresponding model validation result is shown in Figure 7(a), while Figure 7(b) shows the comparison of the input-output plots between the proposed model and the real data. The maximum relative modeling error is less than 11%. Moreover, it is obvious that the obtained model can accurately approximate the behavior of the ๐‘‹-๐‘Œ moving positioning stage. Hence, it can be concluded that the proposed identification method is rather promising in engineering application.

7. Conclusion

In this paper, a recursive gradient-based identification algorithm for the sandwich system with backlash-like hysteresis is proposed. The subgradient is applied to the search of gradient direction at the nonsmooth points of the system. In order to find the proper search direction at the nonsmooth points, the technique of so-called bundle method is utilized. Simulation results have shown that the proposed algorithm has provided us with an option for identification of nonsmooth dynamic systems, and it provides a novel method to identify the more complicated nonsmooth systems. The experimental results of ๐‘‹-๐‘Œ stage also show that the proposed method has potential in engineering applications.

Appendix

Based on (3.5) and Step 3 of the algorithm, it is obtained ๎๎๎‚€๎๎‚๐œฝ(๐‘˜+1)=๐œฝ(๐‘˜)+๐œ‚(๐‘˜)๐‘’๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก(๐‘˜).(A.1)

Subtracting the local optimal value ๐œฝ1 from both sides of (A.1), it leads to ๎๐œฝ(๐‘˜+1)โˆ’๐œฝ1=๎๐œฝ(๐‘˜)โˆ’๐œฝ1๎‚€๎๎‚+๐œ‚(๐‘˜)๐‘’๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก(๐‘˜).(A.2)

Rewrite (A.2) as ๎‚๎‚๎‚€๎๎‚๐œฝ(๐‘˜+1)=๐œฝ(๐‘˜)+๐œ‚(๐‘˜)๐‘’๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก(๐‘˜),(A.3) where ๎‚๎๐œฝ(๐‘˜+1)=๐œฝ(๐‘˜+1)โˆ’๐œฝ1.

Choose the quadratic function as ๎‚๐œฝ๐ฟ(๐‘˜+1)=๐‘‡๎‚(๐‘˜+1)๐œฝ(๐‘˜+1)+๐‘’2๎‚€๎๎‚.๐‘˜,๐œฝ(๐‘˜+1)(A.4)

According to (A.3), it leads to ๎‚๐œฝ๐‘‡(๎‚๎‚๐œฝ๐‘˜+1)๐œฝ(๐‘˜+1)โˆ’๐‘‡(๎‚๎‚€๎๎‚๎‚๐œฝ๐‘˜)๐œฝ(๐‘˜)=2๐œ‚(๐‘˜)๐‘’๐‘˜,๐œฝ(๐‘˜)๐‘‡(๐‘˜)โˆ’๐ก(๐‘˜)+๐œ‚2(๐‘˜)๐‘’2๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)โˆ’๐ก(๐‘˜).(A.5)

As ๐‘“(๐‘˜,๐œฝ)=[๐‘ฆ(๐‘˜)โˆ’ฬ‚๐‘ฆ(๐‘˜,๐œฝ)]2, if (3.5) holds, then the cutting-plane model is ฬ‚๐‘’1๎‚ป๎‚ƒ๎‚€๎(๐‘˜,๐œฝ)=max๐‘ฆ(๐‘˜)โˆ’ฬ‚๐‘ฆ๐‘˜,๐œฝ(๐‘˜)๎‚๎‚„2๎‚€๎๎๐œฝ+2๐‘’๐‘˜,๐œฝ(๐‘˜)๎‚๎‚€๐‘‡(๐‘˜)โˆ’๐œฝ๐‘‡๎‚โˆ’๐ก(๐‘˜)โˆ’๐›ฝ๐‘—๎‚ผ(๐‘˜).(A.6)

Based on the definitions of ๐›ฝ๐‘—(๐‘˜) and ๐›ผ๐‘—(๐‘˜), as well as the idea of bundle method, we know that ฬ‚๐‘’1๎(๐‘˜,๐œฝ)โ‰ค๐‘“(๐‘˜,๐œฝ(๐‘˜)). Thus, considering (A.6) yields ๎‚€๎๎๐œฝ2๐‘’๐‘˜,๐œฝ(๐‘˜)๎‚๎‚€๐‘‡(๐‘˜)โˆ’๐œฝ๐‘‡๎‚โˆ’๐ก(๐‘˜)โ‰ค๐›ฝ๐‘—(๐‘˜).(A.7)

Choosing ๐œฝ=๐œฝ1 and holding (A.7) yield ๎‚€๎๎๐œฝ2๐‘’๐‘˜,๐œฝ(๐‘˜)๎‚๎‚€๐‘‡(๐‘˜)โˆ’๐œฝ๐‘‡1๎‚โˆ’๐ก(๐‘˜)โ‰ค๐›ฝ๐‘—(๐‘˜).(A.8)

Based on (A.5) and (A.8), we obtain ๎‚๎‚๐œฝ๐œฝ(๐‘˜+1)๐‘‡๎‚๎‚๐œฝ(๐‘˜+1)โˆ’๐œฝ(๐‘˜)๐‘‡(๐‘˜)โ‰ค๐œ‚(๐‘˜)๐›ฝ๐‘—(๐‘˜)+๐œ‚2(๐‘˜)๐‘’2๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)โˆ’๐ก(๐‘˜).(A.9)

According to the idea of the gradient algorithm and [16], we know that ๐‘’๎‚€๎๎‚๎‚€๎๎‚๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜+1)=๐‘’๐‘˜,๐œฝ(๐‘˜)+ฮ”๐‘’๐‘˜,๐œฝ(๐‘˜).(A.10)

Hence, the change of ๐‘’(๐‘˜) is written as ๎‚€๎๐œฝ๎‚=โŽกโŽขโŽขโŽฃ๎‚€๎๎‚ฮ”๐‘’๐‘˜,(๐‘˜)๐œ•๐‘’๐‘˜,๐œฝ(๐‘˜)โŽคโŽฅโŽฅโŽฆ๐œ•๐œฝ(๐‘˜)๐‘‡ฮ”๐œฝ(๐‘˜)=โˆ’โˆ’๐ก(๐‘˜)ฮ”๐œฝ(๐‘˜).(A.11)

According to (3.5) and Step 3 of the algorithm, it yields: ๎‚€๎๎‚ฮ”๐œฝ(๐‘˜)=๐œ‚(๐‘˜)๐ญ(๐‘˜)=๐œ‚(๐‘˜)๐‘’๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜).(A.12)

From (A.11) and (A.12), we get ๎‚€๎๎‚๎‚€๎๎‚ฮ”๐‘’๐‘˜,๐œฝ(๐‘˜)=โˆ’๐œ‚(๐‘˜)๐‘’๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜).(A.13)

According to (A.10) and (A.13), we obtain ๐‘’2๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜+1)โˆ’๐‘’2๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)=โˆ’2๐œ‚(๐‘˜)๐‘’2๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)+๐œ‚2(๐‘˜)๐‘’2๎‚€๎๎‚๎‚ธ๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡๎‚น(๐‘˜)2.(A.14) Based on (A.4), (A.9), and (A.14), it leads to the following: ๐ฟ(๐‘˜1)โˆ’๐ฟ(๐‘˜)โ‰ค๐œ‚(๐‘˜)๐›ฝ๐‘—(๐‘˜)โˆฃ๐œ‚2(๐‘˜)๐‘’2๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)โˆ’๐ก๎‚ต๎‚ธ(๐‘˜)1+โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)๎‚น๎‚ถโˆ’2๐œ‚(๐‘˜)๐‘’2๎‚€๎๎‚๐‘˜,๐œฝ(๐‘˜)โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)(A.15) if 0<๐œ‚(๐‘˜)โ‰ค(2๐‘’2๎(๐‘˜,๐œฝ(๐‘˜))โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)โˆ’๐›ฝ๐‘—(๐‘˜))/๐‘’2๎(๐‘˜,๐œฝ(๐‘˜))โˆ’๐ก๐‘‡(๐‘˜)โˆ’๐ก(๐‘˜)(1+[โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)]) and ๐›ฝ๐‘—(๐‘˜)<2๐‘’2๎(๐‘˜,๐œฝ(๐‘˜))[โˆ’๐ก(๐‘˜)โˆ’๐ก๐‘‡(๐‘˜)], we have ๐ฟ(๐‘˜+1)โˆ’๐ฟ(๐‘˜)โ‰ค0.(A.16) Hence, the parameters ๐œฝ can be convergent to a local optimal value.

Acknowledgments

This work was supported by the projects of Shanghai Normal University (DZL811, DRL904, and DYL201005); the projects of Shanghai Education Commission (11YZ92); the project of NSFC (Grant nos. 60971004 and 61171088); the projects of Science and Technology. Commission of Shanghai (09220503000, 10JC1412200, and 09ZR1423400).