#### Abstract

An important objective of health monitoring systems for tall buildings is to diagnose the state of the building and to evaluate its possible damage. In this paper, we use our prototype to evaluate our data-mining approach for the fault monitoring. The offset cancellation and high-pass filtering techniques are combined effectively to solve common problems in numerical integration of acceleration signals in real-time applications. The integration accuracy is improved compared with other numerical integrators. Then we introduce a novel method for support vector machine (SVM) classification, called convex-concave hull. We use the Jarvis march method to decide the concave (nonconvex) hull for the inseparable points. Finally the vertices of the convex-concave hull are applied for SVM training.

#### 1. Introduction

Structural vibration control and structural health monitoring (SHM) technologies are concerned with the safety of building structures. The original problem in SHM is to find the structural damage and its location by performing some statistical pattern recognition on the measured data [1] termed as feature extraction. The damage caused by environmental loads should be repaired; otherwise it will grow with time and may lead to total system failure. Dynamic parameters like acceleration, velocity, and displacement play an important role in determining the structure dynamics [2]. Especially in the case of bridges, displacement is a vital information [3]. Traditional displacement sensors are difficult to install on bridges and cannot be useful especially during a seismic activity. Another popular sensing method is the global positioning system (GPS). But its usage is affected by bad weather, electromagnetic noise, satellite cycling, and cost. Laser Doppler vibrometer (LDV) is another option, which is limited due to its installation scheme and cost [4]. Because the structure of the accelerometer is very simple and it does not need any relative reference position, most of acceleration and tilt measurements use accelerometers [5]. Wireless networks (WNs) have been used to avoid the high cost of traditional generic wired systems [6].

Since the measured acceleration signal from an accelerometer contains offset and low-frequency noise, it is not convenient to integrate the acceleration signal directly. The offset and unknown initial conditions of an accelerometer cause drifts during integration. A drift-free numerical integrator was proposed in [7]. The drift elimination method in the frequency domain was designed in [8]. The main problem of these filter-like numerical integrators is that they have to use large time constants to avoid the drift. Baseline correction is an alternative method to avoid drift during integration [5]. The main problem of the baseline correction integrator is that the low-frequency noise was removed using a window filter designed for a particular input signal, so they cannot be applied for online estimation. Some SHM use observers to estimate velocity and position using the measured acceleration signal [9, 10]. However, the behavior of a filter with a large time constant is far from an ideal integrator.

The structural health monitoring of tall buildings usually uses vibration data. The damage reflects the changes of structural parameters, such as the stiffness and damping coefficients. Only few research used a data-mining technique on SHM. In [11] classification methods were used to determinate the modal parameters, such as the structure’s natural frequencies, the vibration intensity, and the damping coefficients.

Support vector machine (SVM) is a highly desirable classification method, because it offers a hyperplane that represents the largest separation (or margin) between the two classes [12]. However, it needs to solve the quadratic programming (QP) in order to find a separation hyperplane, which causes an intensive computational complexity. A method of reducing training data is to use the geometric properties of SVM [13]. Convex hull has been applied in training SVM [14]. In computational geometry, a number of algorithms are known for computing the convex hull for a finite set of points. The Graham scan [15] finds all vertices of the convex hull ordered along its boundary by computing the direction of the cross-product of the two vectors. The Jarvis march (gift wrapping) [16] identifies the convex hull by angle comparisons and wrapping a string around the point set. The Divide and Conquer method [17] is applicable to the three-dimensional case. The incremental convex hull [18] and quick hull [19] algorithms consist of eliminating some points so that the problems are easily solved. By using a nonconvex loss function, it forms a nonconvex SVM. But some good properties of SVM, for example, the maximum margin, cannot be guaranteed [20], because the intersection parts of data sets are not satisfied convex conditions.

In this paper, a baseline correction scheme is applied to the output of a filter-like numerical integrator. Instead of using a single baseline correction [5] may not guarantee a drift-free integration. By using multiple baseline correction, the time constant of the proposed low-pass filter can be reduced to a smaller value compared to the integrator in [7].

The data-mining technique in this paper does not identify structure parameters but compares the changes from the model output and the real output. Since the model of the building does not change, the vibrations compared with the model can be used to diagnose faults in the building.

We also propose a new algorithm to search the border points, called convex-concave hull. By using the Jarvis march method [16], we first find the convex hulls of the data set. Then concave hulls are formed using the vertices of the convex hull. In this way, the misleading points in the intersection become the vertices of the concave hulls. The classification accuracy is increased a lot compared with the above papers whereas training time is decreased considerably.

Finally, we design a shake bed and a five-level structure to test our methods. The real experiment results give validation of our methods.

#### 2. Numerical Integrator for Accelerometer Measurements

An accelerometer can be regarded as a single-degree-of-freedom (SDOF) mechanical system [21]. It can be modeled using a simple mass often called as the proof-mass, attached to a spring of stiffness and a dashpot with damping coefficient that is attached to a base Here is the acceleration acting on the accelerometer and is the relative acceleration of the proof-mass with respect to the base. Other than the acceleration, the accelerometer output signal contains offset and noise. An accelerometer has a bias termed as offset voltage or -offset measured under the absence of motion or gravity (), which is normally equal to the half of its power supply (). This offset may vary from one sensor to another. The main causes for the offset variation are the sensing material, temperature changes, supply voltage deviation, mechanical stress, and trim errors [22]. This change in the offset from its ideal value is termed as the offset error. The knowledge of this offset error will help to remove the bias from the acceleration signal effectively. The accelerometer output signal can be represented as [21, 23] where is the noise and disturbance effects on the measurement and denotes the -offset. Mathematically the velocity and position are calculated by integrating the acceleration : where and are the initial velocity and position, respectively.

Aliasing is unavoidable when digitizing the analog signals using ADC with a constant sampling frequency. It has been shown that the aliasing can cause low-frequency errors in the acceleration signal [24]. During the analog-to-digital conversion the frequency components above the Nyquist rate are folded back into the bandwidth of interest. Thus the accelerometer output signal in (2.2) can be modified as where is the aliasing content due to sampling. This low-frequency content will be amplified during the integration process. This error can be minimized by using an anti-aliasing filter between the accelerometer and the data acquisition card. The ADC sampling rate needs to be high enough compared to this filter cut-off frequency and the sampling should be done in uniform time intervals.

It is impossible to eliminate the noise and keep the acceleration signal feature at the same time. The aim is to minimize the estimation error caused by , , and . Most of the accelerometers come with antialiasing filters, which minimizes the aliasing component . The acceleration gain can be easily eliminated by performing calibration. In order to overcome the above problems, the proposed method combines the baseline correction and the filter-like integrator effectively. The scheme of the proposed numerical integrator for the accelerometer measurements is shown in Figure 1. The methods for minimizing the effects of and on estimation are discussed below.

Using multiple baseline corrections, a low-pass filter with a small time constant can be achieved to approximate the ideal integrator behavior. The transfer function of the filter-like integrator is where is the gain and is a time constant, which is close to zero. When is zero, the above system reduces to an ideal integrator.

The ideal integrator () amplifies the DC offset quickly. On the other hand, a large reduces the phase accuracy. Selecting an optimal is a trade-off problem between a minimal DC offset and a better phase accuracy. Usually we select . is between and .

As this offset changes with time, this circuit needs frequent calibration, which may be difficult in some cases. Another option is to use a high-pass filter in order to remove the low-frequency DC components. The major drawback of this approach is that the high-pass filter introduces a phase error in the cut-off frequency range. Here it is assumed that the accelerometer is at rest before the seismic activity. In that case, the only output is the -offset. Finally the offset free acceleration is It is clear that the above algorithm can remove the pure DC component completely. As discussed earlier, there exist other sources of offset other than -offset. In that case, the offset Calibration will not be able to eliminate completely the drift from velocity estimation. Now the drifted baseline needs to be calculated, which can be represented in a polynomial form as where are the coefficients to be determined. There are many ways to obtain the coefficients. In this paper, the curve fitting technique using the least-square method (LMS) is used to determine the polynomial coefficients. Once the baseline for the velocity is obtained, it is subtracted from the integrated acceleration signal, to get a drift-free velocity signal. It is found that a second-order polynomial is enough to get a good solution. Increasing the order of the polynomial causes numerical oscillation.

Finally the same methodology is applied to perform the baseline correction for the position estimation. It is found that a third-order polynomial gives a good solution. The position baseline is designed as

In order to remove the low-frequency components in (2.4), a high-pass filter is added after the baseline correction as shown in Figure 1. Here a two-pole high-pass filter is used as in [25]. The transfer function of the single-gain Sallen-Key high-pass filter is where is the filter time constant. The cut-off frequency of the filter is .

#### 3. Convex-Concave Hull Classification

##### 3.1. Building Model

Since we need to compare the real vibration signals and the output of the building model, we first need to model the tall buildings. In the case of high-rise flexible buildings, strong winds cause sickness or psychological responses like anxiety to the occupants and also may damage the fragile items. When the vibrations of taller buildings due to the high wind exceed a limit of 0.15 m/sec^{2}, the human may feel uncomfortable [26]. As a result, the main objective of structural control is to reduce the acceleration response of buildings to a comfortable level. A single-degree-of-freedom structure can be modeled using three components: the mass component , the damping component , and the stiffness component . Among these three components, the stiffness component can be modeled as either a linear or a nonlinear, in other words elastic or inelastic, respectively [27]. Usually the mass is considered as a constant. When an external force is applied to a structure, it produces changes in its displacement , velocity , and acceleration . Using Newton’s law, the equation of motion of a linear elastic system subjected to an external force can be written as
Then the equation of motion of a linear structure with *n-*degree-of-freedom (*n-*DOF) can be expressed as
where and , are the mass, damping and stiffness matrices respectively, , and are the relative acceleration, velocity and displacement vectors respectively and is the external force vector. The equation of the motion of a nonlinear structure subjected to ground acceleration is
The nonlinear force in (3.3) can be modeled using the Bouc-Wen model
In the above expression, introduces the nonlinearity, which satisfies the following condition:
where is the nonlinear time-dependent restoring force and and are the parameters, which controls the shape of the hysteresis loops and system degradation. The variables and control the initial tangent stiffness [28]. In the case of *n-*DOF structures, the nonlinear model can be modified as
where denotes the influence of the excitation force.

##### 3.2. Faults Diagnosis

We assume that accelerometers are installed in the building. The numerical integration outputs are velocity and displacement . The fault diagnosis signals are We define that a damage does not occur when where and , and are previous defined constants. When no damage occurs, we define that the property of is . Otherwise the property of is . In Figure 4, they are marked “o” and “+”.

Because the accelerometers work on-line, the sampled data are huge. Normal classification techniques do not work. We use the convex-concave hull method.

##### 3.3. Convex-Concave Hull

In many real applications of SVM, the data are not perfectly separated, and the kernel methods are not so powerful for nonlinear separation. The closest points in the convex hulls are no longer support vectors. In this case, the soft margin optimization of SVM can be applied directly to the inseparable sets of the convex hulls. The penal parameter affects the optimal performance of SVM. The optimization becomes a tradeoff between a large margin and a small error penalty.

For the intersection parts, the convex hulls have to be reduced such that the inseparable case becomes separable case [29]. The key disadvantage of this reduced convex hull is that the convex hull has to be calculated in each reducing step [14].

In this paper, we first find the convex hulls of the data set. Then we propose a new algorithm to search the border points; the nonconvex hulls are formed by the vertices of the convex hull. In this way, the misleading points in the intersection become the vertices of the concave hulls. The classification accuracy increases a lot compared with the other convex hull methods whereas training time is decreased considerably.

In order to define the vertices of a convex-concave hull, we used the border points definition , which are the outer border points located on the boundaries of the set . The vertices of a convex-concave hull are the border points’ : where is the set of vertices of a convex hull and is the set of the vertices of a nonconvex hull (concave hull); see Figure 2. A set is said to be convex if The border points have the following characteristics. For any two extreme points in , all other points in are located on one side of the line connecting these two points. We use two adjacent extreme points as reference points to detect the concave hull between them. The convex-concave hull searching scheme behaves good if the data set has a uniform distribution of elements in a region. However, the distribution of a data set is unknown in advance. Computing the convex-concave hull directly from is not appropriate for training the SVM classifier. In order to avoid the density problem, we separate into and and then we create partitions ( and ) and ( and ), they will be abbreviated as .

The basic idea here is to obtain regions where the distribution is more uniform than in the original ones. Convex-concave hull searching is then applied on each to reduce the size of whereas preserving most support vectors. The and are always included in regardless whether is linearly separable or inseparable. In addition, points on the intersection of convex hulls are also included in .

The subsets can be quickly created by introducing all points in a binary tree of height and then using the leaves as a version of the original points. Once all points have been introduced in the binary tree, it is possible to look down from height of the tree and take all leaves as a subset .

We propose a grid method to preprocess the data set. Figure 3 shows the data set before process.

There are basically two cases for the two-class classification using SVM: the linearly separable and the linearly inseparable. In the nonlinear case, suitable kernel can be applied to map the nonlinear classification into the linear classification.

In the separable case, , where represents the elements in with label or , respectively. It has been demonstrated that if the data set is a linearly separable one, then the SV corresponds to the closest vertices of and .

In the linear inseparable case, the convex-hulls-intersect. The convex-hull-based methods do not work well for SVM, because SVs are generally located on the exterior boundaries of data distribution; see Figure 5. Further, the reduced convex hull distorts the original data distribution and produces poor classification accuracy.

After we obtain the vertices of the convex-concave hull, , these points are sent to train the SVM classifier, that is, to find an optimal hyperplane or to solve the following quadratic programming problem (primal problem): where is slack variables to tolerate misclassifications , , , is the distance from to the hyperplane , and is a nonlinear function.

In our convex-concave hull SVM classification, the penal factor can be selected very small, because all the misleading points almost disappear by the concave algorithm. The classification accuracy is improved.

The kernel which satisfies the Mercer condition [30] is . Equation (3.11) is equivalent to the following quadratic programming problem which is a dual problem with the Lagrangian multipliers : Many solutions of (3.12) are zero; that is, , so the solution vector is sparse; the sum is taken only over the nonzero . The which corresponds to nonzero is called a support vector. Let be the index set of support vectors, then the optimal hyperplane is The resulting classifier is where is determined by the Kuhn-Tucker conditions. The decision hyperplane

#### 4. Experiments

In this paper, the velocity and position estimations are evaluated in a shaking table; see Figure 6. The accelerometer we used is Summit Instruments 13203B, which is mounted on the SDOF mechanical structure. The sensitive axis of the accelerometer is mounted parallel to the ground to measure the structure acceleration. A linear magnetic encoder (LM15) position sensor with a resolution of 50 m is used for verifying the estimated position data. The building structure base is mounted on the shaking table; see Figure 7. We first evaluate the numerical integrator proposed in this paper. We use the earthquake signal of Loma Prieta East-West on October 17, 1989. The position estimation is shown in Figure 8.

It has been seen that the proposed filter-like integrator is able to estimate the velocity and position in a reasonable manner. Still there is some error found between the estimated and the measured position. This error is caused due to the phase error introduced by the high-pass filter, which resulted in a small phase delay. But it is found that the estimation will be reasonable for the structural control and health monitoring applications.

Then we use our convex-concave hull classification (CCHSVM) to diagnose the fault. We compare our algorithms with the other four SVM classification methods: SMO [31], LIBSVM [32], clustering-based SVM (CSVM) [33], and the reduced convex hull SVM (RCHSVM) [29]. One part of data is shown in Figure 9. The circles represent the normal case. The squares correspond to a stiffness change in the second floor.

We first examine how does the training data size affect the training time and classification accuracy of our convex-concave hull SVM (CCHSVM). We use , , , , and data set samples to train CCHSVM and SMO. For all data sets, the training data are chosen randomly from of the data set; the remainig data are used for testing data set. The comparison results are shown in Table 1. Here means to use data to train SMO, means to use data to train CCHSVM. and are CCHSVM algorithm parameters, is the time to compute border points, is the training time, is the number of support vectors, is the number of vertices of the convex-concave hull, and is the classification accuracy.

We can see that our CCHSVM has less training time than SMO and almost the same classification accuracy with SMO. When the data size is increased, the training time is dramatically increased with SMO, while ours only increases a little. Although the classification accuracy cannot be improved significantly when data size is very large, it does not get worse, and the testing accuracy is still acceptable.

Now we compare our CCHSVM with SMO [31], LIBSVM [32], CSVM [33], and RCHSVM [29] with data set. The comparison results are shown in Table 2.

We can see that compared with the other SVM classifiers, our approach has good classification accuracy while the training is significantly faster than other SVM classifiers. The classification accuracy is higher than the other convex hull SVMs.

#### 5. Conclusions

In this paper, we use two techniques for structural health monitoring of tall buildings: numerical integrator and convex-concave hull classification (CCHSVM). A numerical integrator is proposed for estimating velocity and position from an acceleration signal. This method combines the multiple baseline corrections and the filtering techniques. The proposed integrator is able to produce stable and accurate estimation.

The CCHSVM uses convex-concave hull and grid before process; the CCHSVM overcomes the problems of slow training of SVM and low accuracy of many geometric properties based SVMs. The key point of our method is that we use the Jarvis march method to decide the concave (nonconvex) hull for the inseparable points.

The experimental results demonstrate that the accuracy of the drift-free integrator is increased by adding the offset cancellation filter, and our approach has good classification accuracy while the training is significantly faster than other SVM classifiers. The classification accuracy is higher than the other convex hull SVMs.