Abstract

Financial distress prediction plays an important role in the survival of companies. In this paper, a novel biorthogonal wavelet hybrid kernel function is constructed by combining linear kernel function with biorthogonal wavelet kernel function. Besides, a new feature weighted approach is presented based on economic value added (EVA) and grey relational analysis (GRA). Considering the imbalance between financially distressed companies and normal ones, the feature weighted one-class support vector machine based on biorthogonal wavelet hybrid kernel (BWH-FWOCSVM) is further put forward for financial distress prediction. The empirical study with real data from the listed companies on Growth Enterprise Market (GEM) in China shows that the proposed approach has good performance.

1. Introduction

Financial distress is a term in corporate finance used to indicate a condition in which a company suffers serious losses and becomes unable to cover its liabilities. Sometimes financial distress can lead to bankruptcy [1]. Financial distress prediction has been a practical tool for distinguishing distressed companies from healthy ones, and it also helps safeguard the survival of companies [2, 3].

Since the 1960s, enormous efforts have been made to construct efficient financial distress prediction models, but improving the models’ accuracy is still a challenging task. Many statistical methods such as discriminant analysis (DA) [4], logistic regression [5], and probit regression [6] were applied to financial distress prediction in early studies [7]. After that, artificial intelligence methods such as genetic algorithms [8], rough sets [9], case-based reasoning [10], and artificial neural networks (ANNs) [11] were proposed. In recent years, a number of studies on financial distress prediction have been published concerning the support vector machine (SVM), since the SVM performs better in nonlinear approximation and is less prone to local optimal solutions.

The SVM is a typical class of machine learning algorithms whose main idea is to minimize the upper bound of the generalization error. Usually it maps the input vectors into a high-dimensional feature space through some nonlinear mapping. The SVM is a powerful method for classification and has shown promising performance in financial distress prediction. Huang et al. evaluate the performance of the SVM in credit rating prediction and show that the model based on the SVM has better explanatory power than the model based on back propagation neural networks (BPNN) [12]. Shin et al. investigate the efficacy of applying the SVM to bankruptcy prediction and the results demonstrate that the SVM approach also outperforms the BPNN in prediction [13]. Härdle et al. propose an application of visualization techniques at different stages of corporate default analysis based on the SVM and also conclude that the SVM model consistently dominates the DA and logistic regression with regard to accuracy ratio [14].

The SVM is a specific type of learning algorithm characterized by the use of kernel functions, which realize the nonlinear mapping from the original space to a high-dimensional feature space. The kernel function has received tremendous attention as it has a significant impact on the performance and prediction accuracy of the SVM. Kernels commonly used in the SVM include the Gaussian kernel, polynomial kernel, and spline kernel. Wavelets have been widely used in constructing kernel functions recently because of their excellent performance in nonlinear data processing. Zhang et al. propose an SVM based on a Morlet wavelet kernel and the experiments show its feasibility and validity in approximation and classification [15]. Wei presents the SVM based on the Morlet wavelet kernel, which yields more accurate prediction than the SVM based on the Gaussian kernel [16]. Yang and Wang apply the SVM based on the Morlet and Mexican hat wavelet kernels to improve the detection accuracy and reduce the false positive rate of distributed denial of service attack detection [17]. Among the wavelets, the biorthogonal wavelet excels in nonlinear signal approximation with high smoothness, compact support, and high vanishing moment. The biorthogonal wavelet is derived from a multiresolution analysis generated by a pair of dual scaling functions and has been found very useful in applications such as signal processing and image compression [18]. To the best of our knowledge, few studies have addressed the construction of a biorthogonal wavelet kernel function.

In order to take full advantage of different kernel functions’ mapping abilities, hybrid kernel functions that combine multiple kernel functions have been emphasized. Linear and nonlinear combinations are the main methods to construct a hybrid kernel function. Linear combinations of polynomial and Gaussian kernels [19] and of wavelet and RBF kernels [20] have been employed, and the results demonstrate that SVMs with hybrid kernels beat all single-kernel models in learning performance and prediction accuracy. Some researchers concentrate on the nonlinear combination of multiple kernels for SVMs so that useful information generated from the nonlinear interaction of different kernels will not be neglected. Li and Sun propose SVMs based on the nonlinear combination of multiple kernels and verify their effectiveness by empirical studies [21]. Cortes et al. observe a considerable performance improvement in nonlinear combinations of kernels [22]. However, the hybrid kernel SVM is rarely applied to financial distress prediction.

Feature weighting assigns a coefficient to each feature, with each coefficient indicating the relative importance of the corresponding feature to the given learning task (e.g., classification). It has been studied by machine learning researchers and can yield performance improvements [23]. Grey relational analysis (GRA), proposed by Deng, aims at dealing with poor, incomplete, and uncertain systems. The GRA model is used to analyze the dynamic relations among various factors and their changes over time in order to find the key factors in prediction and decision-making. According to Song et al. [24], if the change trends of two factors are basically the same, the two factors are closely related and the grey relational degree (GRD) between them is higher. Accordingly, the GRDs between the financial situation and the financial indicators of companies can reflect the correlations between them. The higher the GRD is, the greater the relevance is and the more important the financial indicator should be when evaluating the financial situation.

The GRA models provide appropriate tools for ranking multiple objects according to their resemblance to an objective. Owing to its usefulness and advantages, the GRA has recently been applied to financial distress prediction. Kung and Wen use six financial indicators to classify twenty items of financial ratios as research variables and apply the GRA to find the significant financial ratio variables affecting the financial situation of venture capital enterprises in Taiwan [25]. Lin and Wu propose a new GRA approach to construct a financial crisis warning system for the banking industry, and the empirical results illustrate that the proposed GRA model demonstrates better prediction accuracy for banks in financial crisis as well as for financially sound banks [26]. Lin et al. investigate the efficacy of applying a hybrid distress prediction model based on the GRA and rough sets to financial distress prediction and the results demonstrate that the hybrid approach can significantly improve the prediction accuracy [27]. Chuang uses the GRA to evaluate the importance of the features and provide a ranking of them, which improves the accuracy of distress prediction [28].

Economic value added (EVA) is a financial performance measure capable of presenting the financial situation of a company. Unlike the credit rating used in many countries [29, 30] or the special treatment (ST) used in the securities market of China [31], the EVA is a quantitative description of the financial situation. The larger the value of the EVA is, the better the financial situation. However, few studies combine the GRA and the EVA for weighting financial indicators and predicting financial distress.

The purpose of this paper is to propose a new approach combining biorthogonal wavelet hybrid kernel one-class SVM with feature weighted approach based on the EVA and the GRA for financial distress prediction. This paper is organized as follows. The next section focuses on the construction of hybrid kernel function based on biorthogonal wavelet kernel and linear kernel. Section 3 discusses the feature weighted approach based on the GRA and the EVA. In Section 4, we propose the feature weighted one-class SVM based on biorthogonal wavelet hybrid kernel (BWH-FWOCSVM) for financial distress prediction. Section 5 presents the experimental results of the proposed method. The concluding remarks are given in Section 6.

2. Construction of Biorthogonal Wavelet Hybrid Kernel Function

Assuming $X$ is a subset of the n-dimensional real space $\mathbb{R}^n$, if there is a mapping $\Phi: X \to H$, $x \mapsto \Phi(x)$, satisfying $K(x, x') = \langle \Phi(x), \Phi(x') \rangle$, where $H$ is the Hilbert space and $\langle \cdot, \cdot \rangle$ denotes the inner product, then the function $K(x, x')$ defined on $X \times X$ is called the kernel function. The kernel function transforms nonlinear problems in the input space into linear problems in the feature space. Consequently, a sample point $x$ in the input space can be mapped to a high-dimensional feature space by an appropriate mapping $\Phi$.

It is well known that high smoothness, compact support, and high vanishing moment are the three most important properties of a biorthogonal wavelet, which has been found very useful in various applications such as signal processing and image compression.

Supposing is a biorthogonal wavelet function and is the dual biorthogonal wavelet function with scaling functions and , respectively, the corresponding scale coefficients of and are and satisfy

Assuming and are the Fourier transformations of and , respectively, then and . Biorthogonal Riesz basis in is obtained by translating and scaling and if for any , there exists satisfying and , and and have sufficient attenuation in the frequency domain while they are also meeting .

Therefore, any functions in can be expanded by biorthogonal wavelet function, meaning for every , , where and .

Meanwhile, a multiresolution analysis in can be obtained, where space is generated from and space is generated from . Accordingly, for any , its expansion is , where and .

Based on the analysis above, we can construct a biorthogonal wavelet kernel function as shown in (2), where :

Limited to the admissibility of positive definite kernel, an effective kernel function should be symmetric and positive definite. We will prove that the biorthogonal wavelet kernel function constructed by (2) satisfies the conditions.

As mentioned above, and are a pair of biorthogonal dual scaling functions so that for any , the following equations can be obtained:

From (3), .

And then we will have

Obviously, the kernel function in (2) is symmetric.

Proving the positive definiteness of equals to prove that for any , the Gram matrix of is positive semidefinite. For any , we can obtain

Apparently is also a positive definite function. As a consequence, the biorthogonal wavelet kernel function in (2) is effective.

Local kernel and global kernel are two basic types of kernels. A kernel possessing translation invariance is a local kernel, which satisfies . While a kernel having rotational invariance is a global kernel, which means for every , there must be , , , and .

The biorthogonal wavelet kernel function is definitely a local kernel because it is symmetric, satisfying . Obviously, the linear kernel is a typical global kernel. According to [32], a hybrid kernel function performs better than a single kernel function because a mixture of kernels combines the generalization ability of the global kernel with the learning capacity of the local kernel. Thus, by combining linear kernel function and biorthogonal wavelet kernel function, we construct the biorthogonal wavelet hybrid kernel function : where is the biorthogonal wavelet kernel function, is the linear kernel function, represents the weighting coefficient and . In this paper, because of its good performance in signal processing, the Cdf9/7 biorthogonal wavelet is selected to construct ; the details of the Cdf9/7 biorthogonal wavelet are given in [33]. Then, biorthogonal wavelet hybrid kernel function can be expanded to the form
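As an illustration of the convex combination described above, the following minimal sketch mixes a global linear kernel with a local wavelet kernel. Several points are assumptions rather than the construction of this paper: the Morlet wavelet kernel is used only as a stand-in for the Cdf9/7 biorthogonal wavelet kernel, the name `lam` for the weighting coefficient is hypothetical, and it is assumed that the weighting coefficient multiplies the wavelet term, so that 0 gives the pure linear kernel and 1 the pure wavelet kernel.

```python
import math


def linear_kernel(x, y):
    """Global kernel: the plain inner product of two feature vectors."""
    return sum(a * b for a, b in zip(x, y))


def morlet_wavelet_kernel(x, y, a=1.0):
    """Stand-in local kernel (Morlet wavelet kernel with dilation a).

    This is NOT the Cdf9/7 biorthogonal wavelet kernel of the paper; it is
    used here only because it has a simple closed form.
    """
    value = 1.0
    for xi, yi in zip(x, y):
        u = (xi - yi) / a
        value *= math.cos(1.75 * u) * math.exp(-0.5 * u * u)
    return value


def hybrid_kernel(x, y, lam, local_kernel=morlet_wavelet_kernel):
    """Convex combination: lam * local kernel + (1 - lam) * linear kernel.

    Assumption: lam weights the wavelet (local) term.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return lam * local_kernel(x, y) + (1.0 - lam) * linear_kernel(x, y)
```

Because a nonnegative combination of two positive semidefinite kernels is again positive semidefinite, any valid local kernel can be substituted through the `local_kernel` argument.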

3. Financial Indicator Weighting Based on EVA and GRA

The EVA is a quantitative description of the financial situation. The larger the value of EVA is, the better the financial situation. The EVA focuses on shareholder value and the role of capital cost including costs of both debt and equity. A positive EVA signifies that value has been created for shareholders; in other words, the enterprise has high profitability and a good financial situation. A negative EVA indicates value destruction and financial distress, which calls for an early warning [34]. The EVA can be defined as the firm's net operating profit after taxes minus the cost of capital, which can be calculated as follows: where NOPAT is the net operating profit after taxes, is the debt book value, is the equity book value, and WACC is the weighted average cost of capital. The information needed to compute the EVA mainly comes from accounting data, which needs to go through some adjustments in order to correct the distortions in accounting information [35]. The most important adjustments include the adjustments of NOPAT, , , and WACC [36–38]. Consider where NP is the net profit, LIC is the loan impairment number changes of this year, OAIC is the other assets impairment number changes of this year, NNOI is the net nonoperating income, T is the tax rate, and DTI is the deferred tax increases.

Consider where is the short-term loan, is the long-term loan in one year, is the long-term loan, and is the bonds payable.

Consider where is the minority stockholder’s interest and is the stockholders’ equity.

Consider where is the cost of equity and is the cost of debt capital.
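The EVA definition above (net operating profit after taxes minus the cost of the capital employed) can be sketched as follows. The function and argument names are hypothetical, and the WACC helper uses the common textbook form with a tax shield on debt, which is an assumption and may differ from the adjusted formula used in this paper. Treating an EVA of exactly zero as distressed is likewise an arbitrary choice made only for this sketch.

```python
def wacc(debt, equity, cost_of_equity, cost_of_debt, tax_rate):
    """Textbook weighted average cost of capital (assumed form)."""
    total = debt + equity
    if total <= 0:
        raise ValueError("debt + equity must be positive")
    equity_part = (equity / total) * cost_of_equity
    debt_part = (debt / total) * cost_of_debt * (1.0 - tax_rate)
    return equity_part + debt_part


def eva(nopat, debt, equity, wacc_rate):
    """EVA = NOPAT - (debt book value + equity book value) * WACC."""
    return nopat - (debt + equity) * wacc_rate


def eva_label(eva_value):
    """Class label from the sign of the EVA: +1 normal, -1 distressed."""
    return 1 if eva_value > 0 else -1
```

For example, with NOPAT of 120, debt of 400, equity of 600, and a WACC of 7.8%, the capital charge is 78 and the EVA is 42, so the company would be labelled as normal.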

GRA is considered to be an analysis of the geometric proximity between different discrete sequences within a system. The proximity is described by the GRD, which is regarded as a measure of the similarities of discrete data that can be arranged in a sequential order [39]. In this paper, the GRA is adopted for the description of the correlation between financial situation (represented by the EVA) and financial indicators.

There are companies, each with financial indicators. represents the EVA of the th company and all the EVAs compose the sequence . represents the th financial indicator of the th company, and the sequence consists of . is defined as the reference sequence, while is the comparison sequences. To determine the GRD between the reference and comparison sequences, a discrete function of the grey relational coefficient is constructed as follows: where is the absolute value of the difference between the reference and the th comparison sequence. and are the minimal and maximal proximity, respectively. Meanwhile, is the coefficient to distinguish and is generally adopted in most studies. Hence, the GRD between the th financial indicator and the financial situation of the th company is where is the weight of the grey relational coefficient and are adopted in this study as often used in the literatures. represents the relevance of the th financial indicator and the financial situation and is regarded as the weighting of the th financial indicator.
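The grey relational computation described above can be sketched as follows, using Deng's classical grey relational coefficient. The distinguishing coefficient `rho = 0.5` and the equal weighting of the coefficients are the conventional choices and are assumed here; the function name and the layout of the inputs (one reference sequence of EVAs, one comparison sequence per financial indicator) are hypothetical, and the sequences are assumed to have been normalized to comparable scales beforehand.

```python
def grey_relational_degrees(reference, comparisons, rho=0.5):
    """Return one grey relational degree (GRD) per comparison sequence.

    reference   -- list of m values (e.g. the EVA of each company)
    comparisons -- list of n sequences, each of length m (one per indicator)
    rho         -- distinguishing coefficient (0.5 assumed, as is conventional)
    """
    if not comparisons:
        raise ValueError("at least one comparison sequence is required")
    if any(len(seq) != len(reference) for seq in comparisons):
        raise ValueError("all sequences must have the reference length")

    # Absolute differences between the reference and every comparison sequence.
    deltas = [[abs(r - c) for r, c in zip(reference, seq)] for seq in comparisons]
    flat = [d for row in deltas for d in row]
    d_min, d_max = min(flat), max(flat)
    if d_max == 0.0:
        # Every sequence coincides with the reference.
        return [1.0 for _ in comparisons]

    degrees = []
    for row in deltas:
        coeffs = [(d_min + rho * d_max) / (d + rho * d_max) for d in row]
        degrees.append(sum(coeffs) / len(coeffs))  # equal weights assumed
    return degrees
```

A comparison sequence identical to the reference obtains a GRD of 1, and the GRD decreases as the sequence departs from the reference, which is why it can serve as a feature weight.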

4. EVA-FWOCSVM Based on Biorthogonal Wavelet Hybrid Kernel Function

The feature weighted SVM is a SVM constructed by the feature weighted kernel function : where is a kernel function in , and is the feature weighting matrix, is the GRD between the th financial indicator and financial situation and is calculated by (11). As the weight of the indicator, is capable of changing the geometry of the feature space by scaling the geometry of the input space. Through the scaling, a more suitable hyperplane is generated in order to improve the classification performance of the SVM.

The feature weighted biorthogonal wavelet hybrid kernel function based on the EVA is shown in (17) by substituting (7) into (16)
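The effect of the feature weighting matrix can be illustrated with a minimal sketch in which each feature is scaled by its weight before an arbitrary base kernel is applied, which corresponds to a diagonal weighting matrix. The function names are hypothetical, and a linear base kernel is used only to keep the example self-contained; it is not the hybrid kernel of this paper.

```python
def linear_kernel(x, y):
    """Inner product of two feature vectors."""
    return sum(a * b for a, b in zip(x, y))


def feature_weighted_kernel(x, y, weights, base_kernel=linear_kernel):
    """Apply base_kernel to feature vectors scaled by a diagonal weight matrix.

    weights -- one weight per feature (e.g. the GRD of each indicator)
    """
    if not (len(x) == len(y) == len(weights)):
        raise ValueError("x, y and weights must have the same length")
    x_scaled = [w * a for w, a in zip(weights, x)]
    y_scaled = [w * b for w, b in zip(weights, y)]
    return base_kernel(x_scaled, y_scaled)
```

With unit weights the base kernel is recovered unchanged, while a small weight shrinks the contribution of the corresponding indicator to every kernel evaluation.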

The SVM model is originally designed for binary classification, where it is typically assumed that the numbers of training data for both classes are balanced. However, in real world applications, there is often little or even no data at all available for one of the classes; in other words, one of the classes has a large sample number and the other has a small sample number. For example, in financial distress prediction, there are only a few companies with financial problems. As a consequence, the accuracy loss of the SVM for binary classification is unavoidable. The one-class SVM (OCSVM) model is proposed to improve the classification accuracy by compensating for the imbalance between the two classes [40].

Suppose the training sample is , where is the input vector and is the class label. In this paper, the class label is decided by the value of the EVA; that is, a positive EVA means good financial situation of the company and the corresponding class label is +1, while a negative EVA shows the company is trapped in financial crisis and its class label is −1.

According to [40], constructing the feature weighted one-class SVM (FWOCSVM) model is equivalent to solving the following quadratic programming problem:

This constrained optimization problem is dealt with by introducing Lagrange multipliers and a Lagrangian:

After taking the partial derivatives with respect to and , we will have the following formulas:

By substituting (20) into (18), it arrives at the dual problem of (18):

We employ the kernel function representation of , which is . Then, the dual problem becomes

By solving problem (22), the final decision function is obtained, representing the optimal hyperplane used to classify the samples. Finally, the BWH-FWOCSVM is constructed by substituting (17) into (22). Thus, the corresponding decision function is
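Once the dual problem has been solved, classifying a new sample only requires kernel evaluations against the support vectors. The following minimal sketch uses the generic kernel-expansion form, the sign of a weighted sum of kernel values minus an offset, which is an assumption and may differ in detail from the decision function derived above. The names `alphas` and `rho` are hypothetical placeholders for the dual coefficients and the offset, and the linear kernel is included only to make the example self-contained.

```python
def linear_kernel(x, y):
    """Inner product of two feature vectors."""
    return sum(a * b for a, b in zip(x, y))


def ocsvm_decision(x, support_vectors, alphas, rho, kernel=linear_kernel):
    """Return (label, score) for sample x.

    score = sum_i alphas[i] * kernel(support_vectors[i], x) - rho
    label = +1 (normal) if score >= 0, otherwise -1 (distressed)
    """
    if len(support_vectors) != len(alphas):
        raise ValueError("one coefficient per support vector is required")
    score = sum(a * kernel(sv, x) for a, sv in zip(alphas, support_vectors)) - rho
    label = 1 if score >= 0 else -1
    return label, score
```

Any kernel with the same two-argument signature, such as a feature weighted hybrid kernel, can be passed through the `kernel` argument.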

5. Empirical Experiment

5.1. The Dataset and Indicators

In order to verify the performance of the BWH-FWOCSVM in financial distress prediction, the listed companies on the Growth Enterprise Market (GEM) in China are selected as samples. A company is regarded as distressed if its EVA is negative and normal if its EVA is positive. A total of 284 companies listed on the GEM in 2012 are selected randomly as the initial data set.

According to (8)~(12), the EVAs of the selected 284 companies are calculated with the 3~5 years benchmark lending rate representing and other related data from the audited financial statements of those companies. Forty-four companies with negative EVAs are regarded as distressed, while 240 with positive EVAs are regarded as normal. The ratio of distressed companies to normal ones is approximately 1 : 5, indicating obvious imbalance, which means the one-class SVM is appropriate. The data set is split into the testing set and the training set, of which the former consists of 20 distressed companies and 100 normal companies (based on the ratio 1 : 5), and the latter includes the remaining 164 companies. The EVAs of the samples in every quarter of 2012 are calculated. Data in quarter T-2 is used to predict the financial situations of companies in quarter T, which avoids overestimating the prediction accuracy of the model.

Financial indicators have been consulted by researchers as a major basis for predicting financial distress and business crises [41]. Different financial indicators may cause different prediction results. Commonly used financial indicators include inventory turnover, current asset ratio, earnings per share, net profit growth rate, and return on equity [13, 42]. According to the particularity of China’s securities market and the accounting system, financial indicators can be divided into different categories, such as liquidity, debt paying ability, profitability, and market value, and each category contains multiple relevant indicators [43, 44]. In order to comprehensively summarize the financial situation of companies, a total of 28 financial ratios from four categories including liquidity, debt paying ability, profitability, and market value are selected as the initial indicators as listed in Table 1. The GRDs between financial indicators and EVA are calculated and the results are listed in Table 2.

As Table 2 shows, there are significant differences among the GRDs, in which 8 GRDs are higher than 0.9, and half of all the GRDs range between 0.7 and 0.8. The results indicate that the relations between different indicators and financial situation are quite different. In this study, the corresponding GRD is regarded as the weight of the financial indicator.

5.2. Experiment Results and Analysis

In past research, a variety of measurements have been used to assess the prediction performance of the SVM in financial distress prediction. This study considers the following measurements:
(a) : percentage of financially distressed companies predicted as distressed,
(b) : percentage of normal companies predicted as distressed,
(c) Correct classification rate (CCR): percentage of companies predicted correctly among all companies.

Overall the CCR measures the prediction accuracy of the model, while and provide auxiliary discrimination. As we know, different types of misclassifications result in different penalty costs, for example, misclassifying a distressed company as a normal company may cause more damage than misclassifying a normal company as a distressed one. So the model with higher is preferred if different models have the same or similar CCRs.
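The three measurements can be computed from true and predicted labels as in the following minimal sketch. The function name and the dictionary keys are hypothetical, and the label convention (−1 for distressed, +1 for normal) follows the class labels defined in Section 4.

```python
def distress_metrics(y_true, y_pred, distressed=-1, normal=1):
    """Return the three measurements as fractions in [0, 1].

    distressed_detected -- share of distressed companies predicted as distressed
    normal_flagged      -- share of normal companies predicted as distressed
    ccr                 -- share of all companies predicted correctly
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equally long")
    pairs = list(zip(y_true, y_pred))
    n_distressed = sum(1 for t, _ in pairs if t == distressed)
    n_normal = sum(1 for t, _ in pairs if t == normal)
    if n_distressed == 0 or n_normal == 0:
        raise ValueError("both classes must be present in y_true")
    hits = sum(1 for t, p in pairs if t == distressed and p == distressed)
    false_alarms = sum(1 for t, p in pairs if t == normal and p == distressed)
    correct = sum(1 for t, p in pairs if t == p)
    return {
        "distressed_detected": hits / n_distressed,
        "normal_flagged": false_alarms / n_normal,
        "ccr": correct / len(pairs),
    }
```

Reporting the two class-wise rates alongside the CCR matters for imbalanced data, because a model that labels every company as normal would still reach a high CCR while detecting no distressed company.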

Three financial indicator sets are constructed for the experiment, which consist of 28 indicators (), 13 indicators (), and 8 indicators () corresponding to , , and , respectively. Because the weight coefficient in the hybrid kernel function influences the prediction accuracy, we use the grid search method to analyze the influence of , with a search step length of 0.1. When , the kernel function is a linear kernel function; when , the kernel function represents the CDF9/7 biorthogonal wavelet kernel function. The corresponding results are displayed in Table 3 and Figures 1 and 2. The results of the nonweighted OCSVM based on the biorthogonal wavelet hybrid kernel function (BWH-OCSVM) are also given for comparison. Penalty parameter and insensitive parameter are two parameters which have great influence on the performance of SVMs. Many optimization methods have been introduced, and the particle swarm optimization (PSO) algorithm [29] is applied in this study to obtain the optimal and .
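The grid search over the weighting coefficient can be sketched as follows. The callback `evaluate_ccr` is a hypothetical placeholder for training the model with a given coefficient and returning its CCR on held-out data; it is not part of this paper. With a step length of 0.1 the grid contains the eleven values 0.0, 0.1, ..., 1.0.

```python
def grid_search_weight(evaluate_ccr, step=0.1):
    """Evaluate every grid value in [0, 1] and return (best_value, all_scores).

    evaluate_ccr -- hypothetical callable mapping a weighting coefficient
                    to a correct classification rate
    """
    if not 0.0 < step <= 1.0:
        raise ValueError("step must lie in (0, 1]")
    n_steps = round(1.0 / step)
    # Rounding removes floating-point drift such as 0.30000000000000004.
    grid = [round(i * step, 10) for i in range(n_steps + 1)]
    scores = {lam: evaluate_ccr(lam) for lam in grid}
    best = max(grid, key=lambda lam: scores[lam])
    return best, scores
```

When several grid values share the highest score, this sketch returns the smallest of them; a finer step or a different optimizer can be substituted without changing the callback.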

Figures 1 and 2 and Table 3 illustrate that the relation between the prediction accuracy of the model and the number of financial indicators is nonlinear. In both the BWH-FWOCSVM and the BWH-OCSVM, almost always provides the lowest CCR while almost always has the highest one, and the CCRs of are roughly in between. The results of and are also similar. The results demonstrate that few financial indicators are selected if the threshold of the GRD is too high, which may lead to information loss and accuracy decrease. On the other hand, a low threshold of the GRD gives rise to too many financial indicators, which may also affect the prediction accuracy because of information redundancy. As the financial indicator set depends on the threshold of the GRD, determining the proper threshold value is important to financial distress prediction. In fact, how to select an appropriate financial indicator set is still an open issue [31].

As shown in Figures 1 and 2, with the increase of , the CCRs of the BWH-FWOCSVM and the CCRs of the BWH-OCSVM rise at first but fall later, whether it is for , , or . The highest CCR can be reached when . Figures 3 and 4 compare the CCRs of the hybrid kernel, the linear kernel (LK), and the CDF9/7 biorthogonal wavelet kernel (Cdf9/7 BWK). The H-BWHK refers to the highest CCR of the SVM based on the biorthogonal wavelet hybrid kernel, while the A-BWHK refers to the average of the nine CCRs when changes from 0.1 to 0.9. For both the BWH-FWOCSVM and the BWH-OCSVM with indicator sets , , and , the A-BWHK is higher than the CCRs of the LK and the Cdf9/7 BWK, and the H-BWHK has a more obvious advantage. Nevertheless, the selection of has a great impact on the prediction accuracy. An inappropriate may cause the prediction accuracy of a hybrid kernel SVM model to fall below that of a single kernel SVM model. Besides the grid search method, optimization algorithms such as PSO and genetic algorithms can also be used to select .

In Table 3, it is obvious that weighting the financial indicators based on the EVA and the GRA can improve the prediction accuracy effectively. Considering the average value over the eleven cases when ranges from 0 to 1, the CCRs of the BWH-FWOCSVM are 1.8%, 2.5%, and 1.7% higher than those of the BWH-OCSVM with , , and , respectively, while the s of the BWH-FWOCSVM are 2.8%, 2.5%, and 1.7% higher than those of the BWH-OCSVM, respectively. Taking the CCR as the selection criterion, and of the BWH-FWOCSVM and the BWH-OCSVM with the highest CCR are selected for comparison (i.e., both models with the highest CCR satisfy the following conditions: the indicator set is and ). We find that the BWH-FWOCSVM has a clear advantage: specifically, its and are, respectively, 9.3% higher and 5.1% lower than those of the BWH-OCSVM. Given that indicates the model’s effectiveness in classifying the financially distressed companies correctly, and misclassifying a financially distressed company as a normal one may cause huge risk, the BWH-FWOCSVM is more effective than the nonweighted BWH-OCSVM in financial distress prediction.

The statistical significance of the difference between the BWH-FWOCSVM and the BWH-OCSVM on the CCR is assessed by using the Wilcoxon signed-rank test. The Wilcoxon test is a nonparametric test that is applied when the overall distribution is unknown [45]. For the CCR in Table 3, the results of the Wilcoxon test are shown in Table 4. The values of the Z statistics of , , and are −2.943, −2.949, and −2.944, respectively. The concomitant probabilities α of the three financial indicator sets are all 0.003, less than the significance level of 0.01, which indicates that the CCRs of the BWH-FWOCSVM are higher than those of the BWH-OCSVM at the significance level of 0.01.
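For readers who wish to carry out a comparable test on their own paired results, the following minimal sketch computes the Z statistic of the Wilcoxon signed-rank test with the normal approximation and a tie correction. It is an illustration under stated assumptions, not a reproduction of Table 4: zero differences are discarded, tied absolute differences receive average ranks, no continuity correction is applied, and the statistic is based on the smaller rank sum so that Z is nonpositive. The function name is hypothetical.

```python
import math


def wilcoxon_z(sample_a, sample_b):
    """Z statistic of the Wilcoxon signed-rank test for paired samples."""
    diffs = [a - b for a, b in zip(sample_a, sample_b) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")

    # Rank the absolute differences, giving tied values their average rank.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        average_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        t = j - i + 1
        tie_term += t ** 3 - t
        i = j + 1

    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    w = min(w_plus, w_minus)

    mean = n * (n + 1) / 4.0
    variance = n * (n + 1) * (2 * n + 1) / 24.0 - tie_term / 48.0
    return (w - mean) / math.sqrt(variance)
```

The two-sided probability then follows from the standard normal distribution; for small samples an exact test is generally preferable to the normal approximation.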

Many studies demonstrate that the SVM is superior to the DA, logistic regression, and ANNs in financial distress prediction, so similar comparisons are not conducted in this study. In order to verify the performance of the proposed CDF9/7 biorthogonal wavelet kernel and hybrid kernel function , we compare these two kernel functions with five other single kernel functions, namely, the Poly kernel, Sigmoid kernel, RBF kernel, Morlet kernel, and Coif3 kernel, as shown in Table 5.

From Table 5 we can conclude that biorthogonal wavelet hybrid kernel has the best performance in both the FWOCSVM and the OCSVM. No matter which indicator set is used, SVM models based on get higher CCRs, higher s and lower s than other single kernel functions. Taking the FWOCSVM as an example, the CCRs of are, respectively, 7.3%, 8.3%, and 6.7% higher than the average CCR of the other five single kernels with , , and , while s of are 6.4%, 14.1%, and 11.4% higher. Also, the results illustrate that the proposed Cdf9/7 biorthogonal wavelet kernel outperforms the above five single kernels overall in CCR, , and , only with the exception of 0.5% lower than the Coif3 in the FWOCSVM with .

For the SVMs based on the same kernel, the FWOCSVM has higher CCR, higher , and lower . For instance, in Figures 5 and 6 with , the average CCR of the FWOCSVM based on eight different kernels is 2.5% higher than that of the OCSVM, and the maximum value of CCR is 4.2% higher. Meanwhile, the average is 4% higher and the maximum value of is 11.7% higher. This also indicates that weighting the financial indicators based on the GRA and the EVA improves the accuracy of financial distress prediction effectively. The statistical significance of the difference on the CCR between the FWOCSVM and the OCSVM based on different kernels is also assessed by using the Wilcoxon test and the results are shown in Table 6. The values of the Z statistics of , , and are −2.521, −2.524, and −2.524, respectively. The concomitant probabilities α of the three financial indicator sets are all 0.012, smaller than the significance level of 0.05, which indicates that the CCRs of the FWOCSVM based on different kernels are higher than those of the OCSVM at the significance level of 0.05.

6. Conclusion

In this paper, we develop and implement a framework of a financial distress prediction model based on publicly available data. To this end, a novel biorthogonal wavelet hybrid kernel function is constructed by combining linear kernel function with biorthogonal wavelet kernel function, and a new feature weighted approach based on the EVA and the GRA is presented. The novel biorthogonal wavelet hybrid kernel function takes full advantage of different kernel functions’ mapping abilities and has good learning performance, while the new feature weighted approach is applied to weight financial indicators. Considering the imbalance between financially distressed companies and normal ones, the feature weighted one-class SVM based on the biorthogonal wavelet hybrid kernel (BWH-FWOCSVM) is further proposed and the empirical analysis conducted on the 284 listed companies in the GEM of China shows good performance of the proposed approach. The financial distress prediction model proposed in this paper is a reliable and feasible system for evaluating financial situation of the listed companies.

However, in this study, the optimal weighting coefficient is obtained by the grid search method with a search step length of 0.1, which may not be precise enough. In future studies, other optimization methods such as genetic algorithms will be applied to obtain in the hybrid kernel. Besides, only the linear combination of different kernels is applied in this paper; in future studies, the nonlinear combination of different kernels will be studied for financial distress prediction.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

This research was supported by the NSFC grant (no. 71201024).