Abstract

Banking systemic risk is a complex nonlinear phenomenon and has shed light on the importance of safeguarding financial stability by recent financial crisis. According to the complex nonlinear characteristics of banking systemic risk, in this paper we apply support vector machine (SVM) to the prediction of banking systemic risk in an attempt to suggest a new model with better explanatory power and stability. We conduct a case study of an SVM-based prediction model for Chinese banking systemic risk and find the experiment results showing that support vector machine is an efficient method in such case.

1. Introduction

Financial crises occurred over the past decade have shown that banking crises are usually in the core position of financial crises. Therefore, banking sectors’ stability is a key to maintain financial stability. However, the major threat to banking sectors is systemic risk, because its contagion effect makes a single bank crisis evolve into the whole banking crisis, which may cause a financial crisis. The 2007–2009 financial crisis has threatened the stability of the international monetary market, shed light on the importance of systemic risk, and revealed the necessity of predicting one in order to safeguard financial stability.

The prediction of banking systemic crisis has been done since the mid-1990s. Frankel and Rose [1] use a probit model to evaluate the predictive power of several indicators for studying the determinants of currency crashes. Kaminsky and Reinhart [2] extend the analysis of Frankel and Rose [1] to a wider set of crises by introducing the so-called “signal” approach to evaluate the leading properties of indicators. Based on the study of Kaminsky and Reinhart [2], Demirgüç-Kunt and Detragiache [3] use a multivariate logit model for the prediction of banking crises. Other approaches are introduced to early warning models for systemic risk in the subsequent years, such as the regression model [46], the neurofuzzy approach [7], binary recursive trees [8, 9], and the alternative approach [10, 11].

In fact, banking systemic risk is a complex nonlinear phenomenon, which originates from diversity and uncertainty of risk sources, multiple contagion channels and their relationships as well as complexity and evolution of banking system structures [1214]. The above prediction approaches either have certain limitations to analyze the complex nonlinear property of banking systemic risk or need a lot of samples. Recently, support vector machine (SVM) introduced by Vapnik [15] has become a promising tool for solving nonlinear regression problems and has been applied in prediction in many areas [1618]. SVM has many advantages; for example, it combines the advantages of multivariate nonlinear regression in that only a small amount of data is required to produce a good generalization. In addition, the weakness of the transformational models in multivariate nonlinear regression can be overcome by mapping the data points to a sufficiently high-dimensional feature space [17]. Therefore, it is believed that SVM will provide a new approach for predicting banking systemic risk.

In this paper, we attempt to develop an SVM-based predictive model for banking systemic risk. Our paper is organized as follows. Section 2 provides the brief description of SVM theory. Section 3 elaborates a case study of an SVM-based prediction model for Chinese banking systemic risk. Section 4 draws conclusions.

2. Support Vector Machine

Based on statistical learning theory, Vapnik [15] put forward support vector machine. Its basic idea is to construct an optimal separating hyperplane with high classification accuracy [19]. A brief description of the SVM algorithm is provided as follows [16, 18, 19].

Consider the training dataset , where is an input vector and denotes a target label. In the linear separable case, SVM algorithm is to find an optimal separating plane . In the nonlinear separable case, a nonlinear function must be applied to map input space to a higher dimensional feature space, and the constrained optimization model of soft margin based on SVM is as follows [19]: subject to where is a slack variable and is a tuning parameter. By applying the Lagrangian technique to (2), we can obtain its dual problem as follows: subject to where is a vector of all ones, is a positive semidefinite matrix, and is Lagrange multiplier. The element of is equal to , where is called kernel function. There are different kernel functions, such as linear, polynomial, and radial basis functions (RBF). Based on the above analysis, we can obtain the ultimate classifier function of SVM as follows:

3. Prediction Model for Banking Systemic Risk

In this section, we take the Chinese banking system as an example to construct an SVM-based prediction model for banking systemic risk and conduct empirical analysis based on the data of the Chinese banking system.

3.1. Prediction Indicators for Banking Systemic Risk

Microprudential indicators cannot provide a systemic perspective on banking systemic risk, while macro-prudential indicators cannot provide distress warnings from individual banks. Recent financial crises show that interbank connections play an important role in propagation mechanisms of banking systemic risk. Thus, the prediction indicators for banking systemic risk should incorporate both Microprudential and macro-prudential perspectives, as well as the characteristics of interbank connections. Based on the relative studies [2023], prediction indicators and their prudential intervals for banking systemic risk in this paper are given in Table 1, where all indicators are processed chemotactically, are Microprudential indicators, are indicators of interbank connections, and are macro-prudential indicators.

3.2. Evaluation for Levels of Banking Systemic Risk

In light of the lack of data on levels of banking systemic risk in China, we adopt the principal component analysis to classify levels of banking systemic risk. The basic idea is as follows: constructing three critical value samples based on data of prediction indicators processed chemotactically in Table 1; conducting the principal component analysis on selected samples and the three critical value samples; classifying scores of selected experimental samples according to scores of the three critical value samples. In this paper, we select data of prediction indicators from the first quarter of 2004 to the third quarter of 2012 to collect the experimental data. By conducting the principal component analysis, we can obtain the levels of banking systemic risk, which is presented in Figure 1.

3.3. Empirical Analysis of Prediction Accuracy

In this section, we analyze the prediction accuracy of SVM and also compare its performance with those of back-propagation neural network (BPNN), multiple discriminant analysis (MDA), and logistic regression analysis (Logit). We take the data of prediction indicators from the first quarter of 2004 to the second quarter of 2008 as the training sample and the rest of the data as the testing sample. At the same time, we construct a safety sample based on critical values of the safety condition into the training sample, because of the lack of safety samples in the training sample. Therefore, the size of the training sample is equal to 19, and the sizes of the testing sample is equal to 17. In case of BPNN, the rest data is split into the validation sample and the testing sample, and the size of the validation sample and the testing sample are 4 and 13, respectively.

3.3.1. Prediction Accuracy of SVM

Construction of prediction models for banking systemic risk based on SVM is to choose the kernel function and the values of model parameters. In this paper, the radial basis function (RBF) is used as the basic kernel function of SVM, because it usually gets better results than other kernel functions [24]. There are two parameters associated with RBF kernels: and , playing a crucial role in the performance of SVM. Therefore, improper selection of these two parameters can cause overfitting or underfitting problems. In this paper, we determine the values of parameters and by using Grid Search, Genetic Algorithms, and Particle Swarm Optimization.

According to the above method, we first analyze prediction accuracy of prediction models for banking systemic risk based on SVM. Based on the above data, we can obtain values of optimal parameters and for SVM model by Grid Search, Genetic Algorithms, and Particle Swarm Optimization. Figures 2, 3, and 4 report, respectively, the results of Grid Search, Genetic Algorithms, and Particle Swarm Optimization. As we observe from Figures 2, 3 and 4, the optimal parameters of Grid Search, Genetic Algorithms, and Particle Swarm Optimization are , , and , respectively. After the optimal parametes are found, we can obtain that the best prediction accuracy of the testing sample is 94.12%. The performance of SVM under the above mentioned three methods is summarized in Table 2.

3.3.2. Comparison of Prediction Accuracy of SVM, BPNN, MDA, and Logit

Back-propagation neural network (BPNN), multiple discriminant analysis (MDA), and logistic regression analysis (Logit) are widely applied methods for prediction. Therefore, in this paper, we compare the prediction accuracy of SVM with those of BPNN, MDA, and Logit. In case of BPNN, a three-layer fully connected back-propagation neural network is used as a benchmark; 5, 10, 15, 20, 25, and 30 hidden nodes in the hidden layer are analyzed; the maximum number of learning epochs is set to 1000; the learning rate is set to 0.01; the momentum term is set to 0.95; the activation function of the hidden layer and the output layer are, respectively, Tansig and Purelin; the training function is Trainlm. According to parameter adjustment, we can obtain that the best prediction accuracy of the testing sample is found when the number of hidden nodes is 20. The prediction accuracy of the testing sample, the training sample, and the validation sample is 84.62%, 100%, and 100%, respectively.

Table 3 summarizes the best prediction performance of SVM, BPNN, MDA, and Logit in the training sample and the testing sample. From Table 3, we can see that SVM outperforms BPNN, MDA, and Logit: 9.5%, 17.65% and 17.65%, respectively, for the testing sample. In addition, we conduct the McNemar test to examine whether SVM significantly outperforms the other three methods. Table 4 shows the results of the McNemar test. As shown in Table 4, SVM outperforms BPNN at 10% statistical significance level, and SVM outperforms MDA and Logit at 5% statistical significance level. However, Table 4 also shows that the prediction performances of BPNN, MDA, and Logit do not significantly differ from each other. The above result shows that support vector machine has certain advantages for the prediction of banking systemic risk with the complex nonlinear characteristics.

4. Conclusion

In this paper, we apply SVM to predict banking systemic risk. To validate the prediction performance of this approach, we conduct a case study of an SVM-based prediction model for Chinese banking systemic risk. First, we construct the prediction indicators for banking systemic risk from both Microprudential and macro-prudential perspectives, as well as the characteristics of interbank connections. Second, we adopt the principal component analysis to classify levels of banking systemic risk. Based on the above analysis, we conduct empirical analysis of prediction accuracy of the SVM-based prediction model for banking systemic risk. The results of empirical analysis show the capability, accuracy, and high efficiency of the SVM-based prediction model. In addition, compared with the BPNN-based prediction model, multiple discriminant analysis, and logistic regression analysis, the SVM-based prediction model shows superior prediction power. With these results, we claim that SVM can serve as a promising alternative in the prediction of banking systemic risk.

Acknowledgments

This research is supported by NSFC (no. 71071034, no. 71201023), NBRR (no. 2010CB328104-02), and Humanities and Social Science Youth Foundation of the Ministry of Education of China (no. 12YJC630101).