Abstract

In this paper, we propose an early warning model of credit risk for cross-border e-commerce. Our proposed model, i.e., KPCA-MPSO-BP, is constructed using kernel principal component analysis (KPCA), improved particle swarm optimization (IPSO), and BP neural network. Initially, we use KPCA to reduce the credit risk index for cross-border e-commerce. Next, the inertia weight and threshold of BP neural network are searched using MPSO. Finally, BP neural network is used for training the data of 13 different enterprises of cross-border e-commerce’s credit risk. To analyze the efficiency of our proposed approach, we use the data of five different enterprises for testing and evaluation. The experimental results show that the mean absolute error (MAE) and root mean square error (RMSE) of our model are the lowest in comparison to the existing models and have much better efficiency.

1. Introduction

According to the Wall Street Journal, on September 22, 2014, Alibaba, the giant of cross-border e-commerce [1, 2] was listed as the largest IPO ever with a financing amount of 25 billion USD. Though this achievement was mainly due to the development of cross-border e-commerce applications, the credit risk still exists for these applications. To understand this risk, various studies have been conducted in this context. However, there exists little research when it comes to early warning of credit risk for cross-border e-commerce applications. Based on the risk level, the early warning of credit risk for these applications can be divided into four levels. Among these levels, level 1 represents the lowest risk and level 4 represents the highest risk. During cross-border e-commerce transactions, it is always desirable to select enterprises having better performance for transactions, though the credit risk level of cross-border e-commerce is always dynamic. Selecting an enterprise with better credit among various cross-border e-commerce enterprises for transactions is a major challenge that needs to be studied in detail [3]. Distinguishing the risk level of these enterprises is one of the main reasons that have made this selection quite cumbersome.

In literature, there exist very few studies for early warning of credit risk for cross-border e-commerce. For example, the authors of [4, 5] used the least squares approximate support vector regression model to study early warning for cross-border e-commerce credit risk. In [68], the authors employed the Z-score model to analyze the early warning and attained relatively better results. In [9], BP neural network was used to study the early warning of cross-border e-commerce credit risk. From the aforementioned studies, numerous factors were derived that have a direct impact on the credit risk for cross-border e-commerce applications.

In this context, using the financial information disclosed for cross-border e-commerce by listed companies and selecting better and reliable enterprises for conducting research on credit risk seems a viable option. However, there are two main challenges associated with it [10]. First, due to the larger number of financial indicators of listed enterprises, some researchers mainly focused on using the principal component analysis (PCA) for the reduction of dimensions, but PCA is used for dimension reduction using linear transformation, which is not effective for solving nonlinear problems. There are some nonlinear problems in the credit risk indicators of cross-border e-commerce, which have been proved by previous studies [11]. In this context, neural network seems a much better option but it has few main drawbacks associated with it such as overfitting, poorly generalized performance, and a much slower convergence speed, among others. Neural network can cooperate with PSO algorithm for determining the inertia weight and threshold of a given neural network. However, the PSO algorithm may easily fall into local minimum which ultimately leads to loss of efficiency. Besides, the results of neural network based on PSO are not accurate in most circumstances.

To solve the aforementioned problems, we use KPCA, MPSO, and BP neural network for establishing a framework for early warning of credit risk in cross-border e-commerce applications. The major contributions are as follows:(1)Initially, the kernel principal component analysis (KPCA) is employed to solve the nonlinear feature vector extraction.(2)Next, MPSO is used to solve the local pole problem associated with particle swarm optimization (PSO). A small value phenomenon is resolved by dynamically adjusting threshold and inertia weight for a BP neural network. Based on the principle of KPCA, MPSO, and BP neural network, this paper makes an empirical study using the credit risk’s early warning of cross-border e-commerce.

The rest of this paper is organized as follows. In Section 2, we discuss our proposed combination model. In Section 3, we discuss the empirical analysis along with experimental results for our proposed model. Finally, the paper is concluded and future research directions are provided in Section 4.

2. Combination Model

In this section, first we discuss the principle of KPCA in Section 2.1. Next, we discuss the principle of MPSO in Section 2.2 followed by the operational principle of BP algorithm in Section 2.3. Finally, we discuss our proposed combination model in Section 2.4.

2.1. Principle of KPCA

KPCA is a multivariate statistical analysis technique that uses a kernel method by applying it to PCA. It maps multiple variables to a high-dimensional feature space via nonlinear mapping, and conducts analysis in a high-dimensional space. Based on ensuring that the relationship between the relevant variables remains unchanged, it strives to minimize the loss of original data information and reduces the credit risk index of cross-border e-commerce. Several main indexes that can reflect the characteristics of the original problem are obtained for this purpose [1012]. KPCA is a nonlinear feature extraction method, but it has not been applied to the credit risk early warning of cross-border e-commerce. It avoids the complex problem of solving eigenvectors in eigenspace and is realized by solving eigenvectors and eigenvalues of the kernel matrix.

Firstly, for a given cross-border e-commerce credit risk sample, is mapped to the feature space HD, where HD is a nonlinear mapping, then the covariance matrix C of the feature space sample is

In equation (1), the eigenvalue of C is decomposed into

In this equation, is the eigenvalue of matrix C and V is the eigenvector corresponding to λ. Since, V is a linear combination of , then there exists .

In equation (3), V belongs to the mapping function of the characteristic space HD; the generating space is

In equation (4), the matrix Ki,j defining n × n has

In equation (5), the eigenvalue problem of , αi depends on the kernel function Kij, K is a dot product kernel matrix, and equation (5) can be written as

In equation (6), for the kernel principal component extraction of cross-border e-commerce credit risk data, the projection P(t) of test set t in the direction of feature space vector Vl is calculated as follows:

In equation (7), it has been assumed that the training set satisfies equation (1) in the feature space, , and it is extremely difficult to satisfy equation (1) in the feature space. To relax the assumption , the matrix expression is introduced, and the kernel matrix is replaced by the following form:

In equation (8), the maximum number of principal components extracted by KPCA is n. If the first few feature vectors can reflect all the characteristics of cross-border e-commerce credit risk, the number of principal components extracted can be reduced [1315].

2.2. Principle of MPSO

Particle swarm optimization (PSO) is an intelligent algorithm inspired from the natural social behavior of fish and birds. In PSO, each “particle” reflects an entity that can be compared to a bird within a search space. The particle is used for solving the optimization problems. Each particle can search, follow, and memorize the current optimal particle within a solution space. The particle has a fitness value, which is determined by an optimization function and a speed that determines its flight direction and distance [16, 17]. Finally, a PSO algorithm finds the optimal solution via iterations. At first, it is initialized as a group of random particles. Particles chase two extremums and update their position in each iteration. One is the global extremum best, which is the optimal solution currently found by the whole group, and the other is the individual extremum best, which is the current optimal solution found by the particles themselves. When the fitness value of this time (current time) is equal to that of the previous time, the PSO will fall into the local minimum. To solve the abovementioned problem, PSO is improved by dynamically adjusting the weight of inertia. When the fitness value of this time is equal to the fitness value of the previous time, the particle swarm will fall into a local minimum. It can be predicted that in all future iterations if the inertia weight is equal, it is impossible to generate such an equal inertia weight. If the number of subsequent iterations is the best fitness value of the inertia weight generated by the current iteration, the inertia weight will inevitably change, and the change is irreversible, which makes PSO fall into the state of local extremum. The improvement method of PSO is introduced below. In the past, some improved algorithms for inertia weight were proposed based on equation (9). The improved inertia weight has serious disadvantages, and the improvement depends on the current iteration times and the maximum iteration times of the algorithm, which will make it fall into a local minimum. Because of these shortcomings, this paper proposes an algorithm to dynamically adjust the inertia weight, i.e., the method of first increasing and then decreasing controlled by the algorithm itself to improve the PSO algorithm [18, 19].

In equation (9), iter is the current number of iterations, itermax is the maximum number of iterations, ωmin is the minimum value of inertia weight, and ωmax is the maximum inertia weight. The basic idea is as follows: using a dynamic inertia adjustment strategy, first select the inertia weight of particles with the smaller initial value, so that the initial development ability of particles is stronger. If the fitness value ft −1 of the previous iteration is compared with the fitness value ft of the current iteration, the particle will fall into the local minimum at two fitness values f = ft −1. To enhance the global search ability of particles, we can call the linear function: ω = ω + α × ω, where α is a local random number in the interval (0-1), adjust the size of inertia weight, and increase the inertia weight of particles, and particles can jump out of the local extreme point. When the inertia weight increases to 1.4, the exploration ability among particles is the strongest, but the connection between particles is the weakest, i.e., the development ability of particles is the weakest. To enhance the connection between particles, a linear function, ω = ω + α × ω, is used. Here, a local random number in the (0-1) interval is used to linearly reduce the inertia weight to enhance the connection between particles. This method is repeatedly used to make the particles control and adjust the inertia weight dynamically, and the connection of particles is developed in the direction of enhancement so that the exploration ability and development ability of particles can reach the dynamic strategic balance. The adjustment formula of inertia weight is as follows:

In this equation, α is a random number between 0 and 1, ω is a linear function, ft is the fitness value of generation t, and ft −1 is the fitness value of generation t − 1. The dynamic balance between global search and local search of particle swarm optimization is represented by the above equation [2023].

2.3. BP Algorithm Principle

BP neural network is a predominant artificial neural network (ANN). It comprises an input layer, several hidden layers, and an output layer that has several nodes. Each node reflects a neuron, the lower and the upper nodes are connected through weights. The nodes between layers are fully interconnected, and there exists no connection among the nodes at each layer. A typical case of BP neural network is a three-layer structure network model with a hidden layer, as shown in Figure 1.

In an NN model, the relationship between an output yt and input can be expressed using the following equation:

In this equation, and are model parameters, n denotes the number of nodes at the hidden layer, m reflects the number of nodes at the model input layer, and is a conversion function. The commonly used function is .

BP neural network model described in equation (11) reflects the linear function mapping relationship between the input and output yt of the previous observation value of the sequence

In equation (12), ω is the parameter vector of all models; function is determined by network structure and weight parameters [24, 25], respectively.

2.4. Combination Model

Our combination model, i.e., KPCA-MPSO-BP, is discussed in detail here. The basic idea of this model is as follows: firstly, the eigenvalues and eigenvectors of the indicators of cross-border e-commerce credit risk are calculated. Then, the kernel principal component eigenvectors with cumulative contribution rate greater than 85% are calculated, and finally, MPSO is used to search the weights and thresholds of BP neural network. The current position and fitness of the particles in the swarm are expressed by the weights, thresholds, and errors of the BP neural network and the results are compared. To analyze whether or not to update the PSO algorithm, BP neural network is used for early warning calculation. The flowchart of our proposed model is shown in Figure 2.

The operation steps of our model are as follows:(1)Data collection and data preprocessing:The credit risk indicators of cross-border e-commerce are standardized, and the reverse indicators are converted into positive indicators. The method is 1-index value, that is, xij = 1 − xij, where xij is the reverse index and xij is the positive index after conversion. The formula of normalization is as follows:(2)Calculate the kernel matrix:After data standardization, the cross-border e-commerce credit risk data is analyzed by KPCA, and the eigenvalues and eigenvectors of covariance are calculated.(3)Feature values extraction:KPCA method is used to reduce the dimension of the input eigenvalue. The standard of dimension reduction is to select the kernel principal component eigenvector which contains more than 85% information.(4)Initialize PSO:Initialize the initial position and initial velocity of the population, set the minimum fitness value as ε and the maximum number of iterations as m, and transfer the initial position of particles to the initial optimal position of particles.(5)BP neural network initialization:The initial position of particles is transferred to BP neural network as the initial threshold and initial value of the BP neural network.(6)Output and average error:The output and average error of the BP neural network are calculated. The comprehensive score of sample data is taken as the expected output value, the average error of the BP neural network is calculated, the fitness value of particle swarm optimization algorithm is determined by the network error εt, and the fitness value of particle swarm optimization algorithm is determined [6].(7)Training set of BP neural network:Part of the data of cross-border e-commerce credit risk is taken as the training set of BP neural network, and the training calculation is carried out in the network.(8)Test set of BP network:Another part of the data of cross-border e-commerce credit risk is taken as the test set of BP neural network and tested in BP neural network.(9)Credit warning:According to the score of the BP neural network, the credit warning of cross-border e-commerce enterprises is carried out. Based on the results of test/inspection, the credit risk of cross-border e-commerce enterprises is given early warning according to the risk level.(10)Error analysis:To test the superiority of the KPCA-MPSO-BP model, mean absolute percentage error (MAPE) and root mean square error (RMSE) are calculated. These two metrics are used for error analysis:In equations (14) and (15), smaller the mean absolute error and root mean square error, the better will be the results.

3. Empirical Analysis

According to the characteristics of cross-border e-commerce credit risk enterprises, data sources use the index system of [1, 3] to make an empirical study on cross-border e-commerce credit risk.

3.1. Selection of Research Samples and Indicators

Taking the data of 18 international cross-border e-commerce enterprises as the sample, the selected indicators for our studies are sales profit rate X1, return on net assets X2, return on total assets X3, total assets turnover X4, cost and expense profit rate X5, inventory turnover X6, accounts receivable turnover X7, current assets turnover X8, interest multiple X9, digital certificate grade X10, quick ratio X11, current ratio X12, capital accumulation rate X13, platform service provider credit X14, fixed assets renewal rate X15, total assets growth rate X16, contract breaking rate X17, overdue payment rate X18, and trade volume growth rate X19. The 18 enterprises are named enterprise A, enterprise B, and enterprise R, respectively. The index meanings are shown in Table 1.

According to the definition of Table 1, the original data in Table 2 can be obtained by using the data in [3].

3.2. Data Processing

According to the combination processing method in Section 2.4, the standardized transformation of the original data (including the normalization of the reverse index into the positive index) enables the standardized cross-border e-commerce credit risk matrix. We adopt KPCA standard of the dimension reduction for selecting the cumulative contribution rate of the eigenvalue, which accounts for 85%. The initial position of the particle is transferred to BP neural network as the initial threshold and initial value of the BP neural network. First, 13 groups of data are used to form the training set, then the BP neural network is used to train, and then the other 5 groups of data are used to form the test set. Using the prediction test, the credit risk of cross-border e-commerce is divided into four categories: level 1 risk (80100), level 2 risk [60, 80], level 3 risk [30, 60], and level 4 risk [0, 30]. Among them, level 1 and level 2 are in a risk state, while level 3 and level 4 are in a risk-free state. If the score is greater than or equal to 60, it is no alarm, and if the score is less than 60, it is an alarm. The data after standardized processing are shown in Table 3.

3.3. Credit Risk Early Warning Analysis of Cross-Border E-Commerce

According to the early warning of cross-border e-commerce credit risk in Table 3, enterprises are classified, as shown in Table 4.

As can be seen from Table 4, there are 4 enterprises in good condition (level 1 risk). These enterprises include D, E, I, and P, respectively. These enterprises are in good financial condition. Next, there are 6 enterprises in Level 2 risk (normal condition) that include A, G, I, K, N, and Q, respectively. The financial status of these enterprises is good and there is no credit risk. There are 5 enterprises with level 3 risk (slight risk). The financial status of the cross-border e-commerce enterprises in this section is not very good, with slight credit risk. Finally, there are 3 enterprises with grade 4 risk (serious risk), including J, F, and O as the table shows. The financial status of the enterprises in this section is extremely dangerous, with a relatively serious credit crisis.

3.4. Model Combination Fitting Test

For the test of the model, the average absolute error and root mean square error of the model can be compared by BP, PSO-BP, MPSO-BP, and KPCA-MPSO-BP, as shown in Table 5.

It can be seen from Table 5 that the RMSE and MAPE of the KPCA-MPSO-BP model are the smallest, which indicates that the KPCA-MPSO-BP model has higher early warning accuracy than the single model, the MPSO-BP model has higher early warning accuracy than the PSO-BP model, and KPCA-MPSO-BP model has the highest early warning accuracy, which indicates that the KPCA-MPSO-BP model has the good fitting degree and the KPCA-MPSO-BP model proposed in this paper is scientific and reasonable.

4. Conclusion

To improve the ability to prevent the credit risk of cross-border e-commerce enterprises, this paper proposes a combination model of KPCA-MPSO-BP. This model is used to early warn the credit risk of cross-border e-commerce enterprises and analyze the relevant sample data. The results show that the KPCA-MPSO-BP combination model is effective in early warning for any credit risk. Compared with other models, KPCA-MPSO-BP has fewer parameters, faster calculation speed, and better fitting. Though our model has much better results, there is still room for improvement. For example, for enterprises with grade 1 risk (excellent state), there is no credit risk, so it is necessary to strengthen the data collection of credit rating to avoid the occurrence of credit risk. For the enterprises in the level 2 risk (normal state), there is a possibility of credit risk, hence we should strengthen the analysis of the potential factors of credit risk, strengthen the management, and prevent the occurrence of credit risk. For enterprises with level 3 risks (slight risk status), if there is a slight risk and they are in early warning status, we should strengthen the work of enterprise credit risk, improve the credit, and reduce the occurrence of credit risk. The enterprises with level 4 risk (serious risk status) are in the early warning status and have serious credit risk, so they should stop business, strengthen rectification and clean-up, and enter the risk plan procedure to prevent the occurrence of enterprise bankruptcy.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.