Abstract

Refined composite multivariate multiscale fractional fuzzy entropy (RCmvMFFE), which aims to sensitively discriminate different short noisy multichannel financial data, is proposed as a new measure to quantify the complexity dynamics of multichannel time series in this work. To better comprehend the RCmvMFFE measure, the dynamical complexity analyses of multichannel synthetic dataset are comparatively studied with multivariate multiscale fuzzy entropy (mvMFE), refined composite multivariate multiscale fuzzy entropy (RCmvMFE), and refined composite multivariate multiscale fractional fuzzy entropy (RCmvMFFE). Then, these measures are firstly employed to explore actual multichannel financial index series to the best of our knowledge. The empirical analyses report that RCmvMFFE measure is able to deeply and sensitively dig up the market information hidden in the multichannel financial data and can better discriminate markets in different area compared to the traditional measures to some extent.

1. Introduction

Financial market is a typical complex dynamical system, and financial time series is believed to possess strong nonlinearity [1, 2], nonstationarity [3, 4], multifractality [5, 6], long-range correlation, and structural dynamics [79]. Digging up the statistical characteristics hidden in financial variables (price, volume, etc.) is going through a vital important time in theoretical and applied research fields, especially in risk management and modeling market dynamics [10]. The complexity measure is such a statistical value related to the unpredictability or structural dynamics of a time series. The larger complexity measure of a signal means it possesses more randomness in small time scale and more structural dynamics in large time scale. As a matter of fact, 1/f noise (holding long-range correlation) is more regular and possesses more structural information than white Gaussian noise (WGN) [8]. In the past thirty years, complexity analysis has gone a rapid improvement, and a lot of complexity measures and improved versions have sprung up. Among them, the most prominent one is entropy measure, for its simplicity and convenience implementation in modern computers. Entropy methods, such as approximate entropy (AppEn) [11, 12], sample entropy (SampEn) [12, 13], permutation entropy (PermEn) [1416], and fuzzy entropy (FuzzyEn) [13, 17, 18], have been successively introduced to explore the dynamical complexity hidden in financial and physiological data, which are commonly described as short and noisy series [1921]. Among them, AppEn measures regularity or randomness of a series through constructing difference of mean logarithmic function of the probability of vector pairs within tolerance of template vectors with two adjacent integer embedding dimensions [11]. SampEn, which is a revised method of AppEn, overcomes the dependency on parameters by excluding self-matching [12]. Moreover, FuzzyEn is also a revised version of SampEn by introducing continuous differentiable exponential fuzzy membership function to overcome the sensitivity to parameters, tolerance in particular [13]. While structural property is universal in nature [8, 22], some counterintuitive results are reported in applications with the above single-scale entropy measures for inherent natural multiple spatial and temporal scales in complex dynamics system, so multiscale methods are proposed in response to the proper time and conditions. While it is found that traditional methods cannot distinguish different time series very well, fractional fuzzy entropy (FFE) [18] is proposed by combining entropy measure with fractional calculus [2325], and it can dig up fractional order information hidden in a nonlinear complex system sensitively and can obtain more exact and deep understanding. Composite multiscale entropy family has been put forward by combining the conventional entropy methods with refined and composite technique to estimate dynamical complexity of short-term financial data, and they show significant superiority in reliability and stability, such as refined composite multiscale permutation entropy (RCMPE) and refined composite multiscale fuzzy entropy (RCMFE) [19, 26]. Moreover, the real world is essentially multivariate, financial data in particular. Multivariate entropy is proposed to analyze complexity of multichannel data and has been utilized to detect multichannel physiological and environmental series [27], and it is firstly used to discuss the dynamical complexity of multichannel financial time series compared with RCmvMFFE to the best of our knowledge.

In this work, inspired by the influential studies [17, 20, 27], combining multivariate entropy with fractional calculus, multiscale, and composite technique, a developed complexity version, refined composite multivariate multiscale fractional fuzzy entropy (RCmvMFFE), is proposed to explore and discriminate the complexity dynamics of multichannel time series. Moreover, through the analyses of the multichannel synthetic dataset and financial data, the experimental results validate that the proposed RCmvMFFE displays a better performance compared to traditional benchmark complexity measures to some extent.

The remainder of this paper is organized as follows. Section 2 reviews the FuzzyEn, FFE, RCMFE, RCmvMFE, and RCmvMFFE methods briefly. Section 3 evaluates the effectiveness of mvMFE, RCmvMFE, and RCmvMFFE with multichannel synthetic dataset. Section 4 presents the complexity analysis results of multichannel financial time series in different regions, followed by conclusions in Section 5.

2. Methodologies

2.1. Fractional Fuzzy Entropy

Fuzzy entropy (FuzzyEn), which is a revised complexity measure of SampEn based on vectors shapes and fuzzy membership function family, shows significant advantages in reliability and stability to noise and freedom to parameters selection and gets widespread applications in physiological and financial fields. The highlight of FuzzyEn is that continuous differentiable exponential function family in place of rigid Heaviside function is used to evaluate similarity degree between vectors to avoid the sensitiveness to parameter selection.

The FuzzyEn algorithm can be briefly reviewed as follows; for more details, see [13, 17]. Given a time series with length , , firstly an -dimensional vector series, , is reconstructed by the well-known phase space reconstruction theory [28], and each subtracts its mean value to eliminate the influence of tendency as follows:where is predefined embedding dimension which controls the amount of information each vector conveys and is the arithmetic mean of the vector for baseline removal defined as follows:

Every point in phase space represents an instantaneous state of a system considered. Then, for vector series reconstructed above, the similarity degree of an arbitrary to another is calculated by a fuzzy membership map family as follows:where the predefined parameter represents the gradient of boundary and controls the weight of vector similarity, the width of the fuzzy function, and the infinite norm of the -dimension difference vector in the work.

Average of all the similarity degrees and ensemble similarity degree of any vector pairs in the mean sense can be obtained as follows:

Analogously, for dimension case, with reconstructed when next point is added, the ensemble similarity degree can be also obtained as follows:

Finally, for the given with finite length, the FuzzyEn is estimated as follows:

Then, fractional fuzzy entropy, which is a developed version of FuzzyEn measure, is introduced to sensitively dig up fractional order dynamics of a complex system by combining FuzzyEn with fractional calculus [23, 24]. The fractional fuzzy entropy (FFE) of the given series is estimated as follows [18]:where represents the order of fractional derivative, the gamma function, and the digamma function with the form , respectively. Furthermore, FFE degenerates into conventional FuzzyEn as .

Conventionally, the hyperparameters and are preset to be two small positive integers for avoiding the loss of information, is suggested to be between 3 and 7 [12], is fixed to be 2 in the work, and tolerance is set to be , where SD presents the standard deviation of the original series.

2.2. Refined Composite Multiscale Fuzzy Entropy

With the extensive applications of entropy measures in more and more fields, some counterintuitive results are reported in single-scale entropy analysis for not considering the natural complex spatial and temporal scales inherent in complex systems considered, especially when pathologic signals are compared to healthy signals [8]. Multiscale entropy is proposed by combining the concept of coarse graining with entropy analysis, to measure the multiscale structural information inherent in dataset, for example, multiscale sample entropy (MSE) and multiscale fuzzy entropy (MFE) [8, 14]. Refined composite multiscale fuzzy entropy (RCMFE) is proposed by calculating FuzzyEn values with averaging similarity degree of multiple embedded vector series with different starting points originated by the Takens embedding theorem to overcome the instability of the above MFE measure [19]. The algorithm of RCMFE for a given mainly consists of the following two steps:(1)Coarse graining for multiscale evaluation of fuzzy entropy: a “coarse graining” process with scale factor is firstly utilized for the original time series to get coarse-grained sequences, and the corresponding sequence is defined as follows:where represents rounding toward negative infinity.(2)Calculating refined composite multiscale fuzzy entropy of sequences considered: for a fixed scale factor , the two corresponding values and are, respectively, calculated for sequence with embedding dimension and based on the above MFE. Then, the mean of and for all indexes denoted as and are computed, respectively; i.e., , and . Finally, RCMFE can be estimated as follows:In particular, the RCMFE degenerates into traditional MFE measure when .

2.3. Refined Composite Multivariate Multiscale Fractional Fuzzy Entropy

Recent rapid developments in multimodal sensing and computer science have enhanced the necessity for more insight into the dynamical system considered, financial system in particular. Multivariate entropy (mvEn) generalizes univariate entropy to analyze complexity dynamics of multichannel data, and it has been widely used to analyze real-world multichannel physiological and environmental data. The highlight of mvEn is considering all the composite delay vectors, which are reconstructed by multivariate embedding theory [29], in the subspaces and then comparing them within and across the channels.

The multivariate multiscale fuzzy entropy (mvMFE) is improved by combining coarse graining and multivariate fuzzy entropy. The mvMFE algorithm is briefly described as follows for a given multivariate data , where represents the number of samples in each channel and the number of variates (i.e., channels).(i)Define coarse graining series with a fixed temporal scale. Given a scale , the elements of the multichannel coarse-grained series are defined as follows:(ii)For a -variate time series , form composite delay vectors by the multivariate embedded reconstruction theory as follows:where , is corresponding embedding vector, embedding dimension, and time delay vector.(iii)Calculate the distance for any composite delay vector pairs and as the maximum norm of the difference vector . For any given and a threshold , the similarity degree of to is defined as ; then, define a global quantity as the average membership degree as follows:(iv)Extend the embedding dimension in (11) from to . This can be performed in different patterns, for example, from to for any .(v)Calculate similarly which represents the average of all similarity degrees in -dimensional phase space. Finally, mvMFE is estimated as follows:

As mentioned above, according to the proposed refined composite technique, the refined composite multivariate multiscale entropy-based approach firstly generates coarse-grained multivariate time series , , as follows:For each scale , different multivariate signals with different starting points are obtained. For each multivariate sequence , membership degrees and are separately calculated. Next, the average values and on scale factor are calculated. Then, the RCmvMFE is estimated as follows:

Finally, the corresponding RCmvMFFE value is calculated as follows:where and the RCmvMFFE degenerates into traditional RCmvMFE measure as .

3. Validation of RCmvMFFE for Multichannel Synthetic Dataset

In this section, we explore the complexity dynamics of three-channel synthetic dataset whose channels consist of Gaussian white noise (WGN) and 1/f noise. We first apply mvMFE, RCmvMFE, and RCmvMFFE to analyze this dataset, which are commonly utilized to assess the performance of entropy-based methods. All three channels are independent, and the number of sample points of each channel is 5500. We choose four datasets with different constitutions of WGN and 1/f noise: all three channels contain 1/f noise (group 1); two channels contain 1/f noise and one WGN (group 2); one channel contains 1/f noise and others WGN (group 3); and all three channels contain WGN (group 4). The corresponding mvMFE, RCmvMFE, and RCmvMFFE results are shown in Tables 16 and Figures 1 and 2.

The mean mvMFE and RCmvMFE values of the three-channel synthetic dataset (with 20 independent simulations) with different time scales are listed in Tables 1 and 2, and the corresponding standard deviation is presented in Tables 3 and 4. For each synthetic dataset, the two kinds of entropy values decrease as the scales increase, respectively. The standard deviation values gradually increase as the scales increase (except for group 4), which could be a shorter coarse grain series for a larger time scale . Moreover, for a given scale , entropy measures for group 1, group 2, group 3, group 4 are from big to small, which indicates that multichannel series with more 1/f noise possesses more structural complexity than those with less 1/f noise, which is in keeping with the anticipation [8]. Moreover, compared with mvMFE, the standard deviations of RCmvMFE in each scale factor are less, which shows that RCmvMFE combined with refined and composite technique can produce more stable results.

The corresponding results are also depicted in Figure 1; the plots show arithmetic means of 20 independent simulations, and error bars show the standard deviation. Figures 1(a) and 1(b) illustrate the results of mvMFE and RCmvMFE for three-channel simulation dataset containing independent WGN and 1/f series, respectively. The results are in line with the fact that WGN has less complex structure than 1/f noise and multivariate data with more WGN have less complex structure than those with more 1/f series. For all multichannel data, complexity curves descend as scale factor increases, and the fluctuations of the RCmvMFE curves are significantly smaller than those of the mvMFE. On the other hand, mvMFE and RCmvMFE cannot discriminate these series significantly at large scales, for group 1 and group 2 in particular, which may seriously hinder the practical applications. Moreover, it can be found that group 1 and group 2 nearly cannot be separated in Figure 1(a) at large scales, and they can be separated in Figure 1(b).

Then, the RCmvMFFE is chosen to explore the dynamical complexity of the above three-channel synthetic series. Table 5 lists the RCmvMFFE values with different fractional exponent from to 0.5 with step size 0.1 and fixed . According to Table 5, as increases, the RCmvMFFE touches the maximum slowly, with at approximately 0.10, and then decays quickly. Figure 2(a) depicts the corresponding RCmvMFFE curves, where all curves are significantly separated, around 0.1 in particular. This further validates the feasibility of the RCmvMFFE measure, which can better distinguish different multichannel synthetic data. Next, Table 6 lists the RCmvMFFE with different time scales and fixed . Figure 2(b) depicts the corresponding RCmvMFFE curves and shows a similar pattern to that in Figure 1(b), but there is a more significant discrimination. Furthermore, RCmvMFFE method gets larger separation between entropy values in synthetic data than mvMFE and RCmvMFE; hence, RCmvMFFE can distinguish these data more significantly than others. In brief, RCmvMFFE method can conquer the shortcomings of mvMFE and RCmvMFE methods to some extent, and with a suitable fractional exponent it is relatively stable and can sensitively dig up the inherent properties of different time series.

4. Complexity Analyses for Multichannel Financial Time Series

In this section, the complexity dynamics of the multichannel financial time series are explored. We choose 9 important international stock indices from three different regions (i.e., Americas, Europe, and Asia and Pacific) in order to better confirm the application of the proposed complexity measure in real markets. In addition, there are five developed (D) and four emerging (E) stock indices. The 9 indices are BVSP (Brazil, E), MXX (Mexico, E), and GSPC (USA, D) (from Americas market); FCHI (France, D), GDAXI (Germany, D), and N100 (France, D) (from Europe market); and SSEC (China, E), TWII (China, E), and N225 (Japan, D) (from Asia and Pacific market), respectively. The corresponding data are collected from the Yahoo Financial website (https://in.finance.yahoo.com). The time interval of stock indices (closing price and trading volume) is from Jan. 2, 2003, to Oct. 18, 2019, with approximate 4100 trading days (but there are some slight differences because of the slight different festival and holiday days in the above stock markets, and some individual missing transaction data are supplemented by interpolation method for the sake of synchronicity). In the following work, we adopt logarithmic return given by , where represents the closing price of -th trading day. In terms of trading volume, we also adopt the similar logarithmic difference to decline the diversity between them. We also standardize the above data considered to keep uniformity.

4.1. Complexity Measure of Bivariate Stock Indices

In order to explore the different complexity behavior of different financial indices under investigation, we consider a bivariate dataset, whose one channel contains the logarithmic return and the other contains logarithmic trading volume difference of financial indices.

The mvMFE, RCmvMFE, and RCmvMFFE analyses are used to explore the above bivariate dataset. The hyperparameters and are fixed as 2 and 0.15 in this paper for simplification purposes, and the corresponding results are exhibited in Figure 3 and Tables 79. Figure 3 displays entropy of the bivariate dataset with mvMFE, RCmvMFE, and RCmvMFFE measures with scale from 1 to 30 with step size 1, and fractional order exponent is fixed to be 0.10 in RCmvMFFE method. In Figure 3, for all bivariate stock indices, similar to MFE curves, RCmvMFE and RCmvMFFE curves decrease when scale increases. Moreover, entropy curves of FCHI and SP500 are under those of SSEC, TWII, and MXX in large scales in RCmvMFE and RCmvMFFE method, which may be because security markets in developed countries are more mature and efficient than in developing countries and possess more random dynamics, while there are long-range dependence in developing countries (similar to 1/f noise) in some sense. This validates that the entropy value of series with long-range correlations is theoretically larger than that of a random signal in large scales [8]. In addition, for mvMFE and RCmvMFE methods, the entropy curves of Asian market are well discriminated, while difference of developed countries is not significant. Although the graphs in Figure 3 could distinguish different stock indices, the gaps between them are relatively small. Furthermore, in the RCmvMFFE analysis, the gap between the two kinds of markets is more significant intuitively, and the gaps between different curves are larger than in mvMFE and RCmvMFE. This means that RCmvMFFE is superior to mvMFE and RCmvMFE in describing the complexity behavior of multichannel financial time series in some sense.

4.2. Complexity Measure of Trivariate Stock Indices in Different Regions

In order to explore the different dynamical complexity behavior of stock indices in different regions under investigation, we consider a trivariate dataset, whose channels contain the logarithmic return of three stock indices from the same region: Americas (BVSP, GSPC, MXX), Europe (FCHI, GDAXI, N100), and Asia and Pacific (SSEC, TWII, N225).

Figures 4(a) and 4(b) show mvMFE and RCmvMFE of the above three-channel dataset. Figures 4(c) and 4(d) show RCmvMFFE for fixed and for fixed scale = 20, respectively. It is interesting that entropy curves of Americas are interwoven with those of Asia and Pacific, and those curves gradually decline and approximate to a fixed number with the increasing scale, but entropy curves of Europe decrease quickly to the minimum value and then gradually increase, and they are always below other curves. Compared with Figure 4(a), Figure 4(b) looks smoother, which confirms that the RCmvMFE is more stable than mvMFE to some extent. Compared with Figure 4(b), Figure 4(c) shows larger gaps, which indicates that the proposed RCmvMFFE has better performance than RCmvMFE in some sense in complexity analysis of short and noisy multichannel financial data.

Tables 10 and 11 list mvMFE and RCmvMFE values of trivariate stock indices with different scale , where scale is chosen to be 1, 5, 10, 15, 20, 25, 30, 35, 40, respectively. For all trivariate stock indices, entropy measures decrease when scale increases, and for a fixed scale , entropy measures of SSEC, TWII, and MXX are larger than those of FCHI and SP500, which is similar to Figure 3.

Then, the RCmvMFFE analysis is used to explore the dynamical complexity of the trivariate stock indices. Table 12 displays RCmvMFFE values of trivariate stock indices with different time scale factor and fixed . Table 13 exhibits the RCmvMFFE with different from to 0.4 with step size 0.1 and scale = 20. As increases, the RCmvMFFE values touch the maximum and then decay quickly. It can be found that, in Table 13 and Figure 4(d), RCmvMFFE touches the maximum value with at approximately 0.08. Figure 4(d) depicts RCmvMFFE curves of return series with different , where all curves are markedly separated. This further validates the effectiveness of the RCmvMFFE measure in discriminating different multichannel financial data.

5. Conclusions

In this paper, a novel complexity measure for multichannel time series, i.e., RCmvMFFE, is proposed by combining traditional mvMFE measure with fractal theory, refined and composite technique, which can stably and sensitively dig up underlying fractional order behavior and explore structural dynamical complexity in a multivariable complex system. To better comprehend the RCmvMFFE method, the dynamical complexity analyses of multichannel synthetic dataset are comparatively performed with mvMFE, RCmvMFE, and RCmvMFFE. Then, RCmvMFFE method is also employed to explore multiple multichannel financial index series. The experimental analyses report that RCmvMFFE measure is able to deeply and sensitively dig up the market information hidden in the multichannel financial data and can better discriminate different area markets compared to the traditional measures to some extent.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The authors were supported by the Natural Science Foundation of Universities of Anhui (KJ2019A0141), the Startup Foundation for Introducing Talent of AHPU (2018YQQ024), the Pre-research Project of National Natural Science Foundation of AHPU (2019yyzr15), and the Youth Fund of AHPU (2016YQ29).