Abstract

In the era of big data and cloud computing, data research focuses not only on describing the individual characteristics but also on depicting the relationships among individuals. Studying dependence and constraint relationships among industries has aroused significant interest in the academic field. From the network perspective, this paper tries to analyze industrial relational structures based on cycle degree. The cycle degree of a vertex, that is, the number of cycles through a vertex in an industrial network, can describe the roles of the vertices of strong components in industrial circulation. In most cases, different vertices in a strong component have different cycle degrees, and the one with a larger cycle degree plays more important roles. However, the concept of cycle degree does not involve the lengths of the cycles, which are also important for circulations. The more indirect the relationship between two industries is, the weaker it is. In order to analyze strong components thoroughly, this paper proposes the concept of circular centrality taking into consideration the influence by two factors: the lengths and the numbers of cycles through a vertex. Exemplification indicates that a profound analysis of strong components in an industrial network can reveal the features of an economy.

1. Introduction

With the rapid development of the Internet, Internet of Things, and cloud computing technology, data has the potential for an explosive growth. The big-data era, which depends on cloud computing and cloud storage, has arrived. Large scale, diversity, and fast processing speed are the major characteristics of big data. Currently, data research focuses not only on describing individual characteristics but also on depicting the relationships among individuals [1, 2]. Relational data has been explored in the research of the economic management sector; for example, Acemoglu et al. used a degree sequence of a relational indicator to study the influences of the relationships among different departments on the fluctuation of the total output [3]; Hidalgo et al. studied an industry (product) upgrading issue by studying internet relational stricter among industries (products) [4]; Zhao et al. studied the linkage structure effect of blue economy under the perspective of an industrial network [5]; McNerney et al. studied the structure of interindustry relationships using networks of money flows between industries in 45 national economies [6]. On the basis of an analysis of industrial relational structures using relational data, this paper proposes the concept of circular centrality to detect unknown and potentially useful circular relational data hidden behind Input-Output Tables and study the topological properties of industrial circulation relationships.

In a connected component of an undirected graph, all the vertices are reachable from each other, but this is not necessarily true for a digraph. When there is no path from one vertex to another in a digraph, it is not reachable from the vertex. If there are mutually reachable paths between two vertices, they must be in the same subdigraph, a strong component. A strong component of a digraph, also known as a strongly connected subdigraph, is a maximally induced subdigraph of the digraph. Because any two vertices in a strong component can reach each other, a strong component has features that are different from other components. These make strong components meaningful to a digraph, especially to the one with practical significance such as an industrial network.

With vertices being the sectors and arcs representing significant industrial linkages, an economy can be abstracted into a network—an industrial network. The strong components of an industrial network depict industrial circulation linkages in an economy, where the former sector provides the input for the latter while the output of the latter comes back to the former. As shown in Figure 1, the sectors of coal mining, basic metals, and machinery and the linkages among them make up a strong component, and there are industrial circulation linkages among them. As fundamental linkages, industrial circulation linkages appear in various economic phenomena such as industrial clusters and circular economy [7].

In an industrial network, the general properties of strong components make them an important factor influencing the economy. Since any two sectors in a strong component can reach each other, any change of a sector can be felt by the others, and feedback effects also can influence the initial sector. The effects circulate with diminishing strength, as is shown in Figure 2.

Figure 2 describes an industrial network representing product flows. It contains 13 vertices, and 6 of them are in a strong component, forming vertices set . If any sector of undergoes some changes, such as shrink or expansion, the changes would pass on to all sectors of and pass it on to some vertices outside the strong component. For instance, if sector alters, all sectors of would change accordingly, and the effects circulate again and again over. As for the sectors outside, such as , would push it along the directed path . When the influence circulates within the strong component and reaches again, the effects on would repeat again with lessened strength.

Any changes of a strong component can influence the circulation of the network. For instance, if becomes isolated in Figure 2, there would be no circulation, and any changes of a sector in the network would pass on along unidirectional paths and stop.

Consequently, strong components play significant roles in an economy. Campbell [8, 9] recognized the importance and proposed that a strong component could be regarded as a vertex to build the condensation digraph. Some scholars followed this method, such as Morillas et al. [10]. This approach would do a lot to clarify the relationship structure of the whole network but did not involve the internal structure of a strong component. In order to do further researches on strong components, Zhao et al. [11, 12] developed the concepts of cycle degree and cycle length distribution of sectors and testified their applicability to show the structure of strong components.

Suppose that is an industrial network, is the set of all sectors in , and sector . An industry cycle through is defined as a directed cycle containing vertex , where no two vertices are the same. It is obvious that an industry cycle, the fundamental circular unit of an industrial network, is a closed path through . Any sector in a strong component is in one or more industry cycles. An industry cycle contains two sectors at least (regardless of loops—arcs beginning and ending at the same vertex) and all sectors of an industrial network at most. In Figure 2, there are four industry cycles through , that is, , , , and .

The cycle degree of a sector in an industrial network, denoted by , means the number of industry cycles through the sector, describing the number of circular linkages between the sector and the others. As a sector with bigger can influence more industry cycles, it has greater driving circulating power. In Figure 2, the cycle degrees of sectors , , , , , and are 4, 3, 3, 1, 3, and 2, respectively. If becomes isolated, there would be no cycles in the network. It can be seen that the sectors with big cycle degrees have more influences on the strong component. Now that the sectors outside the strong component do not exist in any cycles, their cycle degrees are all 0.

Differing from the degree, the cycle degree of a sector depicts the structure information from the whole network. A vertex with small degree is not necessarily with small cycle degree. With in-degree and out-degree both being 1, maybe a sector is only adjacent to two sectors, but the three sectors are all in multiple cycles, so that the cycle degree of the sector is large still. In Figure 2, with the in-degree and out-degree being 1, is 3, while is 1.

The concept of cycle degree can depict some features of the strong component, but it is not enough. For a vertex, its impact on circulation is related to its cycle degree, but also to the steps in circulations, which is reflected in the lengths of cycles. The effects circulating in a shorter cycle are stronger than in a longer one. If an indicator can capture the two aspects, it is more accurate. For this reason, the concept of circular centrality is proposed in this paper. In addition, in order to better analyze the strong component of an industrial network, the cycle degree of a strong component is presented too.

In order to analyze the effectiveness of these concepts, some practical calculations are made. With more than 30 years of sustained rapid growth, China’s economy has become one of the fastest growing economies in the world. This paper analyzes China’s strong components and compares them to those of the US and Japan. The result shows the differences between China and the other two countries.

The organization of this paper is as follows. Section 2 describes the indicators to research strong components based on cycle degree. Section 3 briefly explains the methodology we employed in this study. The next part presents the main empirical results and analyses. Finally, some conclusions are offered.

2. Relevant Indicators to Analyze Strong Component Based on Cycle Degree

Strong components of an industrial network can be analyzed from a vertex or from the whole network. From a vertex perspective, as different vertices play different roles in circulations, it is important to analyze these differences. From the whole network, the strong components with different numbers of cycles create different circular effects. In addition, the distribution of sectors with large cycle degree can also show the characteristic of the economy.

2.1. Circular Centrality

The cycle degree of a vertex depicts the number of closed paths through the vertex in circulations. For a vertex, cycle degree can show its effect on circulation, but that is not wholly so. As is seen in Figure 1, there are two cycles through vertex ①, ①→②→① and ①→③→②→①, but they have difference. Vertex ① gets to itself through two steps along the first cycle, but three steps along the second one. The length of the path affects the linkage between the two sectors. As the path becomes longer, the linkage would be weaker. To a closed directed path, it is obvious that the cycle’s length is a factor which can affect the roles of a vertex in circulation. If a vertex is contained in a special long cycle, the effect of the sector on itself would be slow and weak.

In order to express the effect of a sector on circulation, two factors should be taken into consideration, the cycle degree and lengths of cycles through the sector.

Freeman [13] proposed the concept of closeness centrality and suggested that the shorter the distance between a vertex and the others is, the higher closeness centrality it has. A vertex with big cycle degree and short cycles has a high circulation. Considering the two factors, we define the circular centrality of a vertex, which involves the concept of length distribution of cycles [11].

Length distribution of cycles depicts the numbers of cycles with different lengths through a sector. Regardless of loop, the length of the shortest cycle is 2. As the vertices in a cycle are different, the length of the longest cycle is no more than the number of vertices in the strong component, here assumed to be . Supposing denote the number of cycles through sector , we get length distribution of cycles . In Figure 2, there are 4 cycles through vertex , with two cycles of the length of 5 and the others of 2 and 4. So, the length distribution of industry cycles through is .

Length distribution of cycles involves the lengths of cycles and the corresponding cycle degrees at each length through a vertex. To a certain length, the ratio of cycle degree to its length shows the circular effect. The sum of all ratios depicts the role of the sector in circulation, which is defined as the circular centrality of a sector here. Denote the circular centrality of sector by , and then

The percentage is applicable when the vertices within strong component are compared with each other. The coefficient of circular centrality of sector , denoted by , is the percentage of circular centrality of sector in those of all sectors in the strong component, and then

The circular centrality of a vertex is a concept based on cycle degree but involves the lengths of cycles. A vertex with big coefficient of circular centrality plays an important role in circulation.

2.2. The Cycle Degree of a Strong Component

Cycles are the fundamental units of a strong component. There must be some cycles in a strong component, at least one. Complete strong components contain the most cycles. The more cycles a strong component contains, the stronger the circularity is. The number of cycles in a strong component is related to the number of vertices and the features of arcs.

The cycle degree of a strong component, denoted by , is the number of cycles it contains. As the number of vertices in a strong component is constant, it is highly correlated to the arcs there. In general, the cycle degree of a strong component is highly positively correlated to the number of arcs. Sometimes, it is also related to the directions of arcs, and so forth. In Figure 2, if the arc becomes , there would remain only one cycle.

As a cycle including several vertices, the cycle degree of the strong component is not the sum of cycle degrees of all vertices there. If there are vertices in a cycle, the cycle is counted times, for it is considered in calculating the cycle degree of each vertex in the cycle. In order to calculate it, we can classify all cycles according to their lengths and sum up the cycle degree at each length. Suppose there are cycles through sector and sectors in a strong component, . As the number of vertices is that of arcs in a cycle, the number of cycles in the strong component is .

Regardless of loops, arcs beginning and ending at the same vertex, as an industry cycle contains two sectors at least, the length of the shortest cycle is 2. And that of the longest cycle is , for sectors in the strong component. So, the cycle degree of a strong component, denoted by , is the sum at each length. That is,

The cycle degree of a strong component depicts network connectivity of the strong component. The bigger it is, the stronger the connectivity is.

3. The Methodology to Identify Strong Components and Calculate the Cycle Degree of a Vertex

In order to study strong components, we need to identify them from an industrial network first. In addition, calculating the cycle degree of a vertex is the basis to study the strong component. Here, we employ the methodology proposed by Zhao et al. [11] to do these.

3.1. Identifying Strong Components of an Industrial Network

A strong component should be identified from its unique features. All sectors in a strong component can reach one another along directed paths. If arcs are drawn along all directed paths, a strong component would become a complete subdigraph, where there are arcs between any two sectors, as is shown in Figure 3. In the adjacent matrix attached to a complete subdigraph, the entries off the principal dialogue are all “1.” According to this, we can distinguish complete subdigraphs from others and identify the strong components accordingly.

In order to identify the strong component of an industrial network, we draw arcs along all directed paths and get a new network, which is called the expansion network here. In the expansion network, the strong components of the primary network become complete strong components. Suppose that the adjacent matrixes of an industrial network and its expansion network are and , respectively. In the industrial network, if there are directed walks, length of from to , the entries at the intersection of row and column of would be [14]. Suppose that an industrial network contains vertices; then the length of the longest path (without cycles) of the industrial network will be not more than . The sum of the power sequence of the () matrices gives all walks no more than steps. When arcs are drawn along all paths, the entries of the corresponding adjacent matrix will become 1 at all positive entries of the sum matrix. So we can get the adjacent matrix from the Boolean summation (i.e., 1 + (#) 1 = 1) of the power sequence matrices; that is,where # denotes Boolean summation.

From , we search for all the appropriate and for satisfying and . All the vertexes and are in one complete subdigraph of the industry linkage expansion network; that is, they all belong to one strong component in the industrial network.

3.2. Calculating the Cycle Degree of a Sector

In a strong component, cycle degree of sector means the number of all cycles through sector , denoted by . In order to calculate, divide the cycles through sector into groups according to their length and denote the cycle degree of length by . It is obvious that the sum of cycle degrees of all lengths cycles is the cycle degree through sector . Consider

Ignoring the loops, the shortest cycle is 2 cycles (length of 2), and the longest is no more than the number of vertices in the strong component.

From the adjacency matrix of an industrial network, , we can get the liaisons of any two sectors. If , there is a direct linkage from sector to ; if ( all differ from one another, so that there is a directed path length of from sector to ), there is an indirect linkage from sector to , as is shown in Figure 4; otherwise, there is no linkage. When sectors and are the same one, the directed path becomes a directed cycle. In other words, there is a cycle through sector .

In order to calculate , we search for the directed closed paths (, … all differ from one another) first and then count. The number of all these paths is , and the sum of all lengths of cycles is .

As vertices are different from each other, the searching process is complex, especially to larger . But an industrial network has its unique characteristics, which make the calculation possible. Firstly, the number of vertices in an industrial network is clear and small, and so is that of a strong component. Secondly, as industrial networks are sparse, if we only calculate unit entries and omit nil entries of the adjacent matrixes, the calculation will be reduced greatly. The algorithm is as follows.

Suppose that there are sectors in a strong component, and is the adjacent matrix attached to the strong component. Let be equal to 0 for sector first. As is shown in Figure 5, the steps are as follows.

Step 1. Search for the suitable for .

Step 2. Search for the suitable for . As no loop is in a strong component, it is certain that . If , there must be a 2-cycle; then ; else, go to next step.

Step 3. Search for the suitable for , and . If , ; else, go to next step.

Step . Search for the suitable for , and . If , ; else, go to next step.

Step  . Search for the suitable for , and . If , ; else, go to next step.

Step  . Sum up (); then we get .

Applying the procedure to all vertices of a strong component, we obtain all cycle degrees.

4. Empirical Calculations and Analyses

In order to study the effectiveness of the concepts mentioned, some practical calculations are made. China’s rapid development has intrigued a great deal of economists to try to understand the process [1518]. So we try to analyze China’s strong components and compare them to those of the US and Japan.

For comparison purposes, the data employed are derived from the OECD Input-Output Database, which is a part of the structural analysis (STAN) database. Data are used from Domestic Input-Output Tables 2005 of China, the US, and Japan (the 2005 Input-Output Tables were the last ones available). The tables all have 37 homogeneous sectors, as presented in Appendix C, making it possible to provide internationally comparable data for research. In order to find the fundamental relationships from the impacts on total production, we calculate the corrected influence coefficients of industrial linkage (Chen and Zhao [19], see Appendix A), on the basis of which industrial networks are constructed (Zhao et al., 2011 [20], see Appendix B).

4.1. Identifying the Strong Component of China’s Industrial Network

Based on the methodology of Section 3, we obtain China’s strong component. There is only one strong component, containing 23 sectors and 144 arcs, as is shown in Figure 6. The cycle degree of the strong component of China is 1263356.

4.2. Cycle Degrees of Sectors in China’s Strong Component

The lengths distribution of cycles and cycle degrees of sectors in China are calculated, as is shown in Table 1. It can be seen that, to one sector, when the lengths of cycles are short, as the length is growing, the corresponding cycle degrees rise gradually. When the cycle degree reaches the maximum value, it starts to fall until reaching zero with the increase of the length.

To represent length distribution of industry cycles graphically, with abscissa being the lengths of cycles and ordinate being cycle degrees, we get the line charts of the top ten sectors, as is shown in Figure 7.

From Figure 7, it appears that the curves look like bells, increasing first and then decreasing. In the declining process, several curves overlap gradually. Given closer analysis it can be seen that as cycle becomes longer, there are more sectors in one cycle, so the cycle degrees of the top sectors increase simultaneously.

Dividing curves into halves in the middle parts, it can be seen that the differences of cycle degrees mainly come from the shorter cycles. Since shorter cycles circulate faster than the longer ones, the sectors with larger cycle degrees play more important roles and show greater competitiveness in an economy.

The cycle degree through a sector is an absolute value describing all closed paths through the sector. When we analyze how much a sector impacts the whole structure, relative cycle degree is needed. The relative cycle degree through sector is the percentage of the cycle degree through sector in that of the strong component, denoted by ; then

The relative cycle degrees of sectors in China are calculated, and the top ten ones are taken out, as is shown in Table 2.

4.3. Strong Component Comparisons among China, the US, and Japan

The cycle degree of the strong component of China is 1263356, while those of the US and Japan are 72712187 and 94706, respectively. It shows that, within an economy, the economic circulation of China is in the middle of the US and Japan. Relatively, Japan is the most dependent on foreign economy, and the US is the weakest.

For comparison, the coefficients of circular centrality and the relative cycle degrees of sectors are calculated and ordered in the three economies. Listing the top ten sectors, we get Table 2.

From Table 2, the top ten cycle degree sectors in the three economies basically share the same rankings with their coefficients of circular centrality. It is thus evident that the sectors with bigger cycle degrees have higher circular centrality generally.

Comparing the ten largest cycle degree sectors in the three economies, we can find that there exist some similarities but more differences in the three economies, as is shown in Figure 8. Overall, there are three common sectors, sectors 23, 25, and 32. They all belong to the service sector.

The distributions of the top ten sectors in the three economies are largely in accordance with their characteristics.

In China, the top ten sectors are distributed in three industries, one belonging to the resource industry, five to the manufacturing industry, and four to the service industry. Sector 2 (mining and quarrying) ranks in the top ten only in China. Of the five sectors belonging to the manufacturing industry, four are in the top five. It is evident that the manufacturing industry is the predominant part in China. The US shows its unique characteristics. Of the top ten sectors, nine belong to the service sector. Obviously, the service industry plays a dominant part in the US. Among them, sectors 28 (real estate activities), 31 (research and development), 36 (other community, social, and personal services), and 3 (food products, beverages, and tobacco) are in the top ten sectors only in the US, and sectors 28, 31, and 36 are even in the top five. Japan is somewhere between the US and China. The top ten sectors are both evenly distributed in the manufacturing and the service industry.

5. Conclusions and Further Research

Strong components are important components of a digraph for the circulations among vertices. Studying the strong components of an industrial network makes it easier to further identify the structural characteristics of the vertices and the whole network. To describe the strong components of an industrial network more accurately, this paper proposes the concept of circular centrality of a vertex and cycle degree of a strong component on the basis of cycle degree. As circular centrality of a vertex takes into consideration the influence by the lengths and the numbers of cycles through the vertex, it can better describe the real world. Using these indicators, the features of the strong components of China are analyzed and compared with those of the US and Japan.

We can analyze the strong components of an industrial network from two aspects: a vertex and strong component. From the viewpoint of a vertex, circular centrality is a concept involving the cycle degree and lengths of the cycles. Generally, a sector with a higher circular centrality has a stronger circulation and a larger influence on the economy. The study on the strong components of China and the other two economies shows that a vertex with a large cycle degree, with a few exceptions, has a high coefficient of circular centrality. From the viewpoint of a strong component, its cycle degree describes the number of circular paths within it. With a constant vertex number, a strong component with a larger cycle degree circulates stronger.

Although the concept of circular centrality of a vertex can describe strong components thoroughly, the calculation methods also have some limitations. Since the calculation process involves circulation, it may take a long time when there are many vertices and arcs in a strong component. As the intermediate inputs of industries are uneven (see Appendix B), the industrial networks are relatively sparse. This makes the method effective in most cases.

Generally, a sector with higher circular centrality has more competitiveness. In order to improve the competitiveness of an economy, we can try to change some linkages through policy guidance to enhance the circular level of the sector and the whole network. It is meaningful to find these linkages and take appropriate action to change them in future research. In addition, there are probably some vertices with more intimate relationships than others within a strong component. Finding these groups will be useful to further analyze the network.

Appendix

A. The Corrected Influence Coefficients of Industrial Linkage

From the perspective of the total output of an economy, corrected influence coefficients of industrial linkage delineate the effects of the linkages between two sectors on the total output with the unchanged final demand. The hypothetical extraction method (HEM) is a suitable method for quantitatively calculating the coefficients.

The original idea of the HEM is to extract a sector hypothetically from an economy and examine the impact of the extraction on the economy (Schultz [21]; Cella [22]; Clements [23]; Dietzenbacher and Van Der Linden [24]; Miller and Lahr [25]; Cai and Leung [26]). The differences of total output show effect of the linkages.

Similarly, to assess the linkage effect between two specific sectors, we assume that sector would purchase import goods to substitute for the inputs of sector completely. Thus, by comparing the total outputs before and after the hypothetical extraction, the linkage between sectors and can be calculated. Based on the Leontief model, the industry linkage effect is studied from backward linkages.

Formally, denote the technical coefficient matrix of an economy with sectors by , , and the total output vector by , , the final total demand vector by , . The impact of a sector on its own is not considered.

In accordance with the Leontief model, the basic balance equation is

According to HEM, it is assumed that the technical coefficient matrix is partitioned into two groups: group one () being the two sectors to be studied (sectors and ) and group two (, ) being the remaining sectors. and are divided accordingly. Considerwherewith .

Equation (A.1) can be rewritten as

Solve (A.4): where , , and denotes the identity matrix.

Now, suppose that the input from sector to sector is extracted from the economy; that is, . becomes , , and becomes . Equation (A.4) can be expressed as

Solve (A.6): where .

The difference between (A.5) and (A.7), denoted by , means the industry linkage effect of sectors and . Considerwhere , , and are row summation vectors for , , and .

The corrected influence coefficient between sectors and , , is the percentage of in the total output. Consider

B. Constructing the Industrial Network

The industrial network is a fundamental model to analyze economic structures. With information in Input-Output Tables, the industrial network can be constructed with sectors being vertices, while arcs (directed edges) represent industrial linkages. The numeric values matrices, such as input-output quantitative transactions, or coefficients calculated from them, such as corrected influence coefficients, depict various relationships and determine the existence of arcs between sectors. In principle, with a defined threshold value, a numeric values matrix can be transferred into an adjacency matrix attached to a directed graph, with all positive entries larger than or equal to the filter to unity and the rest of the entries to zero.

It can be observed that the intermediate inputs of any sector are uneven. In the technical coefficient matrix of any economy, only few values are large. For instance, in Chinese IO tables for the year 2005 (37 × 37), the 53 highest intermediate flows comprise 50% of the total intermediate consumption, while the remaining ones (1316) account for the other 50%. Nonuniform inputs make the industrial networks determined by several important relations.

Considering that the inputs have obvious inflection point, Zhao et al. [20] proposed the method to establish networks with Weaver-Thomas index.

The Weaver-Thomas index is an effective tool for finding the significant index developed by Weaver first and improved by O. Thomas later. Comparing an observed distribution with an assumed one, the closest approximation distribution is established to identify the key elements in numerical sequences. For the availability of finding crucial factors, it is widely used in regional economics.

Let denote the coefficient matrix of an economy with sectors (it can be technical coefficient matrix, or other matrixes); . According to the sequence from big to small order, rearrange and get the vector . Denote the th element of by ; then the Weaver index iswhere

Let ; the th element of , , is the threshold value. If , there would be an arc from sector to sector in the industrial network with the direction in keeping with the flow. Denote the adjacency matrix by ; then

C. OECD Sector Classification

See Table 3.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (no. 71371108).