Abstract

The hierarchically nested structure is widely observed in a broad range of real systems, encompassing ecological networks, economic and trade networks, communication networks, among many others. However, there remain statistical challenges of the prevalence of nestedness. In response to this problem, we focus on the effect of incomplete information and the inputted matrix size, the role of network density and degree sequences, and the relevance of degree-degree correlation to conduct systematic research on the significance of the nested structure according to multiplex world trade networks. Firstly, the nested structure can observe significantly when suffering incomplete information and varying inputted matrix size. Secondly, to analyze the role of network density and degree sequences in nested structure, we use “swappable rows, swappable columns” null model which conserves the network size and density, and “fixed in-degree, fixed out-degree” null model which not only preserves the network size and density but also keeps node degree to do randomization tests of nestedness. The randomizations of two null models remark that most nested structures are not determined by network density and degree sequences but closely related to them. Finally, we investigate degree-degree correlation in nested networks, and the results show that the nested structure is negatively related with degree-degree correlation of the network. Following the empirical analysis train of thought, we argue that nestedness is still a unique feature of the network.

1. Introduction

The nested structure has been observed in ecological and socioeconomic systems as diverse as ecological mutualistic organizations, world trade, and interorganizational relations, among many others [1, 2]. Given a network of interacting nodes, nested structure can be described as the tendency for nodes to interact with subsets of the interaction partners of better-connected nodes. A perfectly nested matrix can be shown that the entries in each successive row and column are respectively a strict subset of those in the previous row and column [3]. In ecological systems, nested patterns in mutualistic networks are still open challenges that have led to multiple debates to date such as biodiversity and structural stability [47]. In economic systems, one has often access to only limited information about the nested structure of the social and economic networks to capture economic complexity [8, 9]. The nested feature has already inspired a number of methods quantifying different aspects of socioeconomic systems which include exploring the nested nature of the observed networks [1, 10], as well as predicting the evolution of industrial ecosystems [11].

Although scholars are currently debating around the role of the nested structure in ecological and economic systems, the observed architecture of ecological and socioeconomic networks differs significantly from that of random networks [12]. A lot of works are fascinated by statistical challenges in null model analysis to uncover the network structure [13, 14]. For instance, Strona et al. [15] developed a new procedure named the curveball algorithm, in that it focused rather on matrix information content than on matrix structure. Fosdick et al. [16] investigated the configuring random graph models with fixed degree sequences. Orsini et al. [17] gave the release solution to the evaluation of network randomness, and the null model analysis was used to confirm the prevalence of nestedness. Jonsson [18] analyzed randomization tests of nestedness in 52 mutualistic networks and showed that they were highly nested, and nestedness increased with the number of interactions of the network. Staniczenko et al. [4] showed that complex ecological networks were binary nested, but quantitative preferences were nonnested. Payratoborras et al. [19] applied a theoretical approach to the analysis of 167 real mutualistic networks that nestedness was not an irreducibly macroscopic feature but an entropic consequence of the degree sequences. Furthermore, the similar efforts could try to distinguish nestedness and network property such as modularity [20, 21], core-periphery structure [22], and degree-degree correlation [23]. These studies are trying to solve statistical challenges of the prevalence of nestedness, but the networks that they used are different types or through multiple channels. Naturally, the network structure of world trade networks is complex [24, 25]. Although the scale of the networks is limited by the number of countries and the number of products, not all countries participate in trade which ultimately leads to different network sizes for different products. Therefore, these networks are not the same size and scale, and other network characteristics are also different (as shown in Figure 3). So, the multiplex world trade network is a good sample for analyzing the significance of nested structure. Recently, the network structure can be significantly investigated in the worldwide trade network [26] or be extended to multilayer network [27]. There have also been many recent studies on the null model or the prevalence of nestedness in world trade networks. For instance, the randomization method can assess in detail the role patterns of the world trade networks in binary [28] or weighted [29], monopartite or bipartite [30], aggregated or disaggregated by commodity [31], or across several years [32].

In this paper, we remark the significance of nested structure according to hundreds of world trade networks. The statistical challenges of the prevalence of nestedness are mainly solved from the following three aspects: firstly, how is the robustness of the nested structure when suffering incomplete information and varying the inputted matrix size. Secondly, to analyze the role of network density and degree sequences in nested structure, we will use two null models to do randomization tests of nestedness. Finally, we will investigate the relationships between nestedness and four types of the degree-degree correlation in multiplex world trade networks.

2. Methods and Materials

Figure 1 illustrates the adjacency matrices of three toy networks with the same number of elements but the different level of nestedness. The network visualizations of these three matrices are given in Figure 1(a). Although the number of nodes and links are the same in these networks, the level of nestedness is totally different as the nestedness is determined by the particular directionality of links as revealed if the rows and columns of these matrices are both reordered by descending degree () in Figure 1(b). The matrix with the triangular structure on the right is fully nested, while the matrix with random shape on the left is not nested.

Generally, we can define that the network consists of sets of generalist nodes and sets of specialist nodes. The specialists interact only with a small subset of generalist nodes, while the generalists interact with (almost) all other nodes in the network. This network organization reveals the nested structure. Nestedness is a statistical property of interaction data presented in matrix form. Here, a simple and widely used metric is named the Nested Overlap and Decreasing Fill (NODF) [33]. In the beginning of calculation, the row and the column of a matrix () are swapped and rank-ordered by the sum of the presences in each of these rows and columns, respectively. The transformed matrices are then ready to be processed by the following equation:

Here, is the number of 1 in row, i.e., degree. The takes values between 0 (unnested) and 1 (perfectly nested). Supplementary Information S1 describes the step by step calculation of the nestedness measurement (NODF), and Figure S1 gives an example of calculation of nestedness which is from ref [3]. According to the NODF metric, the nestedness of three toy networks, respectively, corresponds to 0, 0.5, and 1, as shown in Figure 1.

To further analyze the characteristics of nested structure, we use the world trade datasets to construct commodity networks as analysis objects. These datasets consist of imports and exports both by destination. The data comprises 261 countries and 786 products (categories). The detailed introduction of the data is seen in Supplementary Information S2 and Figure S2. For a target product , we can select the trade interactions just contained the target product trade information and then build a matrix , which can be expressed as the country exports the target product to the country . To build a binary network, it is necessary to check whether a trade interaction performance revealed comparative advantage [34]. We need to estimate whether a trade interaction between two countries is significant in the trade network of the given product and can calculate whether a trade interaction’s share of a country’s trade is larger or smaller than the country’s share of the entire whole market. Mathematically,where is equal to the dollar volume of a trade interaction between country and country . The natural cutoff used to determine whether a trade interaction has revealed comparative advantage is . We can therefore construct the binary country-country matrix from the RCA matrix, if we consider,

Thus, we can construct a directed network for a given product from view of network science. The nodes represent countries, and then if , there is a link connected between two countries and . The same steps for each product; we can build 786 networks. It is emphasized that we only remain each network of countries which are more than ten (i.e., the number of nodes is more than ten). We investigate the relevant characteristics of the multiplex world trade network as shown in Figure 2. We can see that the matrix size, network density, and average degree in hundreds of networks are extensive. From Figure 2(a), we find that the inputted matrix size is mainly distributed between 15000 and 45000, and in Figure 2(b), the network density is mainly around 0.06 and 0.1, then the percentage gradually decreases after being greater than 0.1. Further, the statistics of the average degree of the network is shown in Figure 2(c); we can observe that the average degree of the network is mostly around 35 to 45. The network size, network density, and network degree of each layer of the multilayer network are different. So, the multiplex world trade network is a good sample for analyzing the significance of nested structure.

After inputting matrices, we can quantify the nestedness in each layer of the multilayer network with the NODF metric and report their distribution in Figure 3. We set and the nestedness distributions under other threshold as qualitatively similar (the effect of on results is shown in Figures S3, S4, and S5). One can see that the nestedness of layers varies significantly. The range of nestedness values is [0.1, 0.6], which is close to normal distribution. In addition, the matrices are representations of the different layers of world trade networks which respectively correspond to the network of bovine, motorcycles, and medical instruments seen in Figure S6. The nestedness of bovine, motorcycles, and medical instruments is 0.12, 0.37, and 0.57 respectively, which locates in the low, medium, and high areas of nestedness distribution. Although the scale of the matrices are limited by the number of countries and the number of products, not all countries participate in trade which ultimately leads to different matrix sizes for different products. Therefore, these networks are not the same size and scale as toy networks. The network size, network density (the ratio between the number of ones and network size), and other network characteristics of the three networks are different which also suggests a good sample for analyzing the significance of nested structure.

3. Results

As previously shown by the network of bovine, motorcycles, and medical instruments, the structure of the three networks is different. We are interested in the relations between nestedness and the common network features including the network size, network density, and degree.

3.1. The Effect of Incomplete Information and Inputted Matrix Size

The collected world trade data are inevitable to exist absences and some noise. For example, it is difficult to collect trade data of some countries such as Iraq and North Korea. In addition, the trade volume in the collected data could not be completely accuracy in statistics. This is practically meaningful, as in reality, the missing trading data could happen commonly [35, 36], so we need to consider these two situations and employ two ways to simulate these problems. The first is that randomly swapping the interactions is limited to two products in the pairs of same countries with the exporter and the importer. We randomly choose an amount of trade interactions and independently run the swapping simulations. The swapping ratio is between the chosen interactions and the total interactions. The second is randomly removing a certain amount of trade interactions. The removing ratio is between a certain amount of removed trade interactions and the total interaction. We then investigate whether the nestedness ranking is influenced significantly in the case of incomplete and noisy information and compute Kendall’s tau correlation coefficient [37] between the original nestedness list of products and their new nestedness list. As revealed in Figure 4(a), the lower correlation value is that the larger the link swapping ratio and link removing ratio are, the correlation just drops a little. It indicates that the nested structure is robust against incomplete and noisy information. We can also find that when links have changed, the nested structure of whole trade networks after removing links is more robust than that after swapping links.

The other problem is that some countries do not have data or different thresholds of RCA, resulting in different sizes of the inputted matrix. So, we analyze the impact of the size of the matrix on the nestedness of the each network. It has already been pointed out that in sparse networks, the high nestedness might not result from the significant triangular pattern in the adjacency matrix, but due to the large number of empty rows and columns [6, 23, 38]. In these cases, the nestedness metric will become much smaller if these empty rows and columns are removed. To examine whether the nested structure in the multilayer trading networks is truly significant, in each layer, we one by one remove the remaining nodes with the smallest degree. We then compute Kendall’s tau rank correlation between original nestedness of layers and their new nestedness list after a fraction of smallest degree nodes are removed. The detailed process is as follows. We first introduced basic two definitions. Network size is defined as the length of column multiplied by the length of row in inputted matrix. The given network has links, and then network density is the total number of links divided by network size. For each inputted matrix, the size of matrix is different. We investigate effects of network size on measuring nestedness. The transformed matrices are then ready to be processed by the nestedness metric, which is that the rows and the columns of a matrix are swapped and rank-ordered by the sum of the presences in each of these rows and columns, respectively. After this, the rows and columns are closed to the border of the matrix, which are very sparse. We first remove a certain number of rows and columns from the margin of a triangular matrix defined as . Then, we calculate the nestedness of the resulting removed matrix and the removal ratio

Finally, the Kendall rank correlation is employed to analyze the relation between original nestedness list and new nestedness list. In Figure 4(b), one can see that as the removal ratio increases, the correlation gradually decreases, indicating that as the matrix size changes, the nestedness list of the networks also changes, but the correlation can reach 0.8 even if 80% matrix is removed. Until 80% matrix is removed, the correlation drops sharply.

3.2. The Relations with Network Density and Degree Sequence

Besides network size, what is the relation with network density and degree? We will use in the entire trading networks with a variety of nested structure features to investigate relationship with statistical network density and degree, as shown in Figures 5(a) and 5(b). We find that not only network density but also network average degree (i.e., the mean of in-degree and out-degree) are all positively related to nestedness. That is to say, the larger the density of the network is, the stronger the nestedness of the network will be. The higher the average degree of the network is, the stronger the nestedness of the network will be.

Does that mean that nestedness can be expressed in terms of density and average degree? The significance of the nested pattern of the multilayer trading network is further examined by comparing it with two null models [38]. Here, the two null models (detailed introduction is also seen in Supplementary Information S4 and Figures S7 and S8). In the first one, the “swappable rows, swappable columns” (SS) null model conserves matrix dimensions (numbers of rows and columns) and fill. It works by shuffling elements at random within the matrix; however, it differs in that degenerating matrices (those containing rows/columns with no connections) are not permitted. In the second one, the “fixed in-degree, fixed out-degree” (FF) null model has subsequently been a popular choice for application to the nested networks, which preserves the degree distribution of the original network. The two links are randomly rewired but preserve the degree of the involved four nodes. This null model not only conserves the network density but also fixes degree of each node. After independently repeating 100 times of randomization for each commodity network, we adopt -score which is calculated by comparing to the nestedness scores of randomizations of the network. , where and are, respectively, the mean and standard deviation of nestedness across a sample of randomizations of the network.

We can observe the -score in Figures 5(c) and 5(d). The result of the model showed that the greater the nestedness is, the larger the value will be. It also reveals that value of extremely rare networks is closed to zero. These networks can be generated by model but their nested structure is likely to break. When the nested structure is obvious and value is also high, the network generated by model cannot generate the nested structure same as the real network. However, it is different in the model, as shown in Figure 5(d). We can find that the value of a few networks is close to 0, that is to say, the model can obtain the same nested structure as the original network. But most of the value is far greater than 0, indicating that the model cannot generate the same nestedness as the original network generally. In addition, overall, the greater the nestedness is, the larger the value will be, but the trend is weaker. In summary, the results indicate that degrees may determine the nested structure of a few networks, but the nestedness is not statistically determined by the network density and degree sequence.

3.3. The Relevance of Degree-Degree Correlations

Finally, we further expand the relationship between degree-degree correlations and nestedness. Let us first look at the definition of nestedness and degree-degree correlations. The degree-degree correlation is a standard tool for analyzing the network structure and has a simple interpretation [39, 40]. Degree-degree correlation ranges from to 1. If , high degree nodes tend to connect to other high degree nodes; otherwise, high degree nodes tend to connect to low degree nodes. A network exhibits nestedness if the neighborhood of a node is contained in the neighborhoods of the nodes with higher degrees. It is obvious that a network exhibits nestedness if the neighborhood of a node is contained in the neighborhoods of the nodes with higher degrees. The triangular structure thus is strongly correlated with negative degree-degree correlation. That is to say, if the nested structure is more obvious, the degree correlation value is more negative. Thus, according to the import and export relationship of the goods, the network has in-degree and out-degree. So, the degree-degree correlation could be of four types. We use the degree-degree correlation equation [39, 40] and the detailed illustrations are also seen in Figure S9:where let index be the degree type and and be the and degree of the source node and target node for the edge. is the number of links. Figure 6 shows the scatter plot of the nestedness versus four-type degree-degree correlation. One can see that the higher nestedness of a network indicates stronger negative degree-degree correlation, which is also confirmed in modeled scale-free nested networks [23]. It can be also seen that when the nestedness is small, a part of the network degree correlation is close to 0 or even positive. But when the nestedness becomes larger, it shows larger negative degree correlation, especially when the nestedness is higher than 0.2. Comparing the four types of degree correlations, we can see that the negative correlation between r (in, in) and nestedness is the most obvious, and r (out, out) is the least obvious. In addition, in a network with nestedness less than 0.2, r (in, in) and r (in, out) are more likely to be positive than r (out, in) and r (out, out).

4. Conclusions and Discussion

We conducted a systematic analysis of the significance of the nested structure in the multilayer commodity world trade networks. The analysis was from three aspects. Firstly, we simulated incomplete information and the effect of matrix size to measure nestedness. The results revealed that a small amount of data noise and network size did not affect the nested structure. Secondly, we adopted two null models to shuffle links. One could conserve the network density, the other could not only preserve the network size and density, but also keep node degree. We found that although density and degree were positively related to nested structure, they could not determine the nested structure. Finally, we further analyzed the relevance of degree-degree correlation. The results showed that the network with high nestedness has a significant negative degree-degree correlation. In summary, we believe that nested structure is closely related to the density, degree, and degree-degree correlation of the network, but nestedness is still a unique feature of the network.

Our work could enrich the statistical challenges of the prevalence of nestedness, and numerous extensions can be conducted. For instance, we mainly focused on the nestedness method named NODF, and could be asked what about the sensitivity of other nestedness detection methods [41]. In addition, we just used a commonly adopted metric to measure the nestedness of a network. How to design a better metric to capture nestedness more in social dynamics such as communication dynamics [42] is still worth studying. What’s more, our work could bridge the nested structure of the world trade network and its realistic economic and management significance. As we have shown with our illustration of nested structure which can characterize the world trade network, we could bring solutions to other economic systems in observation and modeling. The nestedness could provide a way of thinking about the structural characteristics of the realistic economic systems. There also are other open questions to ask for future study. For instance, we mainly focus on the prevalence of nestedness, but what are the mechanisms that lead to the triangular nested structure of the matrix remains unclear, and a temporal analysis of the networks, in such a way to allow us for observations of how the triangular nested structure of the trade relations have evolved over time. They could provide a great challenge to capture the complex structure of the interactions between a large number of economic agents, which brings solutions to a variety of issues in observation, modeling, and prediction in socioeconomic systems.

Data Availability

The datasets used to support the findings of this study are available from the website (https://comtrade.un.org/) or an alternative data source (http://atlas.media.mit.edu/en/resources/data/).

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The work was partially supported by the National Natural Science Foundation of China (Grant no. 61803137), Zhejiang Provincial Natural Science Foundation of China (Grant no. LY21F030019), Qiantang River Talents Plan (Grant no. QJD1803005), and Hangzhou Normal University Research Funding Project and Foundation of High Level Overseas Returnees (Team) in Hangzhou for Pioneering Innovation Program.

Supplementary Materials

Supplementary Materials provide the following specific details. S1: the step by step calculation of the nestedness measurement (NODF) (Figure S1). S2: the basic information of the world trade datasets including the trivial details (Figure S2). S3: the detailed illustrations of the network construction. Furthermore, we also analyze the effect of the network structure corresponding to 12 different products with different RCA thresholds including network density (Figure S3), network size (Figure S4), and nestedness (Figure S5). The visualization of the network of bovine, motorcycles, and medical instruments (Figure S6). S6: the two null models to check the significant nestedness of the inputted matrix and detailed description of two null models (Figures S7 and S8). S7: the detailed illustrations of the degree-degree correlation in the nested networks (Figure S9). (Supplementary Materials)