Abstract
Empirical analyses of supply chain networks (SCNs) in extant literature have been rare due to scarcity of data. As a result, theoretical research have relied on arbitrary growth models to generate network topologies supposedly representative of realworld SCNs. Our study is aimed at filling the above gap by systematically analysing a set of manufacturing sector SCNs to establish their topological characteristics. In particular, we compare the differences in topologies of undirected contractual relationships (UCR) and directed material flow (DMF) SCNs. The DMF SCNs are different from the typical UCR SCNs since they are characterised by a strictly tiered and an acyclic structure which does not permit clustering. Additionally, we investigate the SCNs for any selforganized topological features. We find that most SCNs indicate disassortative mixing and power law distribution in terms of interfirm connections. Furthermore, compared to randomised ensembles, selforganized topological features were evident in some SCNs in the form of either overrepresented regimes of moderate betweenness firms or underrepresented regimes of low betweenness firms. Finally, we introduce a simple and intuitive method for estimating the robustness of DMF SCNs, considering the loss of demand due to firm disruptions. Our work could be used as a benchmark for any future analyses of SCNs.
1. Introduction
Due to the increasingly complex and interconnected nature of the global supply chain networks (SCNs), recent research has focussed on modelling supply chains as complex adaptive systems using network science concepts [1, 2]. Following on from the work published by [3], who used network science techniques to generate a network topology and investigate its topological robustness, a large number of theoretical research papers have appeared in this area [4–12]. These studies have theoretically formulated generalizable growth mechanisms underlying the firmpartnering process in SCN formation. Subsequently, the network topologies generated based on various growth models have been investigated for their topological characteristics, such as robustness and efficiency.
Despite the large number of theoretical papers published within the past few years on network modelling of SCNs, the effort on empirical validation of the theoretical findings has been limited. This is mainly due to difficulty in obtaining largescale datasets on suppliercustomer relationships, which are often proprietary and confidential. Papers which systematically analyse the topologies of realworld SCNs, the conclusions of which can then be used to inform modelling efforts, have been relatively scarce.
In light of the above, this study presents a comprehensive analysis of two sets of SCN datasets, namely, (1)The dataset of Indian automobile manufacturers [13]. This SCN includes contractual relationships between various firms (therefore it is modelled as an undirected network).(2)Twentysix SCNs across various manufacturing industry sectors, based on the dataset presented in Willems [14]. These SCNs include material flows between firms (as such, it is modelled as a directed network). All the SCNs in this dataset include the full depth in terms of tiers (from suppliers to retailers).
In particular, this study is aimed at addressing the following key research questions pertaining to SCN topology and robustness: (1)What common topological characteristics (if any) can be expected from the directed material flow SCNs in the manufacturing sector?(2)What are the key differences (if any) in the topology of directed material flow and undirected contractual relationship SCNs?(3)Are there any selforganized features present in any of the SCNs?(4)Are there any correlations between node attributes and the topological features of nodes in the directed material flow SCNs?(5)How can we determine the robustness of SCNs, considering the inherent differences between the contractual relationship and material flow networks?
The remainder of this manuscript is structured as follows. Section 2 provides the background to this study and introduces key theoretical concepts in terms of network topological analysis. Section 3 describes the structure and limitations of each dataset considered, while Section 4 presents the data analysis methodology and results. Section 5 provides a discussion of the results obtained, and Section 6 concludes the paper.
2. Background
2.1. Topology of Undirected and Directed SCNs
The interfirm relationships in SCNs are commonly modelled using undirected links. However, the links between nodes in a SCN can include a direction, depending on the specific type of relationship being modelled. The interfirm relationships in a SCN can be broadly categorised into three classes, namely, (1) material flows, (2) financial flows, and (3) information exchanges. Material flows are usually unidirectional from suppliers to retailers, while financial flows are unidirectional in the opposite direction. Both material and financial flows mostly occur vertically, across the functional tiers of a SCN (however, in some cases, two firms within the same tier, such as two suppliers, could also exchange material and finances) [15]. In contrast, information exchanges are bidirectional (i.e., undirected) and include both vertical and horizontal connections (i.e., between firms across tiers and between firms within the same tier). Therefore, the same SCN can include different topologies based on the specific type of relationship denoted by the links in the model. For instance, unlike material and financial flows, SCN topology for information exchanges can exhibit shorter path lengths and high clustering due to a relatively larger number of horizontal connections [16].
Compared to undirected network representation, in directed networks, the adjacency matrix is no longer symmetric. As a result, the degree of a node in a directed network is characterised by both indegree and outdegree. On this basis, the degree distribution of directed networks is analysed separately for in and outdegrees. Also, unlike undirected networks, in directed networks the distance between node and node is not necessarily the same as the distance between node and node . In fact, in directed networks, the presence of a path from node to node does not necessarily imply the presence of a path from node to node [17]. This has implications on node centrality metrics, such as closeness and betweenness. In addition, many dynamics such as synchronizability and percolation are different in directed networks compared to undirected networks [18, 19]. Therefore, when modelling SCNs, it is important to first identify the specific type of relationship denoted by the links, so that the network can be correctly represented as undirected or directed.
In this paper, we consider two types of SCNs, namely, (1) the SCN of Indian automobile manufacturers, which is modelled as an undirected network (referred to as the undirected contractual relationship SCN or UCRSCN throughout this paper) and (2) twentysix material flow SCNs across various manufacturing industry sectors, based on the dataset presented in Willems [14], which are modelled as directed networks (referred to as the directed material flow SCN or DMFSCN throughout this paper).
2.2. Characterising SCNs Using Network Science Metrics
Mathematical analysis of a network requires it to be represented through an adjacency matrix (). An element of the adjacency matrix , for an undirected network, is given as
In contrast, an element of the adjacency matrix, for a directed network, is given as
The key difference between the adjacency matrix of an undirected and a directed network is that the adjacency matrix of an undirected network has two entries for each link (i.e., the adjacency matrix of undirected networks is symmetric). Therefore, the total number of links () for an undirected network is calculated as , while for a directed network it is calculated as .
In this paper, we have modelled all SCNs as unweighted networks where nodes represent individual firms. For the UCRSCN, the links represent undirected contractual relationships between firms, while in the DMFSCNs, the links represent the directed material flows between firms.
The metrics used to characterise the topology of complex networks can be classified into node and networklevel metrics (see Costa et al. [20] and Rubinov and Sporns [21] for a comprehensive range of measurements used for characterization of complex networks). Table 5 in Appendix A presents the list of networklevel metrics used for analysis in this study, and their implications within a SCN context.
Nodelevel metrics characterise, in various ways, the importance of a particular node for the functionality of the overall network, based on its embedded position in the broader relationship network (known as the centrality of the node). Depending on the context, various centrality measures can be adopted to identify the key players of a given network. Table 6 in Appendix A presents the list of nodelevel metrics used in this study, and their implications within a SCN context.
2.3. DataDriven Studies
Even though most theoretical modelling efforts of SCNs have focussed on variants of preferential attachment to generate network topologies for analysis [22], there have been some studies which have adopted a datadriven approach. For instance, Kim et al. [23] have undertaken a node and networklevel topological analysis, using three case studies of automotive supply networks (namely, Honda Accord, Acura CL/TL, and Daimler Chrysler Grand Cherokee) presented by Choi and Hong [24]. Although the SCNs used in this study are complete, the SCNs are small in size (largest network includes only 34 firms), which limits the observations of emergent network topological properties. Kito et al. [25] have constructed a SCN for Toyota using the data available within an online database operated by MarkLines Automotive Information Platform. By analysing the SCN topology, the authors have identified the tier structure of Toyota to be barrelshaped, in contrast to the previously hypothesized pyramidal structure. Another fundamental observation reported in this study is that Toyota SCN topology was found to be not scalefree (even with finitesize effects taken into account). Although the dataset used in this study is sufficiently large (with 3109 firms), it is limited to only the top three tiers of the overall SCN.
More recently, using Bloomberg data, Brintrup et al. [26] and Orenstein [27] have undertaken topological analysis of various SCNs. Brintrup et al. [26] have studied the SCN of Airbus and have reported that this SCN displays assortative mixing and communities based on geographic locations of the firms. Orenstein [27] has undertaken topological analysis of retail and food industry SCNs by considering the suppliers within the top three tiers. The SCNs considered in this study were found to have scalefree topologies with degree exponents below 2. Although the dataset used in this study is sufficiently large and allows observation of temporal variations to the SCN topology, consideration of only a part of the SCN depth in terms of tiers has limited the generalizability of the results. It is noted that the key limitation in using the Bloomberg database, for constructing SCNs, is that the data are not exhaustive since the database only includes publicly listed firms. Therefore, the SCNs constructed using Bloomberg data may only provide a part of the full picture.
Although the above studies have provided a number of key insights about the topological structure of various SCNs, no study to date has systematically investigated a large collection of directed material flow SCNs in various industry sectors and compared the results against an undirected SCN of contractual (interfirm) relationships.
By considering a collection of twentysix SCNs from the manufacturing industry, our study is able to investigate and establish the general topological properties of these SCNs. This effort will complement the large body of theoretical literature on modelling SCN topologies through various growth models, by revealing what specific topological characteristics are needed to be captured in an appropriate growth model. In addition, the correlation analysis presented in this study, between various nodelevel centrality measures and two exogenous factors (stage cost and stage time), can be powerful in demonstrating how the position of firms can influence the overall functionality of the SCN (and vice versa).
Finally, this study has used the reliable dataset provided in published work by Willems [14] in relation to the DMFSCNs. The data collection procedure for the UCRSCNs is detailed in Section 3.1.1. Our study offers distinct insights from previous studies because (i) it is based on a large collection of realworld SCNs belonging to the manufacturing sector, (ii) most of the SCNs are large (have a relatively high number of nodes) so that various emergent properties can be sufficiently demonstrated, (iii) we extract and compare the topological properties of SCNs where links represent either the directed material flows or undirected contractual relationships, (iv) our correlation analysis of node centrality measures with exogenous factors in DMFSCNs provides insights into the impact of the position of firms in the dynamics of SCNs, and (v) we investigate the selforganization in all SCNs, which helps us identify nonrandom features of these systems. Thus, this study is unique on several counts from previous studies described above.
2.4. Generation of Null Models for Hypothesis Testing
An important question when testing hypothesis related to network topologies is whether the degree distribution on its own is sufficient to describe the structure of a network, i.e., whether the topological features observed in the network are explained by the ensembles of networks generated by its degree distribution while preserving the degree vector. In this regard, degreepreserving randomisation (DPR) plays an important role in generation of null models.
DPR involves rewiring the original network, to generate an ensemble of null models, while preserving the degree vector [28, 29]. At each time step, the DRP process randomly picks two connected node pairs and switches their link targets. This switching is repeatedly applied to the entire network until each link is rewired at least once. The resulting network represents a null model where each node still has the same degree, yet the paths through the network have been randomised. Comparison of properties of a given network, with the properties of an ensemble of networks generated by DPR, allows one to identify if the properties observed in the real network are unique and meaningful or whether they are common to all networks with that degree sequence [30].
In this study, we use DPR to investigate the presence of selforganized topological features in the undirected contractual relationships SCN. For the DMFSCNs, we introduce a novel DPR procedure to generate null models by preserving both the degree of nodes and the numbers of links present between adjacent tiers.
2.5. Robustness of SCNs
Past studies have gained insights into the robustness of SCNs by using (1) analytical measurements available in network science, such as network centralisation, percolation threshold, and assortativity, or (2) simulations to investigate how various networklevel metrics are affected when nodes are sequentially removed, either randomly (known as random failures) or based on degree (known as targeted attacks) [3, 8, 11, 12].
While the above methods may be suitable for obtaining highlevel insights into the robustness structure of UCRSCNs, a more specific method needs to be formulated for assessing the robustness of tiered DMFSCNs. In particular, this method should consider both the topological structure of the network and its ability to meet the consumer demands. In this regard, we introduce a simple and intuitive method to investigate the robustness of DMFSCNs, considering the demand at the retailer nodes.
3. Data Sources, Structure, and Limitations
3.1. Data Sources and Structure
3.1.1. Undirected Contractual Relationship SCN
The undirected contractual relationship SCN consists of customersupplier contractual linkages in the Indian automotive industry. The network is constructed from the citations of the autocomponent firms about the firms they offer their products and services to (in this regard, a firm could be a supplier to one firm and a customer of another firm). The list of firms is taken from the Automotive Component Manufacturers Association (ACMA) of India annual publication “Buyers Guide.” The data corresponds to the year 2001–2002.
The basic topological analysis of this data (excluding the robustness analysis and selforganization analysis presented in this paper) was presented in Parhi [13]. Although this data was originally created from a 2002 directory, it was compared with recent records as a part of a study titled “Dynamics of Distribution and Diffusion of New Technology” [31] and the authors demonstrated that there is no qualitative difference in the industry structure compared to the 2002 dataset. In fact, the actors and their topography in the supply chain in the Indian Automotive Industry has remained more or less the same over the years (note that this dataset only includes the organized sector of the Indian automotive industry), where the only change has occurred with regard to the dynamics of the interactions, thanks to the fast integration of the information system in a highly integrated world.
3.1.2. Directed Material Flow SCNs
Willems [14] provides a dataset of realworld multiechelon supply chains, used for inventory optimization purposes. The overall dataset includes a total of 38 multiechelon supply chains, from various industries. The chains described in this paper comprise actual supply chain maps created by either company analysts or consultants. Since these maps have been implemented in practice, they demonstrate how users have modelled actual supply chains.
The abovementioned dataset includes the following key information: (1)The industry sector of each supply chain network(2)For each supply chain network: (i)The stages (nodes) representing each firm involved(ii)The arcs (links) representing precedence relationship between stages(3)For each stage: (i)Its classification and tier based on its function within the overall supply chain(ii)The direct cost added at the stage (stage cost)(iii)The average processing time at the stage (stage time)(iv)The average daily demand at each retailer stage
Note that in the network models developed in this paper, we denote stages (i.e., firms) as nodes and the arcs between stages (which represent the precedence relationships between firms) as directed links.
From the original dataset, networks with more than 100 firms (i.e., nodes) were selected for our analysis, and there were twentyeight such large networks. Smaller networks were omitted in this analysis since they do not offer any interesting insights into emergence of various complex topological features. Then, using the industry sector information, these SCNs were categorised into six main groups as illustrated in Table 1. As can be seen, the set of SCNs considered by us vary in size (with a minimum of 108 to a maximum of 2025 nodes).
3.2. Limitations of the Datasets
3.2.1. Undirected Contractual Relationship SCN
The SCN data for the Indian automotive industry covers the organized sector of the industry, as there is no comprehensive database on the unorganized segment. The list of customers includes only the firms that have been selfreported by the autocomponent firms as their principal customers.
3.2.2. Directed Material Flow SCNs
A key limitation of the available dataset is the lack of information in relation to the geographical locations of individual firms. This information was not provided in the original dataset in Willems [14] due to confidentiality reasons. Unlike the virtual networks (such as WWW or social networks), the SCN structure is largely influenced by geographical aspects (since the congregation or dispersion of the suppliers depends on the raw material distribution over various geographic regions). Therefore, if geographic location information was available, indepth conclusions could have been made about various observed structural features of the SCNs.
In addition, this study is unable to investigate the dynamic nature of the SCNs since the dataset does not provide any information pertaining to temporal changes in the SCN topology. Lastly, the relationship strength between firms is not captured in the dataset in terms of the amount material flow. Although specific production capabilities of firms within each tier are known, no information is available in relation to how much each upstream firm supplies to the downstream firms.
Nevertheless, the size of the dataset, both in terms of the number of networks available and the size of each network, as well as the cost and time data associated with nodes make this a very attractive dataset to study.
4. Data Analysis Methodology and Results
4.1. Topological Results
Using each dataset, we constructed SCNs, where the nodes represent the individual firms and the links represent either the undirected contractual relationships or the directed material flows between firms (see Figure 1). Cytoscape software and JAVA programming language were used to visualise and analyse the SCNs. The results are presented in the following subsections.
4.1.1. Undirected Contractual Relationship SCN
Table 2 presents the networklevel topological features of the UCRSCN of Indian automobile manufacturers.
4.1.2. Directed Material Flow SCNs
Table 3 presents the topological features of the 26 directed material flow SCNs. The three key observations that can be made from the table are that (i) the degree distributions of the majority of SCNs indicate good correlations against the power law fit, and most of them have degree exponents in the range of 1 to 3, (ii) most SCNs are disassortative, in terms of degree as well as stage cost and stage time, and (iii) none of the SCNs indicated clustering (not presented in the table), due to the links only being present between functional tiers. These observations have important implications, which are discussed in more detail in Section 5.
In addition, for this dataset, we have investigated the nodelevel centrality metrics. Since nodelevel metrics themselves provide information about individual nodes rather than networks as a whole, here we choose to primarily study correlations between them and other nodelevel attributes available in the dataset. In particular, we studied the correlation coefficients between the centrality metrics for each node (namely, in and outdegree centrality, betweenness centrality, and closeness centrality) and its corresponding (1) stage cost and (2) stage time. The correlation plots are presented in Appendix B. The results of this assessment are discussed in detail in Section 5.
4.2. Identifying SelfOrganized Topological Features Using DegreePreserving Randomisation
DPR can help establish whether or not the observed topological property in a network is simply an artefact of the network’s inherent structural properties or a property unique to the nodes. Comparison of the original betweenness and closeness centrality distributions with the average distributions of the same metrics, for an ensemble of randomised (through DPR) networks, can reveal whether the centrality distributions observed in the original network are structural or not. In particular, if the original (i.e., observed) and randomised trends are identical, then the centrality distributions observed in the original network are purely structural, i.e., they can entirely be explained by the degree distribution without attribution to any other external mechanism. However, if the original centrality distribution diverges from the average trend obtained for the randomised ensembles, there is an underlying mechanism which induces this deviation in the original network [17].
Although the divergence between the observed centrality distribution and the randomised ensemble average is generally found through visual inspection of the plots [17, 32], in this study we have used the KolmogorovSmirnov test (KS test) to establish any statistically significant deviations. The KS test is a powerful statistical test that allows one to compare two distributions (the null hypothesis of the test is that there exists no difference between the two distributions). Importantly, the KS test does not make any assumption about the underlying distribution of the data (i.e., it is a nonparametric and distributionfree test statistic), thus allowing comparisons between arbitrary distributions.
4.2.1. Undirected Contractual Relationship SCN
For the UCRSCN, we applied DPR where, at each time step, two connected node pairs are picked and their links targets switched. This switching is repeatedly applied to the entire network until every link is rewired at least once (without allowing the creation of selfloops or multilinks between node pairs). The resulting network represents a null model where each node still has the same degree, yet the paths through the network have been randomised. Using this process, we generated 1000 randomised networks. We have then compared the betweenness and closeness centrality distributions of the original network with the average betweenness and closeness values obtained for the 1000 randomised networks (see Figure 2).
(a)
(b)
4.2.2. Directed Material Flow SCN
For the DMFSCNs, we have developed a modified DPR procedure, which we refer to as the tier constrained DPR (TCDPR). At each time step, this process picks a pair of links which lie across the same two tiers and swaps their target nodes. In particular, the following algorithm has been used.

The above process is repeatedly applied to the original network until every link is rewired at least once. By choosing link pairs between the same two tiers and swapping their targets, in addition to preserving the degree of nodes, we also preserve the number of intertier links. Therefore, the resulting network is a null model whose degree distribution and tier structure are identical to the original SCN.
Using the TCDPR process, we generated 1000 randomised networks. We have then compared the betweenness and closeness centrality distributions of the original network with the average betweenness and closeness values obtained for the 1000 randomised network ensembles. The values obtained for the KS test for each SCN is presented in Table 4. Figures 3–6 illustrate some scenarios where selforganization in terms of betweenness and closeness was identified.
4.3. Robustness Analysis
In this section, we investigate the robustness of the UCRSCN. Subsequently, a simple and intuitive methodology for assessing the robustness of tiered DMFSCNs with demand considerations is introduced.
4.3.1. Undirected SCN
Since the UCRSCN dataset does not include any node attributes, we resort to the generic topological robustness analysis technique commonly employed in network science literature. In particular, we remove nodes, either randomly (to simulate random failures) or sequentially based on their degree (to simulate targeted attacks). In each iteration, we measure the size of the largest connected component (LCC) of the network. As nodes are successively removed, the overall network disintegrates into numerous subnetworks. The number of nodes in the LCC of the fragmented network therefore provides insights into its structural integrity in terms of overall connectivity.
Figure 7 illustrates the topological robustness assessment results. The size of the LCC has been plotted against the percentage of removed nodes, under random and targeted node removal scenarios. Note that for the random node removal scenario, the result has been obtained by averaging 100 runs of the simulation.
4.3.2. Directed SCNs
Unlike the UCRSCN, the Willems [14] dataset for DMFSCNs includes average daily demand levels at retailer nodes. Therefore, in order to more accurately assess the robustness of the DMFSCNs, we develop a simple and intuitive methodology. In particular, we establish the robustness of each network as a function of lost average daily demand, as firms are sequentially removed. In developing the robustness assessment method, we assume that the firms within the same tier have substitutable capabilities with no capacity restrictions, i.e., the supplies lost due to failure of one firm can be fully replenished by another firm in the same tier.
For instance, consider the hypothetical DMFSCN scenario presented in the Figure 8. In order for retailer B to satisfy its average daily demand, it relies on three supply chains, namely, (1) BFJN, (2) BFJM, and (3) BFIM. Now consider a scenario where the manufacturing firm J is removed from the network—in this case, retailer B will lose two out of the three supply chains it relies on to satisfy its average daily demand. However, retailer B will still be able to satisfy its average demand through the supply chain BFIM, which is not affected by removal of firm J. Therefore, as long as the retailer node has access to an upstream most supplier through a complete supply chain, it will be able to satisfy its average daily demand. However, when firm J is removed, retailer D will not be able to satisfy its average daily demand—since both supply chains for D (DGJM and DGJN) are reliant on firm J. In this regard, it is important to note that removal of any retailer node implies that it will not be able to satisfy its respective average daily demand.
Based on the above idea, we develop a robustness metric termed “Robustness Score” which is defined as follows: where is the set of retailers in the SCN and is the average daily demand at retailer r. captures the availability of paths to the upstream most suppliers at each retailer and is 1 if there exists at least one path connecting the retailer to an upstream most supplier and 0 otherwise.
We applied the above idea of Robustness Score to the SCN ref 20 (powerdriven handtools), where nodes were removed randomly and sequentially based on their degree (i.e., targeted attacks). For each scenario, after the removal of each node, the drop in Robustness Score was recorded to generate a profile. This result is presented in Figure 9.
5. Discussion
5.1. Topological Structure of the SCNs
The degree distributions for the majority of the DMFSCNs indicate a good fit with power law. In particular, 14 out of the 26 networks analysed display 80% or higher correlation with a powerlaw fit for both their in and outdegree distributions The degree exponents of all DMFSCNs were generally found to lie in the range of 1 to 3: note that is the boundary between hub and spoke () and scalefree () network topologies. Similar findings were observed for the UCRSCN, which displayed a 95% correlation with a powerlaw fit for its degree distribution with a degree exponent of 2.31.
These results are in good agreement with the empirical findings on SCNs reported in recent datadriven studies [26, 27, 33, 34], which indicate the topologies of SCNs tend to have degree distributions which can be satisfactorily modelled by power law, with degree [22].
Although a number of past theoretical studies have relied on the BarabásiAlbert (BA) model to generate topologies representative of SCNs [3, 10, 12], based on our results, it is evident that the BA model cannot sufficiently explain the intricacies of realworld SCN topologies. In particular, the BA model generates network topologies with [35], while the realworld SCNs indicate in the range of 1 to 3. Also, the assortative (or disassortative) mixing observed in real SCNs is not a feature of networks generated by the BA model, as shown analytically (in the limit of large network size) by Newman [36]. Finally, the BA model cannot generate networks with pronounced community structure which has been observed in real SCNs, since all nodes in the network belong to a single weakly connected component [37].
Indeed, a range of network growth models are available in extant literature, and broadly speaking, they can be categorised as either evolving models or generative models [17]. The evolving models are aimed at capturing the microscopic mechanisms underlying the temporal evolution of a network topology. In this regard, the BA model can be regarded as an evolving model. In contrast, generative models can be used to generate a snapshot of a network topology. Since the SCNs may have evolved based on various nongeneralizable principles, rather than attempting to understand and model the underlying growth mechanism through an evolving network growth model, it would be beneficial to simply mimic the observed topologies from datadriven studies using a generative network model.
In this regard, fitnessbased generative models have recently gained prominence in theoretical research ([38–40]; Smolyarenko, 2014; [41]). In fitnessbased models, the fitness distribution and the connection rules are given by a priori arbitrary functions, which enable a considerable amount of tuning (Smolyarenko, 2014). Indeed, this tunability makes such models a useful and practical modelling tool. For example, Ghadge et al. (2010) have proposed a purely statistical method for generating a range of network topologies by randomly allocating the nodes with fitness values sampled from a lognormal distribution. The propensity of each node to attract links is determined proportionally to its fitness. This method is referred to as the lognormal fitness attachment (LNFA), and it includes a tunable parameter (the shape parameter of the lognormal distribution), which can be manipulated to generate a large spectrum of networks. At one extreme, when is zero, all nodes have the same fitness and therefore at the time a new node joins the network, it chooses any existing node as a neighbour with equal probability, thus replicating the random graph model. On the other hand, when is increased beyond a certain threshold, very few nodes will have very large levels of fitness while the overwhelming majority of nodes have extremely low levels of fitness. As a result, the majority of new connections will be made to a few nodes which have high levels of fitness. The resulting network therefore resembles a monopolistic/“winnertakeall” scenario, which can sometimes be observed in the real world. Between the above two extremes (random and monopolistic) lies a spectrum of power law networks which can closely represent many real SCNs (Ghadge et al., 2010). Nguyen and Tran [42] have illustrated that the LNFA model can indeed generate network topologies with in the range of 2–3 (and beyond), which can realistically represent the observed SCN topologies. Recent work by Bell et al. [43] has shown that a riskaverse behaviour by SCN firms can lead to a fitnessbased network growth.
It is also interesting to note that all the DMFSCNs indicate relatively lower network centralisation values llustrating the largely distributed and decentralised nature of modern SCNs. This lower centralisation could also be due to the recent supply chain practice known as modular assembly, where manufacturers obtain preassembled modules from a reduced base of suppliers (such as through intermediate subassemblers), in contrast to the traditional approach in which individual components are procured and assembled by the manufacturer [44]. In general, the SCNs which involve more complex and specialised manufacturing and assembly processes (such as aircraft parts, computer equipment and, electrical and farm machinery) were found to be more centralised than the other SCNs such as cargo and chemicals (which is also indicated by its low level of heterogeneity).
The network centralisation for the UCRSCN was found to be higher than the average network centralisation for the DMFSCN dataset. This could be due to the inherent differences between the two types of the SCNs, i.e., the contractual relationships are generally centralised through a leader firm whereas the flow networks generally link firms who wish to exchange material/goods.
In terms of degree assortativity, majority of the DMFSCNs (21 out of 26) were found to be slightly or strongly disassortative, where the highly connected hubs tend to avoid each other, instead linking to lowerdegree nodes. As a result, the network structure of these SCNs tends to display a hubandspoke character (as opposed to core periphery structure observed in assortative networks). Some SCNs in industries such as “Perfumes, Cosmetics, and Other Toilet Preparations” and “Semiconductors and Related Devices” were strongly disassortative. We found no SCN which was strongly assortative. Additionally, the UCRSCN was also found to be disassortative, in terms of node degree. This type of disassortative mixing has been observed commonly in economic systems [17] where trade typically takes place between individuals or organizations of different skills and specialities.
An unfavourable implication of the disassortativity observed in the SCNs (in terms of degree) is that since high degree nodes are less connected to one another, many paths between nodes in the network are dependent on high degree nodes. Therefore, failure of a high degree node in a disassortative network would have a relatively large impact on the overall connectedness of the network [29]. On the other hand, disassortative networks are generally resilient against cascading impacts arising from targeted attacks—since hub nodes are not connected with each other, the likelihood of disruption impacts cascading from one hub node to another is minimised [45].
Furthermore, the DMFSCNs mostly displayed slight or strong disassortative tendencies in terms of stagecost and stagetime attributes (20 out of 26 SCNs were found to be disassortative in terms of stage cost and stage time). That is, firms which contribute high stage cost are on average more likely to be connected with firms that contribute low stage cost, and vice versa, and the same is true for stage time. No SCN that we studied displayed a strong assortative tendency in terms of these attributes. The strictly tiered structure of these SCNs could be responsible for this stagecost and stagetime disassortativity as it separates the functional capabilities of firms which are linked with each other between tiers.
An important difference between the UCRSCN and the DMFSCNs was observed in their clustering properties. The clustering coefficient indicates the degree to which firms in a SCN tend to cluster together around a given firm. For example, it can indicate how various suppliers behave with respect to the final assembler at the global level [23]. Therefore, the higher the clustering coefficient, the more dependent suppliers are on each other for production [46]. The UCRSCN indicated low levels of clustering while none of the DMFSCNs indicated any clustering. It is noted that in the DMFSCN dataset considered, no horizontal connections were observed (i.e., no connections between firms within the same functional tier). Due to this inherent structural limitation, which prohibits triadic closure, these SCNs do not indicate any clustering (therefore, clustering values are not reported in Table 2). This implies that the suppliers in the material flow networks are independent of each other (since they are likely to be competitors). While this structure may limit diffusion of knowledge through the SCN, it has favourable implications in terms of network robustness, since disruptions to one supplier is unlikely to impact another. Hearnshaw and Wilson (2010) note that SCNs with a low clustering coefficient are likely to experience a more opportunistic behaviour and less collaboration which may lead to system inefficiencies due to the difficulty in systemwide coordination. Therefore, SCN efficiency can be improved by creating new horizontal connections, such as through social relationships. Choi and Wu [47] report a realworld example of such a scenario where Honda has built clustering in their supply chain by directing and facilitating the relationships of its firsttier suppliers with some secondtier suppliers.
5.2. Correlation between Topological Centrality Metrics and Other Firm Attributes in Directed Material Flow SCNs
For the DMFSCNs, node centrality correlation assessments were carried out against node attributes. In general, for the majority of SCNs (regardless of the industry), we observed the following: (1)The degree of firms was found to correlate positively with the stage cost while no correlation was identified with the stage time. This implies that the firms with higher number of connections generally add higher levels of direct costs to the market price of the final product. (i)The indegree of firms was found to correlate positively with the stage cost and negatively with stage time. This was particularly evident for the SCNs in aircraft engine and electrical industries. It implies that in these SCNs, the firms with higher number of upstream suppliers tend to add higher levels of direct costs to the market price of the final product and they tend to have relatively lower average processing times (these are firms which play the role of assemblers, which bring together many components from various suppliers for assembly purposes).(ii)With the exception of SCNs in aircraft engine and electrical industries, the outdegree of firms was found to correlate positively with the stage cost and the stage time, suggesting that the firms with high number of downstream customers (such as major distributors) tend to add higher levels of direct costs to the market price of the final product and they tend to have relatively higher average processing times. This was particularly evident for the two SCNs in the transportation of freight and cargo industry.(2)The betweenness and closeness centralities of firms were found to correlate positively with their stage cost, which implies that those firms which are more involved in the relationships between other pairs of firms and those firms which are active information generators tend to add higher values to the final product.(3)The SCNs in electrical, computerrelated, farm machinery and equipment, and aircraft engine industries indicated negative correlations between the betweenness centrality and stage time. This implies that, in these SCNs, the firms which are involved in the relationships between two other firms tend to require lower average processing times. Such central firms generally play the role of assemblers or distributors, thus requiring less processing times.
The above insights demonstrate how, from a SCN point of view, the position of an individual firm with respect to the others can influence both strategy and behaviour [48].
5.3. SelfOrganized Features
5.3.1. Undirected SCN
Comparison of the betweenness and closeness centrality distributions of the original network with the average betweenness and closeness distributions obtained for the 1000 randomised networks (shown in Figure 2) reveals that there exists a significant difference (KS test value is <0.05) between the respective centrality distributions in the observed SCN and the average for the naturally occurring scheme.
In terms of betweenness centrality, we observe that moderate betweenness firms were overrepresented in the observed SCN compared to the average of the randomised ensembles. Therefore, we can conclude that the UCRSCN topologies are nonrandom and favour moderate betweenness firms. A similar result was reported by Becker et al. [32] who constructed a manufacturing system network model from realworld data (where nodes represent separate work stations and links represent material flows between work stations). By applying the DPR process to generate an ensemble of networks with the same degree distribution, they observed that work stations with a particularly high betweenness centrality are overrepresented in the manufacturing system studied. They concluded that the manufacturing system topology is therefore nonrandom and favours the existence of a few highly connected work stations. Betweenness of a firm in the context of a UCRSCN indicates the extent to which it can intervene over interactions among other firms in the SCN by being a gatekeeper for relationships [23]. Indeed, this has specific advantages in a networked economy, since it enables the firms to acquire more market intelligence and control by playing the role of an intermediator. Therefore, it is reasonable to see firms selforganizing themselves in order to increase their betweenness in the SCN.
In relation to closeness centrality, we observed that all firms in the UCRSCN included lower closeness centralities when compared with the average of the randomised ensembles. Closeness indicates the proximity of a given firm with respect to others in the network. Complex manufacturing industries, such as the automobile manufacturing sector indicated in the UCRSCN, include longer supply and service chains which can place the firms peripherally and far away from each other (as indicated by the network diameter of 7 for the UCRSCN, which is higher than the majority of the DMFSCNs).
5.3.2. Directed SCNs
Our results indicate that only some DMFSCNs include selforganized features in terms of betweenness and closeness. Those SCNs with a value ≤0.05 for the KS test, as outlined in Table 4, indicate that there exists the significant difference between the centrality distributions of the observed SCN and the average of the randomised network ensembles. This implies that in these SCNs, there exists an external mechanism beyond the degree distribution, which has driven the firms to adopt various centrality levels. In general, as illustrated in Figures 3 and 4, comparison of observed betweenness profiles against the average of randomised ensembles reveals that firms with low betweenness are underrepresented in the observed DMFSCNs. This is an interesting observation which complements the findings obtained for the UCRSCNs which indicated the moderate betweenness firms to be overrepresented. This implies that in material flow SCNs, the firms need to maintain a certain level of betweenness to function (as evident from the plateau of low betweenness nodes in the profiles of observed SCNs presented in Figures 3 and 4).
In contrast, comparison of observed closeness profiles against the average of randomised ensembles reveals that firms with moderate closeness are overrepresented in the observed DMFSCNs (Figures 5 and 6). While it is closely related to betweenness centrality, closeness is more relevant in situations where a firm acts as a generator of information rather than a mere mediator/gatekeeper. For example, due to various hindrances, the market demand information can easily be distorted when it flows from the downstream firms towards upstream firms. Such distortions can lead to undue deviation between production plans of manufacturers and supply plans of suppliers, leading to a phenomenon known as the bullwhip effect. Firms with high closeness centrality levels therefore play a major role in sharing the actual market demand information with upstream firms in the SCN, thus diminishing the adverse impacts arising from the bullwhip effect [9]. This could be a reason why the firms in the DMFSCNs have selforganized to have higher than random levels of closeness in the moderate closeness regime.
5.4. Robustness Character
5.4.1. Undirected SCN
The point identified as in Figure 7 indicates the critical threshold at which the LCC disappears and the network is fully fragmented into individual nodes. As can be seen from this figure, the network breaks apart relatively rapidly when the nodes are removed sequentially by targeting higher degree ones first, compared with random node removals. It is evident that occurs at 60.93% while occurs at 95.04%.
It is interesting to investigate how different the observed robustness (against random node removals) of the network at hand is, when compared against a randomly wired network of the same size (in terms of the number of nodes and links). This question can be answered using the MolloyReed criterion which identifies the presence of a LCC within a network (Reed, 1995). This is on the basis that for a network to have a LCC, most nodes that belong to it must be connected to at least two other nodes [17]. In particular, the MolloyReed criterion states that a network has a LCC if where and are the first (mean) and second moment of the network’s degree distribution, respectively.
Based on the above, networks with lack a LCC and are composed of many disconnected clusters. Applying the insight provided by the MolloyReed criterion to a network with an arbitrary degree distribution, one can predict the fraction of nodes required to be randomly removed from the network in order to destroy its LCC (i.e., the critical threshold, ) as follows [49];
In contrast, the critical threshold of an Erdős–Rényi network (i.e., a randomly wired network) is given by [17]
A network is considered to display enhanced robustness if its critical threshold is higher than that of a randomly wired network of the same size (in terms of the number of nodes and links). The UCRSCN is characterised by and . Using the above formulae, we can determine to be 0.975 and to be 0.895. This implies that, in order to destroy the LCC of the UCRSCN by fragmenting it into many disconnected components, one would need to remove 97.5% of the nodes (note that this theoretical prediction is generally in agreement with of 95.04% established through the average of 100 simulations). Also, since , we can conclude that the UCRSCN displays enhanced robustness against random failure of firms. Indeed, the enhanced robustness of this SCN against random firm removals manifests owing to its hub structure. Random node removals, by definition, affect nodes irrespective of their degrees. Since scalefree networks, such as the UCRSCN considered here, comprise predominantly of less connected nodes and a few hubs, the chance of randomly removing a hub is almost negligible. Therefore, random node removals are likely to affect mainly the less connected nodes, which, although numerous, play a limited role in maintaining a network’s integrity [17].
5.4.2. Directed SCN
For the DMFSCNs, we demonstrate a simple and intuitive method which considers the impact of firm failure on the output capability of the SCN. This method can be used by practitioners to establish the robustness of their SCNs and also to compare the robustness of various SCN systems against each other.
Since the above robustness concept for the DMFSCNs depends on the demand profiles at the retailers, it may not provide generalizable insights. Therefore, we have not attempted to investigate the robustness character of the full dataset. However, it is evident from the robustness profile presented for the reference SCN 20 (powerdriven hand tools) that the targeted removal of nodes based on their degree has a drastic impact on the overall network robustness—only 5% of the firms need to be removed for the entire SCN to be incapable to meet any demand at the retailers. In comparison, the SCN is generally much more robust against random removal of firms—about 70% of the firms need to be removed randomly, before the SCN is incapable of satisfying any demand at the retailers. The above result highlights the importance of hub nodes, through which the majority of the paths traverse. Therefore, the removal of these nodes will have significant impacts on the ability of SCN to meet the demands.
Sheffi and Rice [50] note the importance of building flexibility and redundancies into SCNs as a way of improving the robustness of these systems. In this regard, parallel supply paths with minimal dependencies could be incorporated into SCNs, so that a disruption in one firm does not impact the operations of the other.
6. Conclusions and Future Directions
In general, in both DMF and UCR SCNs, we observed degree distributions which conform to power law indicating the existence of hub/leader firms. In both types of SCNs, disassortative mixing was observed in terms of the degree of firms. Interestingly, majority of DMF SCNs also showed disassortative mixing in terms of firm cost and time attributes. Since DMF SCNs are characterised by various functional tiers, no clustering was evident in these systems, while some level of clustering was identified in the UCR SCN. The node centrality correlation assessment carried out against firm attributes (cost and time) for DMFSCNs reveals a relationship between the position and the function of firms within the system.
Additionally, we identified that the UCR SCN included enhanced topological robustness against random firm failures and selforganized topological features in terms of both betweenness and closeness of firms. However, selforganized features were only present in some of the DMF SCNs.
Finally, we have developed a simple and intuitive method for assessing the robustness of DMF SCNs considering the demand loss at retailers as firms are removed. This analysis outlines the importance of hub firms, through which the majority of the supply paths traverse. Therefore, the removal of these nodes will have significant impacts on the ability of SCN to meet the demands. The robustness score concept can indeed be used to identify the most critical firms in a DMF SCN, which will provide insights beyond the purely topologybased centrality metrics. Future research could investigate the application of TCDPR we introduced here to identify the configuration, of a particular tiered SCN, which will maximise the robustness score by allocating supply paths based on demand levels at retailers.
Our work for the first time attempted to generalize the topological features of a large number of SCNs from the manufacturing sector. It is notable that since we only considered relatively large networks, finite size effects are minimal. While some topological features were indeed networkspecific, the topological similarities between the networks were striking. Therefore, this work could be used as a benchmark for developing generalized growth mechanisms for SCNs in future.
Appendix
A. Network and NodeLevel Metrics
B. NodeLevel Centrality Correlations with Stage Time and Stage Cost
B.1 Correlation Analysis for Total Degree.
B.2 Correlation Analysis for InDegree.
B.3 Correlation Analysis for OutDegree.
B.4 Correlation Analysis for Betweenness Centrality.
B.5 Correlation Analysis for Closeness Centrality.
Data Availability
The DMF SCN dataset is publicly available under the published work of Willems [14]. The adjacency matrix for the UCR SCN can be provided upon request by the corresponding author.
Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Acknowledgments
The authors acknowledge the Australian Research Council (ARC) for funding this work under grant DP140103643.