Abstract

Network efficiency analysis becomes important in railways in order to contribute towards improving the safety and capacity of the rail network, making rail travel more attractive for passengers, and improving industry practice and informing policy development. However, a physical railway network structure is a complicated system, and the operation, maintenance, and management of such a network is a difficult task which may be affected by many influential factors. By using efficiency analysis technology for a railway network, combining physical structure with operation functions can help railway industry to optimize the railway network while improving its efficiency and reliability. This paper presents a new methodology based on complex network principles that combines the physical railway structure with railway operation strategy for a railway network efficiency analysis. In this method, two network models of railway physical and train flow networks are developed for the identification of key stations in the railway network based on network efficiency contribution in which the terms of degree, strength, betweenness, clustering coefficient, and a comprehensive factor are taken into consideration. Once the key stations have been identified and analysed, the railway network efficiency is then studied on the basis of selective and random modes of the station failures. A case study is presented in this paper to demonstrate the application of the proposed methodology. The results show that the identified key stations in the railway network play an important role in improving the overall railway network efficiency, which can provide useful information to railway designers, engineers, operators and maintainers to operate and maintain railway network effectively and efficiently.

1. Introduction

In comparison with road transportation, railways are by far one of the safest means of ground transportation, especially for their passengers and employees. However, there are some issues involved in both maintaining this position in reality and sustaining the public perception of railway safety excellence. The railway now finds itself in a situation where actual and perceived safeties are real issues, to be dealt with in a new public culture of rapid change, short-term pressures, and instant communications [13]. However, operation and maintenance of railway networks are becoming difficult, particularly, it is related to railway network efficiency, reliability, and safety. For example, if a key station fails to operate in a railway network, it would affect the overall railway network transportation efficiency. A station failure to operate can be classified into physical failures of the railway physical network such as failures of signal system, power supply system, track system, and vehicle system, and the function failures of the train flow network such as inaccurate train route planning and scheduling [4, 5]. A network efficiency is related to its demands, flows, costs, and behaviour on the network, which can be used to measure the importance of network components and their rankings [6, 7]. In this study, network efficiency measure is used to assess the importance of railway network stations and their rankings, which is discussed in Section 4 and expressed in Equation (8). Based on the definition of network efficiency, therefore, the identification of key stations in the railway network efficiency analysis is the major task in the railway development and maintenance activities [8, 4]. The railway network is a complex system with hundreds of stations and correlative operations. Therefore, the network efficiency analysis is becoming more important in aspect of ensuring safe and reliable operation to be applied in the railway network. This paper presents a new methodology to analyse the efficiency of the railway networks using complex network principles for the key station identification, in which railway network efficiency is evaluated based on station failure modes. This will provide useful method and information to the industry in the design, operation, and maintenance of the railway network effectively and efficiently.

In the context of the network theory, a complex network can be defined as a graph that is composed of relatively many mutually related nodes including structural and functional relations, and it could also be defined as a network that has nonobservable topological features that do not arise in simple networks such as random ones but often occur in graph models of real systems [9]. A railway network can be classified as a complex network and investigated through complex network analysis [4, 5]. Complex network anlaysis has been successfully applied to analyse the efficiency of networks, for example, a biology network [10], a research cooperating network [11, 12], an electricity supply system [13], a traffic network [4, 14], and even an Internet network [15]. Dey et al. [16] successfully applied complex network theory to analyse safety and reliability of topology impact on the propagation of cascading failure in a national power grid. Zio and Sansavini [17] also applied complex network method for modelling interdependent network systems in order to identify cascade-safe operating margins. These researchers have developed various network analysis models and also studied the structural characteristics of the networks including system indicators such as node degree, length of the path, and clustering coefficient, in which the complex network vulnerability can be analysed in the selective and random node failure modes [1821]. In the literature, some of the studies have also been conducted to investigate characteristics of transportation networks based on complex network theory. Xu et al. [22] studied urban bus transport systems, Porta et al. [23] and Wang et al. [7] looked at insight of urban street networks, Bagler [24] studied airline network, and Dall’Asta et al. [25] investigated USA airline network. Guidotti et al. [26] proposed a probabilistic methodology to quantify the network reliability based on current network efficiency and a measure of connectivity, i.e., eccentricity and heterogeneity. This method was applied to analyse a highway transportation network reliability. Qian et al. [27] developed a cascading failure model of the complex network to simulate the road traffic status by using the delay of the time, incident dissipation factor and load capacity. Chen et al. [28] presented a particle swarm optimization (PSO) algorithm to optimize the invulnerability of China railway traffic network by introducing the concept of the edge to the network. The results produced from these researches provide useful information for maintaining and operating of the complex road and airspace transportation networks.

Because a railway network is a complex network that is also suitable to be examined by using complex network principles. Lin et al. [29] and Li et al. [8] studied China’s high-speed rail network as a complex system and analysed network safety and reliability based on complex network theory. Ouyang et al. [5] also applied complex network theory to study the performance and vulnerability of railways under various types of attacks and hazards. The complex network theory has been widely applied to the reliability and safety analysis of the complex networks and has also been used in railway network reliability analysis. However, these studies are limited to properties of the physical networks, and the railway operation functions are neglected in their analyses. It is essential to develop new methods and models to take not only the characteristics of physical railway network, but also the operation functions into considerationl in this case, train flow network needs to be integrated into the network efficiency analysis process in order to obtain reliable results.

This paper presents a new methodology to analyse railway network efficiency that combines characteristics of railway physical network with the functions of train flow network. In other words, the proposed method considers not only the physical network topologies such as degree and clustering coefficient, but also the dynamic operation parameters such as train running paths, stop-schedules, and service frequencies. The proposed method can be used to identify the key stations in the network and analyse railway network efficiency based on selective and random failures of stations, which provides a useful method and more reliable and accurate information to railway designers, engineers, operators, and maintainers for operating and maintaining railway network effectively and efficiently.

This paper is organised into the following sections. After the introduction, Section 2 presents the development of railway physical network (RPN) and train flow network (TFN) and considerations in the RPN and TFN are discussed more detailed. Specific terms in mathematics are defined in Section 3, which will form the basis of the proposed method using complex network principles for key station identification in the railway network. Section 4 describes a new proposed network efficiency analysis method. A case study of a high-speed rail network efficiency analysis is presented in Section 5 to demonstrate the application of the proposed method, and recommendations are given in this section for improvement of railway network at planning, operation and maintenance in order to satisfy the railway network efficiency requirement. Finally, conclusions are given in Section 6.

2. Development of Railway Physical Network (RPN) and Train Flow Network (TFN)

A new approach for the analysis of network efficiency is proposed, which combines the RPN with the TFN in order to consider network structure properties together with network operation functions in the analysis process. The RPN considers the physical connecting properties in the network and provides constraints to the TFN. The RFN takes train service plan and the operation functions of TFN into consideration. The proposed railway network efficiency analysis process includes three steps, i.e., development of railway network efficiency model, identification of key station indices, station ranking, and network efficiency analysis as shown in Figure 1.

A RPN can be developed based on the existing railway network, and then a TFN of a service plan can be established by integrating RPN into operation strategies such as train running routes, stop-schedules and service frequencies [4, 26]. Key station identification indices can be obtained by evaluating the importance of the stations in a TFN. Stations in the RPN can be ranked based on the evaluated importance of each station. Once key stations are identified, the key station identification indices are then used to analyse network efficiency by the choice of failure modes of stations or the random selection of failure modes of stations in the railway network.

2.1. Railway Physical Network (RPN)

The stations can be considered as nodes, and the connection between any two stations can be expressed as an edge in the RPN [7, 16, 22, 24]. For example, Figure 2(a) shows a simple RPN in which 8 stations are connected by two rail lines, i.e., A-B-C-D-E (5 stations) and F-G-C-H (4 stations). The station C is a junction station, and the dark nodes show that these stations are terminal stations that can be as original or destination stations of the trains. The RPN can be expressed as , where is a set of railway stations in the network, and is a set of rail tracks. The RPN presents the physical connectivity among the stations in the RPN in which takes the track length, section capacity and station capacity into consideration. The RPN can be used to analyse transportation capacity constraints for train service plan.

2.2. Train Flow Network (TFN)

As the stations are represented as nodes in the RPN, if a train has been scheduled to be operated between two stations, this will produce one edge between these two stations as shown in Figure 2(b). The numbers of trains scheduled to stop at any two connected stations determine the weight of edge between the two stations [4]. The total number of edges can be calculated by and denotes a train that has a total number of stations to stop as scheduled. For example, Figure 2(b) shows 8 stations that are connected by 28 edges, i.e., between two stations based on the proposed train service plan as shown in Figure 3. The number on each edge presents frequency of trains, for example, number of 5 on edge A to C presents that 5 trains are scheduled running on this edge and stop at station A and C as shown in Figure 2(b). The TFN can be expressed as , where is the set of stations that any train can stop at these stations, and is a set of edges that creates any two stops at any stations in the RPN. Therefore, the TFN can be developed based on train service plan in which the train stop schedule creates the edges in the RPN, and frequency of trains determines the weight of the edge. Obviously, if the frequency of trains running on an edge is high, the weight of this edge is high.

2.3. RPN and TFN Considerations

As described earlier in this paper, the RPN can be improved by taking the train service plan into consideration to produce the TFN. For example, Figure 3 shows a proposed train service plan which includes 9 stop-schedules, i.e., T1, T2, …,T9 and frequencies of trains are 2, 1, 1, 3, 2, 1, 1, 1, and 1, respectively. The nodes as shown in Figure 3 for each stop-schedule means that a train stops at these stations, for example, the stop-schedule of T4(3) indicates that 3 trains have a same stop-schedule with different departure times, and all of these 3 trains will stop at stations of A, B, C, D, and E. Based on the proposed train service plan, the TFN can be produced as shown in Figure 3 in which train service plan has been taken into consideration.

As can be seen from Figure 2(b), the edge between nodes A and B is created by stop-schedules of T4, T7, and T9 as shown in Figure 3, and the weight of edge is the sum of frequencies of these three stop-schedules, i.e., 3 + 1 + 1 = 5. It should be noted that there is an edge between nodes F and C in TFN as shown in Figure 2(b), although there is not a direct connection between F and C in RPN as shown in Figure 2(a). However, the edge between nodes F and C in TFN is created by stop-schedules of T5, T6, and T7 as shown in Figure 3, and the edge weight is the sum of frequencies of these three stop-schedules, i.e., 2 + 1 + 1 = 4 as shown in Figure 2(b). Similarly, other edges based on stop-schedules can be produced, and the weights of edges in TFN can be calculated. In this case, the railway network physical topology in the RPN and the operation strategies such as train running routes, original stations and destination stations, stop-schedules, and service frequencies can be considered together to produce the TFN by taking the relations and weights of edges into account.

3. Factors Used in Key Station Identification Analysis

As described in Section 2, the TFN can be developed based on the RPN by taking train service plan into consideration. Key station identification indices can be then calculated based on complex network principles [30], including degree centrality (DC), strength centrality (SC), betweenness centrality (BC), clustering coefficient (CC), and comprehensive factor (CF). These indices are then used to assess the importance of each station in the railway network, which are described as below.

3.1. Degree Centrality (DC)

Assume denotes the node in the TFN, the DC of a node is the number of the connections between and other nodes in the RPN, which describes the physical connective influence of a node by the number of its neighbour nodes. For example, the DC of Node A is 7 (i.e., 5 + 1 + 1 = 7) as shown in Figure 2(b), which means that 7 Nodes of B, C, D, F, E, G, and H are connected to Node A directly. The DC of a node in the TFN can be defined as

where is the DC of a node, is the number of the nodes in the RPN, and is a variable of 0 or 1, i.e., if there is a connection between nodes and , then , otherwise, . If a node in the TFN is connected with more edges, it will have a large value of DC . In other words, the DC of a node describes the reachability of the station.

3.2. Strength Centrality (SC)

In the TFN, some of the edges are more important than others that depend on weights of edges. The weight of an edge presents the importance of this edge in the TFN in which it depends on frequencies of train service running on this edge, i.e., the more frequently the edge is used by trains, the more important it is. In this study, the SC is used to describe the weight of each node. For example, the SC of Node A is 25 (5 + 6 + 5 + 6 + 1 + 1 + 1 = 25) as shown in Figure 2(b). Assume SC of a node is the sum of the weights of the edges between and other nodes, and it can be defined as

where is the SC of a node , and is the weight of the edge between node and . The weight of an edge between nodes and in the TFN is the number of trains that do stop at the and stations. The SC of a node also describes the service capability of a specific station, which represents the convenience of the passengers from this station to other stations in the network without any change of the train; in other words, trains can reach more stations from the station.

3.3. Betweenness Centrality (BC)

The BC as defined in the complex network principles [5] describes the influence of a node in the network; in this case, it is a station in the RPN. In this study, the BC relates to the shortest paths from one node to the other one, i.e., from one station to other station. For every pair of nodes, i.e., between two stations in a network, at least there is one shortest path either it has the minimum number of the edges or it has the minimum value of the weights of the edges. A path is defined as from a node (i.e., Station ) to a node (i.e., Station ), which indicates a path passing between two stations in the network based on train service plan [4, 9]. For example, there are 15 shortest paths between any two nodes in the network as shown in Figure 4(a). Among all the 15 shortest paths, 9 paths pass through node S2 including S1-S2, S1-S2-S4, S1-S2-S4-S3, S1-S2-S4-S5, S1-S2-S4-S6, S2-S4, S2-S4-S3, S2-S4-S5, and S2-S4-S6. But 6 paths that are S3-S4, S3-S4-S5, S3-S4-S6, S5-S4, S6-S4, and S5-S4-S6 do not include the node S2. Therefore, the BC of the node S2 is 9/15 = 0.6.

Assume BC of a node in the network without the weight of the edge, which can be calculated by:

where is the number of shortest paths with the minimum number of the edges from a node to a node , and is the number of shortest paths with the minimum number of the edges, which pass through the node from a node to a node .

Similarly, the BC of a node with the weight of the edge is defined as capacity BC, which can be calculated by

where is the number of shortest paths with the minimum sum value of the weights of the edges from a node to a node , and is the number of shortest paths with the minimum sum value of the weights of the edges, in which trains pass through the node from a node to a node . The BC reflects the influence of the nodes throughout the network. A node that has a high impact on network efficiency is called as an influential node. For example, the node S2 in Figure 4(a) is such a node because 9 out of 15 short paths go through this node. Therefore, on the basis of the BC analysis, influential nodes can be obtained in different perspectives of connectivity and transportation capacity.

3.4. Clustering Coefficient (CC)

The CC represents a node that links with a certain node, and whether those nodes have also connected each other or not. The higher the value of the CC of a node is, the more densely connected nodes will be. The CC of a node is defined as

where is CC, is the DC of the node that has a maximum number of edges that equals to , and is the number of the edges that actually exist based on train service plan of the node. In other words, the CC represents the influence of the stations in the network. For example, as shown in Figure 4(b), node S4 has actual connections with nodes S2, S3, S5, and S6. In this case, which consists of 6 edges. i.e., S4-S2, S4-S3, S4-S5, S4-S6, S2-S3, and S3-S6. The maximum number of edges of node S4 is . However, only 6 connections actually exist in the network and other 4 edges are not actually existing. In other words, these 4 edges indicate that the connections can be created by using other actual connections, for example, node S4 has not an actual link with node S1 but can use the path S2-S1 to create an edge such as S4-S2-S1. Therefore, by using Equation (5), the CC of S4 is . The CC represents the influence of the stations in the network, but it has some problems by using the CC in the ranking importance of stations; for example, in some cases, if some of the nodes have same connections in the network, these nodes will have a same value of the CC. This will be demonstrated in the section of case study.

3.5. Comprehensive Factor (CF)

In this study, an important factor of CF is introduced in order to integrate degree centrality (DC), strength centrality (SC), betweenness centrality (BC and ) and Clustering Coefficient (CC) in a unique manner for network identification analysis in the TFN. DC, SC, BC ( and ) and CC can be normalized by

Then, the CF of a node can be calculated by

where is the number 1, 2, 3, 4, and 5, i.e., 1 denotes DC, 2 is SC, 3 is BC , 4 is BC , and 5 is CC, represents the values of DC, SC, BC ( and ), and CC of a node , is the minimum value of DC, SC, BC ( and ), and CC of the nodes in the TFN, is the maximum value of DC, SC, BC ( and ) and CC of the nodes in TFN, is normalized value of the node , is the weight of the , which shows the impact of different in the CF. The selection of depends on the evaluation purposes such as the connectivity and transportation capacity. For example, if DC, SC, BC ( and ), and CC of the nodes are taken as equally important, then can be chosen as 1/5. If transportation capacity in the TFN is more important than connectivity in the RFN, then can be 2/5. Expert judgement and engineering judgement such as the Delphi method [30] can be used in the selection of .

4. A New Methodology for Network Efficiency Analysis

Network reliability can be obtained by the analysis of the characteristics of the network under selective and random station failure modes in the railway network [29]. Selective failure mode will enable network analysts to select stations in the railway network based on the current status of the network and their experience to analyse the network reliability, while random failure mode will enable network analysts to assess network efficiency by selecting stations randomly in the railway network. The network efficiency and relative network efficiency are given below to evaluate the reliability of the TFN, which are derived by

where is the total number of nodes in the network after a number of station failures, and denotes the number of edges in the shortest path between nodes and , which represents the distance between nodes and (if is not connected with , then , and ), and is the network efficiency after the failures of selected stations, and is the initial network efficiency, and denote two different nodes.

5. Case Study

5.1. Background

This section presents a case study on network efficiency analysis for a high-speed rail network to demonstrate the application of the proposed methodology for railway network efficiency analysis. The data and information have been collected from the railway industry for one-day train operation, which indicates that in a total of 2487 trains were operated in such a high-speed rail network. Figure 5 shows the established RPN with 485 nodes (i.e., stations in the network). Based on train service plan on the day, the TFN has also been established with the same number of nodes as the RPN has, i.e., 485 nodes, and the number of edges is analysed as described in Section 2.2; in this case, there are 68198 edges which make up the TFN is more complex than the RPN. Since a large amount of data and information in the network has to be analysed, the new methodology described above has been converted into computer code.

5.2. Degree Centrality (DC) Calculation

As stated in Section 3.1, the distribution of DC can be calculated by Equation (1), for convenience, it has been converted into exponential distribution by

Figure 6(a) shows the results of distribution of DC, and the results of exponential distribution of DC are shown in Figure 6(b). As can be seen that only 11 stations that it is about 2.26% of a total number of stations in the network have a value of DC more than 150. But most of those stations are identified as the hub stations such as Nos. 8, 10, 38, and 48 in Figure 5.

5.3. Strength Centrality (SC) Calculation

The distribution of SC is shown in Figure 7(a), which can be calculated by Equation (2). The results show that only 12 stations (about 4.53%) of total number of stations in the network have a value more than 1000 of SC, and 377 stations (about 77.73%) of total number of stations in the network have a value less than 400 of SC, which indicates that the distribution of SC of the stations in the TFN is extremely deviated, which represents that only a few stations having high service capacity in the network. In other words, it would be more convenient for the passengers to travel from these high service capacity stations than others. Comparing distributions of DC with SC are shown in Figure 7(b), which has been converted by using power law

where can be obtained by Equation (1), and k can be calculated by Equation (2). Equation (11) demonstrates if the connectivity of a station in the current transportation operation strategy is , the ability to serve the passengers is . As can be seen in Figure 7(b), the SC is increased faster than the DC, in other words, the transportation capacity of a station is growing faster than the growth of connectivity.

5.4. Betweenness Centrality (BC) Calculation

The distributions of BC and (i.e., with/without the weight of an edge) in the TFN are given in Table 1 and shown in Figure 8, which can be calculated by Equations (3) and (4). Most of the stations have a small value of probability of and . Value ranges of BC and BC in Table 1 are presented as percentage. Only 5 stations have a large value of , which shows that 1.8% of total stations in the railway network with a value between 0.05730 and 0.06548. In other words, these 5 stations are important and contribute significantly to the efficiency of the TFN.

5.5. Clustering Coefficient Calculation

The CC can be calculated by using Equation (5). Figure 9(a) shows the distribution of the CC in the cast study. The average of CC in the TFN is 0.697, which demonstrates high aggregation characteristics of the TFN, i.e., stations within the railway network are closely connected. The relationship between the CC and DC of each node is shown in Figure 9(b). As can be seen from Figure 9(b) that, obviously, a node with high CC has a low value of DC. In other words, the lower value of the DC the station has, the greater value of the CC that the station has.

5.6. Comprehensive Factor

The CF can be calculated by Equations (6) and (7). Table 2 shows the results and station important rankings of top 20 stations based on their values of the CF, and these stations are also presented in Figure 5 denoted by red spots. Table 2 also shows rankings based on values of DC, SC, TBC, CBC, and CC, respectively. For example, in the column of station (DC), the rankings are based on the values of the DC of the stations, which station identity numbers are given that can be found in Figure 5 and the values of the DC of the stations. In comparison of the rankings based on values of the CF of the stations with the rankings based on values of the DC, SC, BC (, BC ), CBC and CC of stations, as can be seen that rankings of stations are different. The reason is that the evaluations of DC, SC, BC ( and ) and CC have different focuses as discussed in Sections 3.1–3.5. However, the CF addresses all of issues that focuses on DC, SC, BC , BC and CC have; in other words, CF not only takes the railway network physical topology, but also the operation strategies into consideration such as train running routes, original stations and destination stations, stop-schedules, and service frequencies. Therefore, by using CF in the assessment railway network efficiency, more reliable results can be obtained. This can be demonstrated that, for example, in an emergency, a train may be delayed or cancelled; in this case, the RPN is not changed, but the weights and edges of the TFN are changed, and application of a single DC or SC or BC , or BC or CC cannot provide reliable results; CF takes all the factors that DC, SC, BC , BC , and CC used in the consideration; therefore, by using CF, more reliable results can be obtained. The results of this case study have been further confirmed by the industry from their observation in the railway network [5].

Additionally, based on the results produced by the evaluation using CF, as can be seen that most of nodes (i.e., stations) with the high values of CF are in the central and eastern regions in the railway network as shown in Figure 5. This is particularly true because this is also confirmed that these areas have a high economic development with high populations and high level of demand of transportation. However, it should be noted that it is not all of these top 20 stations are in the areas with high economic development and high populations. For example, although station Nos. 11, 40, 41, 58, 96, and 174 have higher values of CF, these stations only have one rail line pass through as shown in Figure 5. In order words, these stations (i.e., nodes) have lower physical connectivity in the RPN but have a higher transportation capacity because the frequency of trains using these stations is high. In other words, more trains are scheduled to use these stations. Another interesting finding is that the stations Nos. 259 and 266 in the capital city, are ranked in the places of 17th and 14th, and this is because there are four stations in the capital city to decentralize transport pressure. Other issues should be noted as stated in Section 3.4, in some cases, the application of the CC in the ranking importance of stations is not effective. In this case, as can be seen from Table 2, the top 20 stations have a same value of the CC because these stations have a same number of connections with other stations in the network.

5.7. Network Efficiency Analysis

In this case, the average network distance of RPN and TFN can be obtained as 30.84 and 2.89 by analysing the network. In other words, the average network distance 30.84 in the RPN represents that a passenger travels from an original station to a destination station that at an average the passenger needs to pass through about 31 (30.84 ≈ 31) stations, and the average network distance 2.89 in the TFN represents the average number of trains that a passenger changes trains during the travel in the railway network. In this case, the passenger needs to change 2 (2.89 − 1 = 1.89 ≈ 2) trains at an average during the travel from an original station to a destination station in the railway network. The network efficiency can be calculated by Equation (8) as described in Section 4, which shows that the network efficiency of the TFN is 2.10, and the RPN is 0.06. This demonstrates that the network efficiency of the TFN is higher than the RPN, in other words, this represents physical connectivity of the railway network in this case is not very dense, but it has a high service capacity and convenient transportation services. The relative network efficiency can be calculated by Equation (9). Figure 10 shows distributions of under different station failure modes based on selective and random failure modes. Selective mode 5 shows the percentage of failures of top 20 stations, which presents that relative network efficiency is declined sharply until 10% of top 20 station failure. When 40% of top 20 station failures, efficiency of the railway network is near the zero. Random models 1, 2, 3, and 4 show efficiency under randomly selected percentage of the station failures in the network. The following assumptions are made to implement the selection of random modes [4, 9]:(i)All stations in the network have an equal opportunity to be selected.(ii)Each station may be/may not be selected at every time or not.(iii)Selections of stations start from 10% of station failures to 80% of station failures in every random mode.

As can be seen from Figure 10, when 80% of stations in the network fail to provide service in random modes, the network efficiency is near the zero. Therefore, in comparison results of selective mode with random modes, the identified top 20 stations as key stations in the network have high impacts on the network efficiency, which require more attention in operation and further development in the railway network. Based on the results produced by the railway network efficiency analysis, recommendations for improvement and optimisation of a railway network can be made, which should address two aspects in the planning and operation.

5.7.1. Railway Network Infrastructure Planning

The network efficiency should be considered in the future railway infrastructure planning by taking the economic and demographic factors into consideration. In this case, some of stations with a high CF in the network only have one railway line passing through, and these stations should consider a new rail line to be constructed, which will not only balance the distribution of the key stations in the network and relieves the transportation pressure but also help to improve the network efficiency.

The combination of RPN and TFN indicates that some stations are located in the same city, for instance, 4 stations in the capital city. Some hub links between these stations should be allocated in the future, which can not only to improve the physical connectivity of the network and transportation services but also improve the efficiency of the whole network in different failure modes.

5.7.2. Railway Transportation Operation

According to the efficiency analysis of the railway network, once the key stations are failed or lost service capacity, the connectivity and efficiency of the overall network would drop down rapidly. To ensure the railway network to provide service, it is recommended to establish operation and maintenance strategies to protect these key stations in the railway network in case of emergency, for example, accidents and incidents, and extreme weather. Furthermore, service capacity can be improved by optimising operation scheme with the constraints of the existing RPN. As described in Section 3.1, the higher k is in the network, the higher service capacity of the network can provide. Therefore, a better operation scheme can also increase efficiency of the railway network.

6. Conclusions

The paper presented a new method to analyse the efficiency of the railway network by identifying the key stations based on the two network models of RPN and TFN. Both physical network topology and dynamic operation strategies are considered in this method. Considering the key stations, railway network efficiency can be analysed under selective and random failure modes. A real case study on a high-speed railway network is presented to demonstrate the application of the proposed method. In this case, the key stations can be identified based on the CF in which connectivity, transportation capacity and local influence are taken into consideration. The results show that the identified key stations in the railway network play an important role in improving the overall railway network efficiency, which provide useful information to railway designers, engineers, operators and maintainers to operate and maintain railway network effectively and efficiently.

As stated above, the proposed new method considers physical network topology and dynamic operation strategies in the railway network efficiency analysis process. As suggestions, the following aspects in future work may need to be considered to obtain information on availability and stability of the railway network. Research should address (1) application of the proposed method by taking railway transportation organization strategy into physical network consideration to establish a vehicle flow network for railway transportation organization availability analysis, (2) methods to maintain key stations within a railway network as normal service in an emergency because of an accident or incident occurred, and (3) methods to assess the existing networks by increasing number of new stations while improving efficiency of the railway network.

Data Availability

The readers can find the data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work describes herein is part of research projects funded by National Key Research and Development Programme of China on “Safety and Security Technology of High-speed Railway System (Grant No. 2016YFB1200401), National Natural Science Foundation of China on “Research on Theory and Methodology of Train Operation Adjustment in Urban Regional Railway Network under Special Conditions (Grant No. 71701010), and Highways England on “New Methodology for Maintenance Decision Making of Structures (Grant No. 546037-PMRB13). Their support is gratefully acknowledged.