Abstract

Complex systems contain a large number of components, and in some cases, failure of one or more of these components can cause the entire system to fail. Replacing failed components with other functioning components properly in the original system can be an attractive way for improving system reliability. This paper proposes a new system reliability optimization model to achieve optimal component reliability and the ideal component-swapping strategy under a certain set of constraints. Furthermore, the survival signature is introduced to more efficient calculation of system reliability under various component-swapping cases, and an artificial bee colony (ABC) algorithm with local search method for component swapping is applied to solve the optimization problem. Finally, numerical examples are presented to illustrate the optimization process.

1. Introduction

As science and technology continue to progress, the complexity and scale of systems are rapidly increasing. Moreover, to achieve diverse sophisticated functions, these complex systems must be capable of operating under severe conditions such as uninterrupted high loads and extreme temperatures. The high risk and uncertainties associated with extreme environments can lead to serious accidents resulting in huge economic losses and threats to human lives [14].

Two fundamental methods are commonly applied for enhancement of system reliability. The first way is to improve reliability of components (reliability allocation), and the second way is to provide parallel redundancy components (redundancy allocation) [5, 6]. Since effort for improvement usually requires resources, system reliability optimization aims to obtain the optimal component reliability level and redundancy level to maximize the system reliability within a certain amount of resources [7]. Multiple heuristic algorithms have been proposed to solve such a nonlinear programming problem, and excellent results are achieved [812]. In addition, other mathematical methods have also been presented. Caserta and Voss [13] translated the reliability redundancy allocation problem into a knapsack problem solved by the branch and cut algorithm. Hemmati et al. [14] introduced neural networks in reliability redundancy allocation. Birnbaum importance and reliability sensitivity are also applied as reference for system reliability optimization [1517]. Besides, some studies have widened the research subjects of system reliability optimization, including phased mission system [18], complex network system [19, 20], and system with uncertainties [21, 22].

As is clear from the literature, the existing research studies tend to focus on developing algorithms with high efficiency and applying them to various systems. However, the basic model of system reliability optimization is still limited in fundamental patterns, i.e., reliability and redundancy allocation, which will increase the cost, volume, and weight of the entire system and do not always yield ideal results. Component swapping is a new effective strategy for system reliability enhancement through swap between failed component and functioning component in a system [23]. This method is attractive since it makes full utilization of the existing components in the original system to avoid unnecessary cost and ponderosity of the entire system. However, not enough attention is paid on component swapping in related research and the algorithms for the optimal swapping strategy are rarely studied. Therefore, in this paper, a system reliability optimization model based on component swapping is proposed. Survival signature is introduced for efficient reliability calculation, and artificial bee colony algorithm is applied for optimization.

The remainder of the paper is organized as follows: Section 2 provides a description of the component-swapping strategy for improving system reliability and introduces the survival signature for more efficient reliability calculations under different swapping cases; Section 3 introduces the component swapping-based system reliability optimization model and artificial bee colony (ABC) algorithm for optimization; Section 4 presents three numerical cases to illustrate the application of newly proposed reliability optimization model based on component swapping; Finally, some conclusions and directions for future work are presented in Section 5.

2. Swapping between Components in the Original System

In practical systems, certain components have a higher degree of importance than others, and failure of these components can cause the entire system to malfunction [24]. To make systems more resilient to possible breakdowns, failed components with higher importance can be replaced with other components in the original system of lower importance that are still functioning [23]. Consider a system with three components, as shown in Figure 1. Component 1 and 2 are of the same type, and failure of component 1 will lead to the failure of the entire system. If component 1 fails and is replaced by component 2, the system can continue working.

For the purpose of discussion, the components to be replaced are called “target components” and the components to replace target components are called “spare components.” The following conditions are defined in this paper:(1)Each target component corresponds to one unique spare component; that is to say, the maximum number of swapping sets for n components of same type is n/2 or (n − 1)/2, for n is an even or odd integer(2)Swapping only occurs when the target component and the whole system fail(3)Swapping only occurs between the same type of components(4)The time required for component swapping is ignored

2.1. Efficient Reliability Analysis with Survival Signature under Component Swapping

For a system with n components, the vector describes the working state of all components where or 0 denotes functioning or failure, respectively, of the ith component. Then, the structure function of the system can be mapped as an associated state vector x with a value of 1 or 0 denoting functioning or failure of the system, respectively [25].

For example, the structure function of the system presented in Figure 1 can be expressed as

Obtaining the structure function of a complex system with multiple components requires a large amount of calculations [26]. Furthermore, the structure of the system changes after component swapping, which makes it difficult to describe the system using the structure function. To calculate the system reliability more efficiently under various component swapping cases, truth table and survival signature are introduced.

The reliability truth table is a matrix describing the working status of the components and the system itself using Boolean values [27]. For a system with n components, the truth table is a matrix of 2n rows and n columns. Each row represents a single possible combination of component statuses, and the entire truth table contains all possible combinations of component statuses. Each row of the truth table also has a corresponding indicator with a value of 0 or 1 representing either the failure or functioning of the whole system based on the statuses of components. Together, all 2n indicators make up the indicator vector of the system.

For a system with 3 components, as shown in Figure 1, the truth table and the indicator vector are presented in Table 1.

When component swapping is introduced, the new structure of the system can be expressed by modifying the truth table of the original system. Since swapping only occurs when a target component and the whole system have failed, rows with a system status indicator value of 1 should remain unchanged. For the rest of the truth table, all rows with the target component value of 0 and the spare component value of 1 are found, and the values of 0 and 1 in each row are swapped. If the system can function after the rows are modified, the corresponding indicator changes from 0 to 1. Thus, the modified truth table and indicator vector can clearly describe the structure of the system after introducing component swapping. For example, to describe the swapping case where component 2 replaces component 1 in the system presented in Figure 1, transformation of the truth table is illustrated in Figure 2.

After the truth table and indicator vector are obtained, the reliability of the system can be calculated using the survival signature, which is an effective tool for the reliability analysis of complex systems with multiple component types. The survival signature represents the conditional probability that a system can maintain normal operation while a certain number of components in the system are functioning normally [28, 29]. For a system containing K different types of components, the survival signature can be calculated aswhere are the number of functioning components of each type, is the total number of components of type k, is the structural function of the system, and si denotes the set of all possible combinations of functioning components of each type, given the number of functioning components of each type . Since the truth table and indicator vector are already known for the system with component swapping, can be easily obtained by searching the matrix.

For the system shown in Figure 1, the survival signatures of the original system, Φ(l1, l2), and the system with component 1 swapped with component 2, Φs(l1, l2), are presented in Table 2.

The survival signature is essentially the conditional probability of system reliability; thus, the full probability equation can be used to derive the reliability of the system at a certain moment. Letting denote the number of functioning components of type k at time t, the probability of the system functioning at time t can be calculated as follows [28]:

By using the cumulative distribution function (CDF) of type k components, denoted by , the above equation can be further derived into the following form:

2.2. Influence of Different Swapping Strategies on System Reliability

A bridge system consisting of 7 components of 2 types is presented in Figure 3 to illustrate all possible swapping cases. Letting [a1-b1, …, an-bn] denote a certain swapping case in which target component bi is replaced with spare component ai(), all possible swapping cases for each type of component are listed in Tables 3 and 4.

As shown in Tables 3 and 4, the total number of swapping cases for type 1 and type 2 components is 25 and 7, respectively. Thus, the number of swapping cases for the whole system is 25 × 7 = 175.

To illustrate the influence of different swapping strategies on system reliability, four cases are considered and discussed:

Case 1. Component 2 replaces component 1 in type 1.

Case 2. Component 2 replaces component 5 in type 1.

Case 3. Component 2 replaces component 1 in type 1 and component 3 replaces component 4 in type 2.

Case 4. Component 2 replaces component 1 in type 1 and component 4 replaces component 3 in type 2.
Truth tables and indicator vectors for the original bridge system under four different swapping cases can be applied according to Section 2.1. The survival signature of the original system, Φ(l1, l2), and the survival signature of the system under four different swapping cases, Φ1(l1, l2), Φ2(l1, l2), Φ3(l1, l2), and Φ4(l1, l2), were calculated by equation (2), and the results are presented in Table 5. Rows with Φ(l1, l2) = Φ1∼5(l1, l2) = 0 are omitted since these data do not affect the calculations.
The CDFs of each type of component in Figure 3 can be expressed as follows:Reliability curves of the original system and the system under four swapping cases were derived and are shown in Figure 4.
As shown in Figure 4, component swapping considerably improves the reliability of the entire system, and a clear gap between improvements can be observed among the different swapping strategies. It is worth noting that more components were swapped in Case 3 compared to Case 1. However, a smaller improvement in reliability was observed for Case 3 because swapped components can affect other components, leading to disconnection of some pathways in some cases. For example, if components 1, 4, and 5 all fail at the same time, the system will not work properly and all of the failed components will need to be replaced (Figure 5).
The system structure after separately implementing swapping strategies 1 and 3 is shown in Figures 6 and 7, respectively.
In Case 1, failed component 1 is replaced by component 2 and a connected path [2 3 6 7] is created, which allows the system to continue operating normally. However, in Case 3, component 4 is also replaced by component 3 in addition to swapping components 1 and 2 and no connected path is created. Thus, the system cannot continue to function properly.
This can be explained by the Birnbaum importance [30]. The Birnbaum importance of a component can reflect the impact of component status on system reliability, which can be measured bywhere Ri is the reliability of ith component at time t and h(1i, R(t)) and h(0i, R(t)) represent the reliability of the system with ith component that is working and failed, respectively. The importance curves of component 3 and component 4 in the bridge system obtained by equation (6) are shown in Figure 8.
As can be seen above, the importance of component 3 is always higher than that of component 4 during the working period, which means replacing component 4 with component 3 is not an effective strategy to improve system reliability.
With the increase in scale and complexity of the system, it becomes more complicated to determine the effective swapping strategy based on component importance. Therefore, a swapping-based system reliability optimization model is built and a heuristic algorithm is applied to achieve the optimal swapping strategy.

3. Component Swapping-Based System Reliability Optimization

Section 2 shows large variation in system reliability improvement depending on the component-swapping case selected. For complex systems with a large number of components, more feasible swapping strategies are available to choose from; therefore, new optimization approaches for these problems are necessary. In practice, many factors inevitably restrict swapping of components; thus, component swapping should be considered an optional way to improve system reliability under various constraints, instead of component redundancy. A new reliability optimization model can be constructed that searches for the optimal component reliability as well as the best component-swapping strategy to maximize reliability of the system under certain constraints, such as cost, weight, and volume.

3.1. Establishment of Swapping-Based Reliability Optimization Model

Consider a system comprised of K different types of components with components of type . Components of the same type share the same reliability, and swapping can only occur between the same component type. For ease of programming, the swapping strategy for components of type k can be expressed by matrix with n rows and 2 columns:where x represents spare components of type k, y represents target components of type k, and nk represents the number of component-swapping set in type k, which should not exceed since each target component corresponds to only one unique spare component. It should be noted that matrices can only describe swapping of a single component type; therefore, exchanging rows within the matrix will have no effect on the calculations. Moreover, if no type k component is swapped, hk is an empty matrix.

Taking the bridge system presented in Figure 3 as an example, for type 1, component 2 replaces component 1 and component 5 replaces component 7 and for type 2, component 4 replaces component 3. Then, the swapping strategy can be expressed using the following matrices:

The survival signature of the system for a certain swapping case can be obtained from equation (2), and under a certain component reliability and certain swapping strategy, the system reliability can be expressed by equation (4). Therefore, the swapping-based system reliability optimization problem can be seen as a nonlinear mixed-integer programming problem for determining the optimal component reliability of each component type and the best component-swapping matrix for each. The mathematical model can be defined as follows:where is a variable vector containing the reliability of components in each type, is a collection containing K matrices representing the swapping strategy for components in each type, and are the total volume, cost, and weight, respectively, as a function of the component reliability and swapping case.

Obviously, component swapping does not directly increase the weight and volume of the system. However, the movement of spare components to the target component position will undoubtedly bring some inconvenience to the process of swapping considering the weight and volume of the components. In order to reflect the limitation of the weight and volume of components on the number of swapping in the optimization model, the constraint functions are defined as follows:where is the number of swapping sets in type k components, i.e., the number of rows of matrix ; and are the weight and volume factors of type k components; the additional terms in the constraint functions, and , represent the volume and weight of the original components in the system. The cost function increases with component reliability and the number of swapping sets in the system and can be expressed as follows [7]:where and are known constants representing the scaling factor and the shaping factor of the component, respectively, and is the mission time during which the system must function normally.

3.2. Penalty Guided Artificial Bee Colony Algorithm

Once a system reliability optimization model is established, the optimal solution can be quickly obtained through intelligent algorithms. In this paper, the artificial bee colony (ABC) algorithm is used [31, 32]. The ABC algorithm based on the forging behavior of honey bees can search over both feasible and infeasible space to find the best solution; thus, the optimal swapping strategy can be obtained from finite possible strategies after multiple iterations.

For the ABC algorithm to be capable of solving the constrained optimization problem, it is necessary to process constraints in the optimization problem and add them to the objective function as a penalty factor. The penalized optimization objective function can be expressed as [32]where RS is the system reliability obtained by using (9) and γ is a positive amplification constant. A flowchart of the algorithm is presented in Figure 9. The main processes for solving the swapping-based reliability optimization problem using the ABC algorithm are as follows.(1)Setting Parameters of the AlgorithmThe maximum cycle number of the algorithm M, population of solutions N, and limit update number L are set.(2)Solution InitializationThe initial population is formed by randomly generating N solutions. Reliability of components of type k can be generated as follows [32]:where rmax and rmin represent the upper and lower bounds of reliability, respectively.The swapping matrices of the system are generated by stochastic extraction and random arrangement. First, a random integer nk is generated to determine the number of component pairs participating in swapping of type k:Then, 2nk elements are randomly selected from the set of type k components and randomly arranged to form matrix hk. The process can be realized using the randperm function in MATLAB.(3)Fitness Evaluation for Each SolutionTo describe the degree of excellence of each solution, a fitness value is calculated using the following equation:where Rpi is the optimization result of the ith solution calculated by equation (12).(4) Local Search and Generating New SolutionsLet be the ith solution in the whole swarm population. To generate a new candidate solution nearby , a variable index is generated and solution is randomly selected. Except for the zth variable, the rest of the newly generated solution is the same as the original solution .If , the variable to be modified is the reliability of type z components, which can be generated as follows [32]:where is a random parameter between 1 and −1; thus, the new solution will beIf , the variable to be modified is the swapping matrix for components of type z-K which can be generated as follows:where a is a matrix of z1 rows and 2 columns, consisting of rows of the same content between matrix and . For example, if is [1 4; 2 5; 9 3; 6 7] and is [9 3, 6 8, 1 4], then a can be formed as [9 3, 1 4]. In addition, a matrix b with z2 rows and 2 columns can be randomly generated using the method described in step 1: removing all elements of matrix a from components of type z-K, then randomly selecting 2z2 elements from the remainder, and randomly arranging them to form rest of the matrix, where z2 is a random integer index, generated as follows:Thus, the new solution isThen, the solutions are substituted into the objective function to calculate the fitness using equation (15). If the newly generated solution is better than the original one, the original is replaced with the new one; otherwise, the original solution is retained.(5)Assign Onlooker BeesEvaluate the whole swarm population and accept the solution with probability calculated as follows [32]:where fit is the fitness value of the solution reflecting the superiority of solution vector and N is the total number of solutions of the whole swarm; moreover, the better the solution, the more likely it will be accepted.(6)Determine the Abandoned SolutionIf a solution is not accepted after L cycles of update, the solution is abandoned and replaced with a new solution randomly generated using Step 1.(7)Repeat the Aforementioned Steps for M Iterations

4. Numerical Examples

A swapping-based system reliability optimization model was established for two complex systems, and the ABC algorithm was applied to determine the optimal component reliability and best swapping strategy. Multiple calculations were carried out for each example to verify the stability of the algorithm.

Example 1. A system consisting of 9 components of 2 types is shown in Figure 10. Components 1, 2, 3, 5, and 8 are type 1, and components 4, 6, 7, and 9 are type 2. Parameters used in the model are listed in Table 6. Assuming the failure time of components in the system obeys an exponential distribution, and failure rates of each component, type 1 and type 2, at time T are 0.05 and 0.07, respectively.
The swapping-based reliability optimization mathematical model can be defined as follows:The maximum cycle number was set to 2000, the population of solutions was set to 50, and the limit update number was set to 100. Then, optimization was performed by applying the penalty guided ABC algorithm. The optimal component reliability and swapping strategy are shown in Table 7.
After multiple calculations, the optimal reliability at time t = T was 0.90847271 and 0.86742572 for type 1 and type 2 components, respectively. The optimal swapping strategy for the system is as follows: component 8 is set to replace component 5 in type 1; component 7 is set to replace component 4 and component 9 is set to replace component 6 in type 2.
The survival signature of the original system, Φ(l1, l2), and the survival signature under the optimal swapping strategy, Φs(l1, l2), were calculated using equation (2), and the results are presented in Table 8. Rows with Φ(l1, l2) = Φs(l1, l2) = 0 are omitted since these data have no effect on the calculations.
Birnbaum importance of components is listed in Table 9 to examine the effectiveness of the swapping strategy. As can be seen from Table 9, the importance of target components is higher than that of spare components in each type, which means that the current swapping strategy is effective in reliability improving.
Since the component failure time obeys an exponential distribution and the component reliability at time T is already known, we can obtain the reliability of the system at time before and after the optimization using equation (4). Reliability curves are shown in Figure 11, and a comparison of the system and component parameters before and after the optimization is presented in Table 10.
The reliability curves show that the component swapping-based reliability optimization can considerably improve system reliability. Furthermore, the system can still achieve high reliability levels even if the component reliability is lower after the swapping-based optimization, as seen from Table 10, which can reduce unnecessary costs.

Example 2. A system consisting of 16 components of 3 types is shown in Figure 12. Components 1, 4, 7, 10, 12, and 15 are type 1, components 3, 8, 11, and 16 are type 2, and components 2, 5, 6, 9, 13, and 14 are type 3. Parameters of the model are listed in Table 11. Similar to Example 1, failure times of the components in the system are assumed to obey an exponential distribution, and failure rates of each component, type 1, type 2, and type 3, at time T are 0.06, 0.03, and 0.02, respectively.
The replacement-based reliability optimization mathematical model for Example 2 can be described as follows:The maximum number of cycles of the algorithm was set to 4000, the population of solutions was set to 50, and the limit update number was set to 100. Then, the optimization was performed by applying the penalty guided ABC algorithm. Results for the optimal component reliability and swapping strategy are presented in Table 12.
After multiple calculations, the optimal reliabilities of each type of component at time t = T were 0.82219070, 0.93655845, and 0.90963046. The optimal swapping strategy for the system is as follows: component 10 is set to replace component 1 and component 12 is set to replace component 7 in type 1; component 11 is set to replace component 16 and component 8 is set to replace component 3 in type 2; component 5 is set to replace component 13 and component 14 is set to replace component 2 in type 3. Importance of components is listed in Table 13 which shows that importance of target components is higher than that of spare components in each type. Reliability curves of the system before and after the optimization are shown in Figure 13. A comparison of the system and component parameters before and after the optimization is presented in Table 14.

Example 3. An enterprise core directed network system is shown in Figure 14. The condition for the system to remain functioning is that the source and sink vertices are connected by at least one path. It is assumed that all vertices are reliable, and the edges are of three types. Type 1 consists of e1, e2, e3, e4, e5, e6, and e7 with failure rate 0.08 at time T; type 2 consists of e10, e11, e13, and e14 with failure rate 0.07 at time T; type 3 consists of e8, e9, e12, e15, e16, and e17 with failure rate 0.05 at time T. The failure times of the edges are all assumed to obey exponential distribution. Parameters used in the model are listed in Table 15.
The replacement-based reliability optimization mathematical model for Example 3 can be described as follows:The maximum number of cycles of the algorithm was set to 5000, the population of solutions was set to 50, and the limit update number was set to 100. Then, the optimization was performed by applying the penalty guided ABC algorithm. Results for the optimal component reliability and swapping strategy are presented in Table 16.
After multiple calculations, the optimal reliabilities of each type of component at time t = T were 0.89355334, 0.80957855, and 0.91690166. The optimal swapping strategy for the system is as follows: component 4 is set to replace component 3, component 6 is set to replace component 7, and component 5 is set to replace component 2 in type 1; component 10 is set to replace component 11 in type 2; component 12 is set to replace component 16 and component 8 is set to replace component 15 in type 3. Importance of components (edges) is listed in Table 17 which shows that importance of target components is higher than that of spare components in each type. Reliability curves of the network system before and after the optimization are shown in Figure 15. A comparison of the system and component parameters before and after the optimization is presented in Table 18.

5. Conclusions

In this paper, a new system reliability optimization model based on component swapping is proposed to achieve the optimal component reliability and to determine the best swapping strategy for each type of component. The analysis was performed under certain constraint functions for volume, weight, and cost. The survival signature and truth table are introduced into the system reliability calculations, by which the structure of the system can be expressed as a matrix, and the system reliability under different swapping strategies can be calculated more efficiently using matrix operations.

The artificial bee colony algorithm is applied to solve this type of optimization problem. The solution initialization and local search of the original ABC are modified using matrix operations for component swapping. Three swapping-based system reliability optimization examples are established, and optimizations were performed using the ABC algorithm. The results show that the component swapping-based optimization can effectively improve system reliability within limited costs.

It should be noted that the component-swapping strategy discussed in this paper is limited to a one-to-one component pattern, whereas in practice, swapping strategies can be more flexible. For example, more than one component could potentially be used to replace an important component if necessary, making the system more robust. The component swapping proposed in this paper is time-consuming which would be impossible to achieve in practice. The time required for swapping operations can be viewed as a factor affecting the system reliability and optimization costs and will therefore be considered in future work. Furthermore, the proposed swapping-based optimization could be combined with redundancy allocation to form a new system reliability optimization model for determining the optimal redundancy level and component-swapping strategy.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

The authors gratefully acknowledge the support of the National Natural Science Foundation of China (51975110, 51575094, and 51875094).