Research Article | Open Access
Jean Chamberlain Chedjou, Kyandoghere Kyamakya, "A Universal Concept for Robust Solving of Shortest Path Problems in Dynamically Reconfigurable Graphs", Mathematical Problems in Engineering, vol. 2015, Article ID 345049, 23 pages, 2015. https://doi.org/10.1155/2015/345049
A Universal Concept for Robust Solving of Shortest Path Problems in Dynamically Reconfigurable Graphs
This paper develops a flexible analytical concept for robust shortest path detection in dynamically reconfigurable graphs. The concept is expressed by a mathematical model representing the shortest path problem solver. The proposed mathematical model is characterized by three fundamental parameters expressing (a) the graph topology (through the “incidence matrix”), (b) the edge weights (with dynamic external weights’ setting capability), and (c) the dynamic reconfigurability through external input(s) of the source-destination nodes pair. In order to demonstrate the universality of the developed concept, a general algorithm is proposed to determine the three fundamental parameters (of the mathematical model developed) for all types of graphs regardless of their topology, magnitude, and size. It is demonstrated that the main advantage of the developed concept is that arc costs, the origin-destination pair setting, and the graph topology are dynamically provided by external commands, which are inputs of the shortest path solver model. This enables high flexibility and full reconfigurability of the developed concept, without any retraining need. To validate the concept developed, benchmarking is performed leading to a comparison of its performance with the performances of two well-known concepts based on neural networks.
The shortest path problem (SPP) is one of the classical combinatorial optimization problems [1–3]. This problem relates to finding the minimum cost path in a predefined source-destination node pair in a weighted graph network . The interest devoted to shortest path finding can be explained by the various potential applications of the SPP in science and engineering. Indeed, the SPP is essentially involved in many use cases, in, for example, the following: vehicle routing in transportation systems , traffic routing in communication networks , path planning in robotic systems [6, 7] and scheduling , and video image analysis . Further applications of SP are found in electronics (e.g., for VLSI physical design) , medical imagery (e.g., for virtual endoscopy) , and image processing (e.g., for energy minimization in vision) , just to name a few.
In intelligent transportation systems, the SPP can be considered as a subproblem for many broader problems such as route guidance , vehicle dispatching , real-time traffic information sensing , and production systems planning. These specific problems require real-time (e.g., ultrafast) processing in order to achieve results in extreme short time deadlines, examples of results being the efficient schedules and identification of new routes (or paths) in transportation networks.
In computer science SP algorithms can be used in the automatic search of directions between physical locations (e.g., driving directions on web mapping websites like in the American free online web mapping service (MapQuest) and also in the desktop web mapping service (Google Maps)) . SP can also be used in applications like network management (e.g., finding the most vital node of a shortest path)  and graph structure in the web .
Thus, there is an explicit necessity of enriching the state of the art by developing new and efficient (i.e., extremely fast) shortest path problem (SPP) solver concepts. The efficiency in this context relates to some key performance metrics: (a) fast computing, (b) low memory consumption, (c) robustness, (d) accuracy, and (f) dynamic/runtime reconfigurability.
In the aforementioned applications, the shortest path detection problem is an important problem to be addressed. This problem has been studied extensively in the fields of computer science, operations research, and transportation engineering [4–9]. The well-known polynomial-time algorithms for solving shortest path problems include Bellman’s dynamic programming algorithm , Dijkstra’s algorithm , and Bellman-Ford successive algorithm [19, 20]. However, polynomial-time algorithms are suitable only for nonnegative costs of edges and for cases of additive linear path cost models. Thus, these algorithms cannot solve the SPP in the presence of negative costs  or in the presence of nonlinear/nonadditive path cost models . Further, polynomial algorithms may appear to be too time consuming (slow) specifically when dealing with real-time applications in real traffic networks. These algorithms can also fail to provide the correct solution in certain cases such as when multiple shortest paths of equal total cost but with differences in the number of hops (i.e., number of involved edges) exist in a given graph.
A dynamic algorithm was developed as an extension of polynomial Dijkstra’s and Bellman-Ford algorithms in order to analyze shortest path tree problems . Despite the fact that the dynamic algorithm can easily adapt to changes in the graph topology, it is however very time consuming and the convergence is an issue specifically for huge size problems . Further methods have been developed to address SP problems while assuming integer values of the costs of edges such as the scaling technique , the integer matrix multiplication technique , the fast integer sorting technique , and the component hierarchy technique . These methods are not valid for noninteger values of edge weights and therefore are not applicable to real-life or technical systems.
A first basic mathematical approach for SPP solving consists of modeling the shortest path detection problem into a linear programming constrained problem, which is further solved using Dantzig’s simplex method . This method is prone to failures and the accuracy is poor . Several interesting contributions for SPP solving presented in the related literature do involve selected artificial intelligence concepts. Genetic algorithm (GA)  and particle swarm optimization (PSO)  can solve both deterministic and stochastic shortest path problems. In both cases, the solutions converge towards the optimal paths. However, GA is computationally very expensive compared to PSO  while the PSO also needs additional heuristics. Likewise, evolutionary algorithms (EA) are prone to invalid paths detection . Heuristic search algorithms (e.g., )  are developed to make use of additional knowledge in order to reduce the search efforts; however, heuristic search algorithms depend on the quality of the heuristic function used. This dependency affects the accuracy of results. Further, when the search area increases, more computation effort is required . Hence, the solutions provided by heuristic algorithms are possibly less accurate and cost inefficient.
The concept of cellular automata (CA) has been intensively used to address SP problems in complex graphs [31–33]. Despite the potential significant improvement of the computing speed due to the parallel nature of the CA concept (i.e., CA is suitable for parallel computing), the reconfigurability is however an issue due to the strong dependence of the CA concept on initial conditions/states .
Another interesting SPP solver concept does involve artificial neural networks (ANN). The ANN SPP solver has been given tremendous attention due to its capability of performing parallel computing as well as its easy hardware implementability . However, the basic ANN approach is prone to limitations such as lack of adaptability to dynamic graph topological changes and poor accuracy of results [35–37]. The Hopfield neural networks were developed based on linear programming to provide approximate solutions (to SPP) faster than the aforementioned algorithmic solutions [35, 38–40]. Mehmet Ali and Kamoun  proposed a variation of Hopfield neural networks as a new concept that can adapt to external varying conditions. This method fails however to converge towards valid solutions. Further, the computing performance degrades with increasing magnitude and size of the graph. To address these last mentioned limitations, Park and Choi  proposed a concept that is capable of handling graphs with huge sizes. This last-named concept is however prone to convergence issues . The dependent variable Hopfield neural network (DVHNN)  was also proposed as a new neural network concept capable of addressing the inherent limitations of the previously mentioned classical concepts involving neural networks. Specifically, it was demonstrated that the DVHNN can efficiently tackle issues related to accuracy, convergence, and reliability when dealing with SPP . Despite these strong points of the DVHNN, the method cannot efficiently handle real-time SPP detection problems in reconfigurable graphs since new training or retraining (of the neural network) is needed for each new source-destination node-pair setting. Furthermore, the DVHNN does not consider negative cost of edges and the accuracy degrades with the increasing costs of edges (higher values of weight values). This is justified by the assumption in  which considered only small values of edge costs.
In view of the above underlined limitations which are inherent to most neural network based concepts, a new neural network approach (called dynamic neural network (DNN)) was introduced and some interesting related works do demonstrate the effectiveness and efficiency of the DNN approach (see  and the references therein). However, despite the very good features of the DNN for SPP solving, some crucial limitations are underscored in , which are specifically related to the convergence failure observed under specific parameter settings. Another interesting issue worth mentioning is the route to convergence. This issue, which is extremely sensitive to changes in both initial values and the parameter settings of the dynamic mathematical model, is characterized by the duration of the transient phase and the computing time needed to reach convergence.
The key/main objective of this paper is to contribute to the enrichment of the SPP related state of the art by proposing a new approach expressed in form of ordinary differential equations. These equations do in essence represent a novel NAOP SPP solver model. This new form of NAOP can/does efficiently overcome the limitations of the abovementioned DVHNN and DNN concepts in  and , respectively. For proof of concept and for stress-testing purposes of the novel NAOP SPP solver approach developed, extensive benchmarking is conducted whereby its performance is compared with that of the competing concepts presented in [41, 42]. The performance metrics involved in the benchmarking are the convergence under high values of edge costs, the convergence in the presence of negative edge-cost values, the convergence potential under various parameter settings, the transient phase cancellation/duration, and the necessary computing time until convergence.
The rest of the paper is organized as follows. Section 2 describes at an abstract level the new proposed NAOP SPP solver concept for dynamically reconfigurable network graphs. A synoptic representation of this concept is proposed and a description of the key parameters as well as the complete system model is presented. Section 3 presents the general methodology for finding shortest paths in graphs through NAOP. The BDMM application to the shortest path problem is then addressed. The resulting coupled ODE equations are derived. The comprehensive benchmarking of the novel NAOP SPP solver is presented in Section 4 whereby a series of selected examples published in  (i.e., DVHNN concept) and  (i.e., DNN concept) is systematically considered. SPP solving is carried out using the NAOP SPP simulator on the one hand and the concepts in [41, 42] on the other hand. Some graph scenarios addressed and published in [41, 42] are considered and systematic benchmarking of the new NAOP solver concept with the concepts in [41, 42] is performed. The performance metrics used for the diverse comparisons are hereby mainly the simulation duration until convergence (computational efficiency) and the robustness (convergence). Regarding reconfigurability, it is only valid for the novel NAOP based SPP solver; the other concepts cannot support it. The benchmarking results obtained are used to underscore both the effectiveness and the efficiency of the novel NAOP SPP solver concept in complex and dynamically reconfigurable network graphs while considering even negative as well as high values of edge costs. Lastly, Section 5 presents a series of concluding remarks. A summary of the core contributions of this paper is presented. Further, selected interesting open research questions (under investigation in some of our ongoing subsequent works) are listed in an outlook.
2. General Methodology Based on Nonlinear Adaptive Optimization (NAOP) for Modeling Shortest Path Problems (SPP)
In some of our recent contributions/papers (see [43, 44]), the NAOP concept has been successfully used for solving differential equations (ODEs and/or PDEs). We now want to demonstrate that the NAOP concept can be efficiently used as a general and robust framework for modeling shortest path problems (SPP) even in dynamically reconfigurable graphs.
This section provides a full description of the proposed NAOP concept for finding shortest paths in complex and dynamically reconfigurable graphs/networks. The abstract input/out logical diagram of this concept is schematically presented in Figure 1. The proposed concept does take three basic inputs as illustrated in Figure 1.
The first input is an external command (see Figure 1), which is used for assigning one of the three possible attributes (i.e., source, destination, or intermediate) to each of the nodes of the graph under investigation. The command is defined as follows: , whereby when the node with index is a source node, when the node with index is a destination node, and when the node with index is a candidate intermediate node on the shortest path (in fact, after origin and destination nodes are both fixed, all remaining nodes of the graph are “candidate” intermediate nodes on the final shortest path).
The second input (see Figure 1) expresses the topology of the graph under investigation. The incidence matrix denoted by is matrix, whose elements can only take three possible values from the set . In essence, the matrix expresses the states of connectivity/incidence of all edges to nodes (incidence matrix). A directed edge going onto a node is denoted in the matrix by while 1 denotes a directed edge leaving a node. The state of an edge, which is not connected to any node, is expressed by 0. In the matrix , each column corresponds to a given edge and each row does correspond to a given node. This is clearly illustrated in (10b).
The third input (see Figure 1) is used for assigning weight values to all (directed) edges of the graph. The vector is defined in terms of the elements as follows: . The coefficients represent the weight values (i.e., costs) of all edges of the graph.
The output in Figure 1 expresses the state of connectivity (i.e., belonging to the SP) of all edges. The states of edges are represented by the variables . A value means that the edge with index is part of the shortest path; otherwise, means it is not.
The concept developed here is general and scalable regarding graph size and magnitude and possibly bidirectional connectivity between nodes.
The NAOP concept in Figure 1 has already been successfully applied for solving both nonlinear ordinary differential equations (ODEs) and partial differential equations (PDEs) [43, 44]. The aim of this paper is to demonstrate that the NAOP concept (see Figure 1) can also be efficiently used as a universal framework to realize a robust SPP solver (called NAOP SPP simulator). Some aspects of the “universality” flavor of this NAOP SPP solver are expressed amongst others by the fact that this concept is applicable to various types of graphs/networks regardless of size, magnitude, and nature/form of the costs of edges (negative and/or positive values). And although it is not addressed in this paper (this issue will be extensively handled in a subsequent work), this concept is also applicable for cases where the path cost is nonadditive, that is, nonlinear (e.g., see a possible case where SPP is combined with network/graph reliability considerations (in such a case, every edge has 2 weights: a deterministic weight () related to the SPP problem and a probability value () related to the graph reliability perspective; then the path cost is , whereby is the sum of all belonging to the path and is the product of all belonging to the path) or a case where the contribution of every edge belonging to the total path cost related to the “driving energy consumption perspective” is the sum of (a) edge length and (b) square of the edge’s maximum (or current) driving speed).
During the past/last decades, tremendous attention has been devoted to the development of new methods, concepts, algorithms, and tools based on the neurocomputing paradigm (e.g., use of neural networks) for finding shortest paths in networks/graphs [1–9, 13, 17, 29, 30, 35, 36, 41, 42]. However, already published works are mainly just focusing on proof-of-concept examples and do not solve a series of fundamental issues related amongst other things to a systematic and general (or universal) methodological framework for efficiently solving shortest path problems in graphs of complex topologies. An open challenge is related to suggesting and validating a successful and systematic concept for controlling and mastering the following key performance metrics: accuracy, precision, robustness, flexibility, reconfigurability, and universality.
The key contribution of this paper is to develop a comprehensive NAOP SPP solver concept for finding shortest paths in complex and dynamically reconfigurable graphs which satisfactorily addresses and solves the abovementioned key challenges. The overall superiority of this concept is demonstrated in extensive benchmarking where the above-listed performance metrics are used.
The next section presents with full details the steps involved in the design of the NAOP SPP solver concept. The derivation of the related coupled ODE mathematical models is systematically performed.
3. General Methodology for Modeling the Shortest Path Problem through NAOP: Expression of the Lagrange Function and Derivation of the NAOP Mathematical Model
3.1. Transforming SPP Problem into an Optimization Problem
We now consider the derivation of the Lagrange function corresponding to the shortest path problem. We provide a description of the key steps involved in the mathematical modeling of the shortest path problem as a multivariable nonlinear optimization problem. Finally, we explain the process leading to the derivation of the set of coupled nonlinear ODEs, which correspond to the model of the NAOP SPP solver simulator.
The first step relates to the derivation of the corresponding Lagrange function (see (4)), which is viewed as the total energy of the system. In this context, the optimization problem is represented by an objective function (see (1)) subject to constraints ((2) and (3)). These constraints make the problem NP-hard. In order to solve NP-hard problems, the constrained optimization problem can be transformed into an unconstrained optimization problem by using relaxation methods. Several methods have been proposed by the relevant literature (see [9, 13, 17–22, 28, 29]). Some of these methods are direct substitution, constrained variation, and Lagrange multipliers , just to name a few.
The shortest path problem can be considered as a multivariable constrained optimization problem. This type of problem can be formulated mathematically by (1), (2), (3), and (4). In these equations, is the vector of multipliers variables and is the vector of multipliers variables . The decision variable is the state vector of all edges involved in the graph under consideration. Thus, the dimension of is equal to , where is the size of the graph. The function represents the total cost of the graph under investigation. The optimization constraints are formulated in order to ensure the connectivity of each node belonging to the shortest path, while the constraints impose the binarization of all components of the decision variable vector . These last-named constraints are formulated to ensure the convergence of all components of the vector to binary variables (0 or 1). Thus, according to the definition of the optimization constraints, the indexes , , and are defined as follows: ; ; . The integers and represent, respectively, the magnitude and the size of the graph. Hence,The second step exploits the concept of neuron dynamics as an optimization strategy that maps the optimization problem unto the energy of a neural network (e.g., a Hopfield network) in order to find the optimal solution. In this context, we represent this energy by the Lagrange form and the minimization of the Lagrange function leads to a stable state (due to the already proven convergence of the Basic Differential Multiplier Method (BDMM); see [38, 43, 44]).
Applying the BDMM to the Lagrange function in (4) leads to the derivation ofThe symbols , , and denote the partial derivatives with respect to , , and , respectively. As already mentioned above, the variables are the components of the vector of decision variables , while and are components of the vectors of multiplier variables and , respectively.
The set of (5a), (5b), and (5c) is the characteristic model of BDMM [38, 43, 44]. This model (from which the coefficients of the coupled ODEs for a given graph are calculated) reveals the coupling between the dynamics of decision variable with the dynamics of multiplier variables and . The parameters and are step sizes for updating decision variables and multiplier variables, respectively. The values of these parameters are chosen through a stability or convergence analysis (see Section 3.3). A similar study (i.e., stability analysis) is considered (in [43, 44]) and it is demonstrated that the sign of the damped mass matrix depends on the parameters and . Specifically, it is demonstrated (see [43, 44]) that the convergence is ensured for positive definite damped mass matrix. Thus, the values of and must be chosen accordingly. However, the convergence analysis in [43, 44] was considered in the design of a general and robust framework for ultrafast solving of ordinary and/or partial differential equations. Our aim in this paper is to extend the convergence analysis developed in [43, 44] to the case of the design of a novel, general, and robust SPP solver concept to which we have assigned the acronym of NAOP SPP solver. The output of the NAOP SPP solver is expressed by the solutions of (5a). are decision variables of the optimization process. These solutions, which reveal the state of edges connectivity, will converge to binary variables (0 or 1), 0 meaning that this edge does not belong to the SP and 1 meaning that it does.
3.2. Derivation of the Mathematical Model of the NAOP SPP Solver Simulator
We provide in this part a full explanation of the systematic concept leading to the ODE based mathematical modeling of the shortest path problem as well as the design and implementation of the appropriate NAOP SPP solver simulator. Our focus is on finding the shortest path from a source node () to a destination node (). The paths in the graph can be expressed by the objective function in (1), where the function in (6) represents the total cost corresponding to the graph under investigation. In (6), is the size of the graph. Hence,The minimization of the total cost in (6) corresponds to the shortest path calculation whereby this path is defined starting from a given source node () to a destination node ().
The next step of the modeling process is related to the formulation of the optimization problem’s constraints. Two key constraints are considered in order to fulfill some key requirements of the problem formulation. These requirements are summarized by the following conditions or constraints: (a) connectivity of nodes (by edges) and the participation of incident edges belonging to the path regarding, respectively, inflow nodal degree and outflow nodal degree and (b) binarization of the decision variables , which are used to express the belonging or not of any edge to the shortest path.
The first-mentioned constraint (a) is modeled mathematically by (7). In (7), is the state of an edge. The graph is assumed to be directed. In case of undirected graph, meaning bidirectionality between all connected node pairs, each direction is represented by a separate directed edge: For each node , on the corresponding line/row in matrix we have 3 groups of values which are reflected in (7): (i) a group of ingoing edges represented by values , (ii) a group of outgoing edges represented by values , and (iii) a group of nonincident edges (i.e., edges not connected to node ) represented by values . The parameter is used to assign one of the three attributes (i.e., source, destination, or intermediate) to node as follows: (if is an intermediate node); (if is a source node); (if is a destination node).
The previously mentioned constraint (b) is related to the binarization of all state variables, which are used to express the belonging to the shortest path (SP) of any edge. This constraint is modeled mathematically byThus, according to (4), the objective function in (6) can be combined with the constraints formulated in (7) and (8) to derive the Lagrange expression given inIn (9), , , and are vectors of, respectively, , , and components as already described above.
The BDMM application to (9) is done as follows: note that each decision variable and each multiplier variable are considered as a particular direction in a multidimensional space. In this context, the dimension of the multidimensional space is always equal to (i.e., decision variables , then multiplier variables of type , and multiplier variables of type ). Thus, the BDMM application according to (9) corresponds to making partial derivations with respect to each particular direction. In each of these partial derivations only the variable expressing that particular direction is considered “dependent variable” in (9) and all other variables (or directions) are taken/handled as constant variables/values in the partial derivation process. Each partial derivation with respect to a particular variable will result in a related ordinary differential equation. Thus, we will obtain a total of coupled nonlinear differential equations, which can be grouped and expressed in a matrix form as indicated in (10a) and (10b). These equations ((10a) and (10b)) constitute the NAOP SPP solver model used for shortest path finding. This set of (10a) and (10b) is a general mathematical model, which is applicable (for shortest path detection) to any graph regardless of its type (i.e., directed or undirected graph, etc.) and also regardless of both its magnitude and its size. Hence,In (10a), the matrix (graph’s incidence matrix) is expressed as follows:
The matrix is the incidence matrix of the graph and denotes the transpose of . This matrix is of dimension , where and are, respectively, the magnitude (number of nodes) and the size (number of edges) of the graph. The elements of , which are illustrated in (10b), express specific connectivity between nodes and edges. For example, the first column of shows the connection between nodes and node through the edge in the direction from to denoted as . Similarly, the other columns of express the following connectivity: column-2 , column- , and column-. Thus, the derivation of the incidence matrix is immediate to a given graph.
Overall, for any given graph, one should first give IDs to all nodes by numbering them in any order with natural numbers; the same is done for giving IDs to all edges (here also the numbering order is not important). After all nodes and edges have been given an ID, the matrix can be determined and it does express the current graph topology. It should be mentioned here that the use of the variables is twofold: the variables are used for the identification of the edges belonging to the graph. These variables are also solutions of (10a) and (10b). Thus, the variables do therefore depict the state of all edges of the graph. Specifically, the solutions reveal the edges involved in the shortest path, while stand for those edges, which are not included in the shortest path.
Important Remark. Consider the following: The application examples (see Section 4) demonstrate the straightforward application of the general concept in (10a) and (10b) to the cases of directed graphs with positive costs of edges, directed graphs with mix of positive and negative costs of edges, and so forth. All the graph examples envisaged in this paper are already considered in [41, 42]. Benchmarking of our results against the results published in [41, 42] is further considered. Besides robustness, the core metric for comparison is the number of iterations needed by each method (i.e., the methods in [41, 42] and our NAOP SPP solver concept) until the convergence to the exact shortest path is achieved. Several scenarios are envisaged in which the nodes are assigned different attributes (source, destination, or intermediate).
According to (9), the following important statements can be made to explicitly describe the set of (10a) and (10b) in order to allow easy use of this set of coupled equations as a general modeling framework, applicable to all graph architectures for shortest path detection. Hence, we have the following statements:(i) is the vector of decision variables with components . Thus, as already mentioned above, , where is the size of the graph.(ii) is a first vector of multiplier variables with components . Thus, as already mentioned above, , where is the magnitude of the graph.(iii) is a second vector of multiplier variables with components . Thus, as already mentioned above, , where is the size of the graph.(iv) is a vector of all edges’ weights with components . Thus, .(v) is the vector for assigning attributes to nodes, that, is nodes’ configuration vector. The components of are denoted by . Thus, as already mentioned above, , where is the magnitude of the graph.
Comment on (10a) and (10b). The set of (10a) and (10b) contains three key fundamental parameters which are the external commands , , and . The derivation (or calculation) of these key fundamental parameters is straightforward for a given graph architecture:(i)The matrix defines the topology of the graph under investigation. This matrix provides the connections between nodes, the identities of connectors (edges), and the valency of each node belonging to the graph.(ii)The vector offers the possibility of dynamically externally varying the costs of edges (weight values) . This is performed through an external command (see Figure 1).(iii)The vector offers the possibility of performing dynamic reconfigurability of the graph nodes using an external command (see Figure 1). This reconfigurability offers the possibility of defining each node- of the graph to be either source node () or destination node () or intermediate node (). The command is thus used for assigning a specific attribute to each node of the graph.(iv)In essence, the strong points of the general mathematical model in (10a) and (10b) when compared with the traditional methods, concepts, and algorithms for solving shortest path problems are fourfold:(a)The high flexibility of the set of (10a) and (10b), which can easily be adaptable to any new graph topology.(b)The universality of the set of (10a) and (10b), which is valid for all graphs regardless of the type, the magnitude, the size, and the topology.(c)The excellent dynamic reconfigurability through an external command without any need for retraining (contrarily to this, most traditional SPP solver methods do generally need new retraining or equivalent reprocessing in case of dynamic change of parts of the graph’s attributes). The reconfigurability potential is used to dynamically change not only edge weights but also “source-destination” node pairs through external commands without any need for retraining.(d)The robustness and correctness of the model in (10a) and (10b), which are very good, are underscored by the fact that the model always converges to the exact shortest path and, further, there is no faulty SP detection ever observed after very extensive stress testing of the concept.
If one uses the dynamic systems’ terminology, (10a) and (10b) do describe indeed a nonlinear oscillator system that is externally excited by 3 inputs vectors describing the graph under consideration (matrix , vector , and vector ) and its oscillations over time do converge to fix points (or equilibrium points) that represent the shortest path of the graph described by the 3 inputs. These inputs constitute the signature of any graph.
An important remark to be underscored here is that the set of (10a) and (10b) is expressed in terms of two specific parameters and , which are step sizes for updating decision variables and multiplier variables, respectively. The aim of the next subsection (see Section 3.3) is to demonstrate the possibility of efficiently using these two parameters as control parameters to ensure/guarantee that the set of (10a) and (10b) always converges to the exact shortest path whatever the topology, size, magnitude of the graph, and so forth could be. The derivation of a systematic framework to control the convergence properties of the NAOP SPP solver in (10a) and (10b) is of necessary importance in order to guarantee excellent scalability potential of (10a) and (10b) when investigating SPP in complex graphs of huge size and high magnitude. Thus, the analytical condition established below (see Section 3.3) will be considered in order to improve the robustness of the optimization process and facilitate fast convergence to the exact shortest path.
The convergence properties of the NAOP SPP simulator are addressed (or considered) in the next subsection and analytical criteria are derived under which the NAOP SPP simulator always converges to the exact shortest path.
It is worth mentioning that the relevant literature has clearly demonstrated that the convergence of the analysis methods/concepts for solving shortest path problems (SPP) based on traditional neural networks (ANN) (see  and the references therein) and the concepts based on recurrent neural networks (RNN) (see  and the references therein) is an important issue to which full attention should be devoted. However, the literature does not propose an analytical framework that could help to systematically predict and control the convergence properties of both ANN and RNN concepts under various possible changes/configurations (e.g., variations in graph topology, choice of source-destination pair-node, size, and magnitude.). This remark is important to justify the analytical concept developed in this paper in order to provide a systematic analytical concept that efficiently addresses the issue related to convergence.
3.3. Convergence of the NAOP SPP Solver Concept under All Possible Combinations or Settings of Parameters
The key important issue in this context is how to ensure convergence of the NAOP SPP solver concept for all possible parameter settings of (10a) and (10b). These parameters are , , , , and . However, since the first three parameters are derived according to a given graph under investigation, we will be deriving a general analytical framework for convergence regardless of these parameters (, , and ). Thus, the analytical expression derived as a general framework for convergence will be controlled by two specific and fundamental parameters which are the step size for updating decision neurons and the step size for updating multiplier neurons .
The convergence problem was already addressed in our previous contributions (see [43, 44]) when dealing with the design of a universal framework based on NAOP for solving ordinary differential equations (ODEs) and partial differential equations (PDEs). Our aim here is to provide the full/complete steps of the analytical process for ensuring convergence of the NAOP concept for shortest path detection. Further, we provide (see the Appendix) an in-depth analytical study of the convergence properties of the NAOP SPP solver concept. Such in-depth development was not considered in our previous papers ([43, 44]). Further, another motivation of carrying out an in-depth analytical study to ensure convergence of the NAOP framework for solving shortest path problems (SPP) is justified by our wish to demonstrate the possibility of controlling the convergence properties of the NAOP based solver for SPP through specific and well-known parameter settings. This will offer the possibility of using NAOP as a general and robust framework for the investigation of shortest path problems (SPP).
Regarding the convergence, we are providing here a practical/physical comment of the analytical results we want to achieve/derive. Indeed, the stability (or convergence) of the optimization process is ensured by the “sign flip” concept [43, 44]. This concept is illustrated in Figure 2. The “sign flip” concept can be explained as follows. As the parameters and vary, the system in (5a), (5b), and (5c) (or equivalently in (10a) and (10b)) exhibits damped oscillations around the constraints subspace defined by the relation . The damped oscillations exhibited around the constraints subspace are characterized by the alternation (in time domain) of the sign of the constraint functions (i.e., , , and vice versa) during the optimization process in order to converge to a point located on the constraint subspace defined by the equation (see Figure 2). The damped mass matrix is used to provide an insight of the abovementioned convergence. Indeed, the damped mass matrix controls the energy dissipation within the system. Thus, according to (15) a positive value of the damped mass matrix is an insight that the total energy within the system decreases with time and, finally, the system settles down into the state where the energy is minimized. This characterizes attraction of the system state to the constraint subspace (stability). At the attraction point (or convergence state (see Figure 2)), all constraints are fulfilled. However, when the damped mass matrix is negative, the total energy within the system increases with time. A negative damped mass matrix is an insight that all constraints are not fulfilled (instability). To tackle this problem, the learning rate parameters ( and ) can be tuned/monitored (independently of the other system’s parameters , , and ) in order to bring the system model in (5a), (5b), and (5c) (or equivalently in (10a) and (10b)) into its stable state (i.e., convergence).
A full/complete detail of the analytical steps involved in the stability/convergence analysis of (10a) and (10b) is provided in the Appendix. The results provided in the Appendix can be summarized by the following important steps.
Step 1. The first step is the derivation of the characteristic ODE model for decision variables . The Lagrange function in (4) is combined with (5a), (5b), and (5c) to derive the following nonlinear ordinary differential equation (see analytical details in the Appendix):The expression in (11a) is a second-order ordinary differential equation with a single and dissipative coefficient denoted by . This coefficient represents the damped mass matrix defined as follows:In (11a), is an internal force producing the potential energy into the system. This force is expressed as follows:In (11a), (11b), and (11c), represents the force of inertia, is the damping matrix, and is an internal force, which creates the potential energy (i.e., the internal energy) of the system. This force is needed in order to fulfill the constraints (by moving the attractor from a given initial point chosen in the constraint subspace to a final point located on the characteristic line ).
Step 2. The second step is the derivation of the mathematical expression of the total energy of the system. This expression is further used in Step 3 for the global stability analysis. The Lyapunov function is derived as the total energy of the system . The total energy is expressed as the sum of kinetic and potential energies denoted by and , respectively. These energies are expressed by the following mathematical expressions while considering the state variable in (11a), (11b), and (11c):Thus, the total energy is expressed as follows: Substituting the expression of defined in (11c) into expression (13a) leads to the following expression of the total energy (see analytical details in the Appendix):The total energy in (13b) is positive definite for all when and are of the same sign. Thus, according to the first criterion of the Lyapunov theorem, the total energy in (13b) (i.e., the Lyapunov function) is positive definite for all state variables . Hence, the first criterion of the Lyapunov theorem for global stability is fulfilled (i.e., for all ).
Step 3. The third step is the derivation of the second criterion of the Lyapunov theorem for global stability. This criterion states that the first derivative of the Lyapunov function must be less than or equal to zero for all state variables . Thus, the aim of this step is to derive the analytical condition under which the first derivative of the Lyapunov function (i.e., the total energy) must be less than or equal to zero (i.e., for all ). Using the mathematical expression in (13b), the first derivative of the total energy can be evaluated and expressed as follows:Using the expression in (14), an in-depth analytical development is conducted (see full details in the Appendix) and finally the first derivative of the total energy is expressed into the following simplified form:The expression in (15) is the characteristic mathematical expression used to illustrate and underscore the significance of the “sign flip.” Indeed as already mentioned, the Lyapunov theorem for global stability analysis states that the system is stable if the first derivative of the total energy (i.e., the Lyapunov function) in (15) is less than or equal to zero. In (15), and are, respectively, the time derivative of decisions variables in the directions (row of the matrix ) and (column of the matrix ). Thus, and are orthogonal and therefore is the transpose of denoted by .
According to the well-known theorem of linear algebra (regarding the definition of the sign of a matrix), the damped mass matrix is positive definite if and only if for all ( is the transpose of ). Thus, since the matrix is expressed in terms of and , these parameters are dynamically monitored (as control parameters) during the optimization process in order to achieve a positive definite damped mass matrix . A positive definite leads to according to (15). Thus, according to the Lyapunov theorem (for global stability), the mathematical model in (10a) and (10b) is stable and, consequently, the NAOP SPP solver platform for shortest path detection always converges to the exact solution (i.e., the desired shortest path) during the optimization process.
Step 4. The fourth step is convergence monitoring and control. During our various numerical simulations, the condition in (15) is being coded (algorithmically) into a feedback loop in order to dynamically monitor and control the convergence properties of the NAOP platform for shortest path detection as illustrated in the flowchart presented in Figure 3. The values of the parameters and are evaluated in order to guarantee that the condition is fulfilled for all and . When this condition is observed/realized, the NAOP optimization platform always converges to the exact solution (revealing the exact/desired shortest path).
Comment on the Convergence Analysis. The advantage of considering the convergence/stability analysis is justified by the fact that a fundamental analytical relationship is derived under which straightforward convergence to the exact solution (i.e., exact shortest path) of the NAOP model (in (10a) and (10b)) for solving shortest path problems (SPP) is always observed regardless of the following key metrics: (a) the graph structure (directed, undirected, and/or mix of directed and undirected edges), (b) the graph size, (c) the costs of edges (high or small costs values), (d) the sign of costs (negative, positive, and/or mix of positive and negative costs), and (e) the magnitude of the graph. This latter metric is related to the scalability issue. This issue can be systematically tackled using the key analytical relationship or expression in (15). This expression can be appropriately used to ensure fast convergence even in case of graphs with huge amount of nodes. Specifically, it is demonstrated that the parameters and can be monitored (or varied) systematically and that a judicious choice of these parameters ensures fast convergence to the desired/exact solution (i.e., exact shortest path).
3.4. Flowchart of the NAOP Concept for Solving Shortest Path Problems (SPP) in Dynamically Reconfigurable Graphs
The aim of this subsection is to provide a full understanding of the functioning principle of the NAOP simulator for shortest path detection. The flowchart of the functioning principle of the NAOP simulator is proposed in Figure 3. The NAOP simulator is modeled by (10a), (10b), and (15). The known parameters , , and are used as coefficients of (10a) and (10b). These parameters are obtained from the graph under investigation. The parameters and are dynamically monitored in order to ensure the condition of convergence expressed by for all state variables . This condition is controlled through a feedback loop defined by (15). Thus, expression (15) is essential to ensure convergence of the NAOP simulator to the exact shortest path.
Thus, for a given graph under investigation the art consists of collecting the input parameters , , and which are further used by the NAOP simulator modelled by (10a), (10b), and (15). Mathematical expressions (10a), (10b), and (15) constitute a coupled mathematical model of the NAOP simulator. The coupled model is applicable to all types of graphs independently of their topology/structure, size, magnitude, and so forth.
For any given new graph, the corresponding parameters , , and are determined. Further, according to the new values of , , and , an update of the control parameters and may be performed if necessary (according to the flowchart in Figure 3). The update of the control parameters and allows fast convergence of the NAOP SPP solver concept to the exact shortest path. This interesting result is of necessary importance when considering graph scalability analysis since the control parameters and could be used to guarantee fast convergence to the exact shortest path, specifically when investigating shortest path problems (SPP) in graphs of huge magnitude and/or size.
During the optimization process (see Figures 2 and 3), the system modelled by (10a), (10b), and (15) converges to an equilibrium point (i.e., fixed point). At equilibrium point (i.e., final state (see Figure 2)), the state variables denoted by are solutions of (10a) and (10b). These solutions reveal the edges belonging to the shortest path and those that do not belong to the shortest path (see Output of NAOP in Figure 3). The initial conditions , , and (see Figure 3) are randomly selected. Interesting to be underscored here is that the values of initial conditions do not affect the convergence properties (robustness) or the exact shortest path detection (accuracy). This conclusion underscores the robustness of the NAOP SPP solver concept developed.
The next section will be concerned with a demonstration of the straightforward application of the NAOP SPP simulator model to several graph examples. Benchmarking of the NAOP SPP solver concept developed in this paper is further conducted, leading to a comparison of the results obtained (using NAOP) with the results provided by two well-known and proven efficient concepts for SP detection selected from the literature. These last-named concepts are the dynamic neural network concept developed in  (benchmarking 1) and the dependent variable Hopfield neural network developed in  (benchmarking 2). Examples of graphs published in [41, 42] are considered, specifically (a) graphs with unidirectional edges, (b) graphs with positive edges, (c) graphs with mixed positive and negative costs of edges, and so forth. All these examples have been considered either in  or in  and are selected here once again for benchmarking purposes.
3.5. General Algorithm for Determining the Fundamental Parameters (, , and ) of the Original Graph
This subsection proposes the algorithm (in the form of Matlab subcodes) for solving shortest path problems. The proposed algorithm is developed according to the analytical concept developed, which has led to the derivation of the general mathematical modelling framework in (10a) and (10b). The three steps below summarize the general procedure when dealing with a given graph:(i)The first step consists of determining the fundamental parameters , , and of a given graph under investigation. These parameters are the parameter settings of the mathematical model of the shortest path solver (see (10a) and (10b)).(ii)The second step consists of inserting , , and in the solver given in (10a) and (10b).The algorithm can be depicted in the following pseudocode format:
BEGIN(1)Identify the original graph under investigation:(a)Extract/read the incidence matrix () of the graph.(b)Choose the source and destination nodes ().(c)Assign values to edges ().(2)Use the parameters , , and as coefficients of the NAOP model in (10a) and (10b):(a)Define the size of .(b)Define the dimension of .(c)Define the dimension of .(3)Solve (10a) and (10b) under the convergence condition and read the shortest path (SP) as solution of (10a) and (10b):(a)Define (10a) and (10b) as a function.(b)Define the order of (10a) and (10b).(c)Read , , and as coefficients of (10a) and (10b).(d)Define the state variables of (10a) and (10b).(e)Define initial conditions on the state variables of (10a) and (10b).(f)Define a range (or a window) in which the control parameters and are dynamically monitored.(g)Evaluate the condition for convergence of NAOP:(i)Declare the variables , , and .(ii)Write a loop to evaluate the value of :(A)If , read the solutions of (10a) and (10b).(B)If , tune/monitor , dynamically until and read the solutions .(h)Read/depict the SP through solutions of (10a) and (10b): (A)If , select the corresponding edges as belonging to the SP.(B)If , ignore the corresponding edges.(C)Display the SP solution involving all edges belonging to the SP (as depicted above).END
4. Application of the NAOP Spp Solver Concept to Shortest Path Detection: Benchmarking with Cases of Selected Graph Examples in [41, 42]
4.1. Simulation Results and Benchmarking with 
Reference  presents a concept based on dynamic neural networks (DNN) for solving shortest path problems in graphs with various weight values or costs of edges (e.g., positive, negative, and mix of positive and negative costs). It is demonstrated that the DNN concept undergoes a transient behavior before converging to the exact shortest path. It is also demonstrated in  that the concept developed (i.e., the DNN) does not converge under some specific parameter settings (e.g., initial condition). Our aim in this subsection is to demonstrate that the NAOP SPP developed in this paper can overcome the limitations of the concept developed in . Specifically, we demonstrate the possibility of shortening or cancelling the transient phase duration using NAOP; we also demonstrate that the NAOP concept converges when considering the case in which the concept in  fails to converge. These two achievements/results can be considered to underscore the efficiency of the NAOP SPP solver concept developed in this paper.
The simulation is performed under the same parameter settings as those in . These parameters are (a) graph topology, (b) source-destination pairs, (c) costs of edges, and (d) initial values for the respective ODE models. Our aim here is to demonstrate the effectiveness and efficiency of the NAOP based SPP solver concept especially in light of a direct comparison with the concept presented in . The effectiveness is demonstrated by the capability of always robustly detecting the exact and truly best shortest path in each of the application examples considered in . Further, the efficiency is expressed by the extreme fast convergence of the NAOP SPP solver concept to the exact shortest path when compared with the results in . The metrics for comparison (benchmarking) considered here are the robustness and the simulation time, this last correlating with the number of iterations required to achieve convergence.
4.1.1. Application Example 1 (Taken from )
In this application example, the graph considered in “example ” from  presents the case of a directed graph of magnitude 4 and size 5 (see Figure 4). Edges in Figure 4 are made up of a mix of positive and negative weight values (costs).
Case 1 (convergence behavior with respect to both concepts). In this case, the initial values are chosen according to . These values are . The results obtained by the concept in  and the results obtained by the NAOP SPP solver simulator are shown in Figures 5 and 6, respectively, for the graph topology in Figure 4.
The results in Figure 5 (this figure is provided by ) show the temporal evolution of the state variables in the simulation window . These state variables undergo a transient behavior before converging to their equilibrium point “0” or “1.” In Figure 5(a), the state variables , , and converge to “1.” This convergence to “1” expresses the belonging of the edges , , and to the shortest path, whereas in Figure 5(b) the state variables and converge to “0” which explains the notion that the edges and do not belong to shortest path.
Figure 6 presents the results obtained by our NAOP SPP solver concept. These results confirm those obtained in Figure 5 (i.e., and ).
Hence, both concepts detect the exact shortest path. However, the significant difference between both concepts is the duration of the transient phase within the simulation window . This metric is considered to deduce the time needed by each concept to achieve the convergence of all state variables. In Figure 5 (this figure is provided by ), the transient phase is observed in the simulation window ; hence, the convergence of state variables to binary variables “0” and “1” starts at , whereas in Figure 6 the results provided by the NAOP SPP solver show that the state variables undergo a very brief transient phase before converging to binary variables at .
Thus, this first benchmarking example clearly underscores (see Table 1) the fast convergence of the NAOP SPP solver concept proposed in this paper.
Case 2 (divergent behavior with respect to ). In a second example case, the initial values are taken as as mentioned in  for the application example given here in Figure 4. It is clearly reported in  that under these initial values the DNN concept developed in  does not converge. The aim of this comparison is to show that, under the same initial values, the NAOP concept converges to the exact shortest path. The results from  and the results from solving (10a) and (10b) (i.e., the mathematical model of NAOP) are shown, respectively, in Figures 7 and 8.
The results in Figure 7 (this figure is provided by ) show that the concept in  does not converge to binary values “0” and “1” for the considered initial values . In contrast, the results provided by the NAOP SPP solver concept (see Figure 8) clearly show the robust convergence behavior for the same initial values. This conclusion also underscores the robustness of the NAOP based concept to changes in initial values (see conclusion in Table 2).
4.1.2. Application Example 2 (Taken from )
Our focus on this application example is justified by (a) the mix of positive and negative costs of edges, (b) the mix of unidirectional and bidirectional connections between nodes, and (c) the direct connections between all nodes. In this application example, the graph considered is the one given in Example of ; it presents the case of a directed graph of magnitude 4 and size 7; see Figure 9.
Table 3 just summarizes what is expressed in Figures 10 and 11. It is evident and confirmed once again that the NAOP based SPP solver is very robust and very much faster than the DNN concept developed in .
4.2. Simulation Results and Benchmarking with 
The Dependent Variables Hopfield Neural Network (DVHNN) is considered here because it is one of the leading traditional neural network (NN) concepts, which have been applied for solving shortest path problems (SPP). This promising concept for shortest path detection is extensively developed in . However, some inherent limitations of this concept are pointed out by the authors of  such as (a) the lack of reconfigurability (this is explained by the fact that when changes occur in the graph topology (e.g., change of source or destination nodes, suppression of an edge belonging to the graph, and suppression of a node), new training (i.e., retraining) is needed in order to derive the new parameters of the DVHNN corresponding to the changes observed); (b) the accuracy of the DVHNN which deteriorates when the weights values (or costs of edges) become very high ; (c) the DVHNN which does not converge in case several paths (at least two paths) with identical total cost equal to the minimum cost (i.e., several equivalent shortest paths possibilities) exist in the graph for a given source-destination nodes pair; (d) lack of a systematic method (e.g., an analytical framework) that can be used to predict and control the convergence properties of the DVHNN concept through variation (or monitoring) of its two fundamental parameters (this parameter is used in  for controlling convergence of the DVHNN) and (this parameter is used in  for controlling accuracy, i.e., the solution quality provided by the DVHNN). The authors of  have further pointed out that the development of a systematic concept (e.g., analytical concept) that could be used to efficiently predict and control both convergence and accuracy of the DVHNN is of high importance. The current addressing of both convergence and accuracy of the DVHNN concept in  is essentially based on a trial and error process (i.e., a guess and check process). This justifies the importance of developing a systematic concept or framework for addressing this underscored control and monitoring need of both convergence and accuracy. (e) Using the DVHNN, a tradeoff exists between convergence time and solution quality . It is demonstrated in  that an improvement of the convergence time degrades the accuracy of the DVHNN concept and vice versa. The dependence between convergence and accuracy is addressed through monitoring of the ratio . However, even this monitoring of the ratio is also based on a trial and error process (see ).
Our aim in this subsection is to demonstrate that the NAOP concept developed can solve the above unsolved problems by the DVHNN. Specifically, we want to demonstrate that the NAOP SPP concept developed does not need retraining in case the abovementioned changes could occur in the graph topology. We also want to demonstrate that the accuracy of the NAOP SPP concept is not affected by the values of the costs assigned to edges (e.g., high or small costs of edges). Finally, we want to propose a systematic analytical concept that can be efficiently used to predict and control convergence of the NAOP SPP simulator to the exact (or desired) shortest path independently of the graph topology, size, magnitude, and so forth. Thus, the concept developed in this work could be viewed as a concrete prototype (i.e., model or framework) to overcome some limitations encountered by the DVHNN concept in .
Benchmarking is further considered between the concept developed in this work (i.e., NAOP SPP) and the concept developed in  (i.e., DVHNN). The performance metrics used for benchmarking are the computing performance (i.e., simulation time) and the robustness (i.e., convergence and correctness). The benchmarking is conducted by considering the graph example in Figure 12, which is considered in . According to the theory developed in Section 3, the parameters , , and of (10a) and (10b) corresponding to the graph topology in Figure 12 are defined as follows:
The fundamental parameters , , and of the graph in Figure 12 are expressed in a general form, which allows the possibility of dynamically changing the source-destination pair and also the costs of edges. This shows the good reconfigurability potential of the NAOP SPP solver since the parameters , , and are used as external commands (i.e., inputs) to the NAOP SPP concept.
The core concept of the DVHNN for finding SP is based on the derivation of Kirchhoff’s constraints. These constraints express the dependent variables/neurons in terms of independent variables/neurons. Reference  proposes an algorithm for the selection of dependent variables. However, simple selection of the dependent variables can be performed by choosing spanning walk/path from source () to destination (). This is walk involving all nodes of the graph. The state variables involved in the spanning walk/path are dependent variables/neurons while the other variables are independent neurons. Full details of the DVHNN concept are provided in .
It is important to mention that, according to the DVHNN concept, the dependent and independent variables are defined for a specific choice of source-destination pair (lack of reconfigurability). Due to the nonreconfigurability of the DVHNN concept, we have performed for each of the scenarios (remark: each scenario is defined by a specific source-destination pair) envisaged in Table 4 additional specific training of the DVHNN in order to determine the suitable system’s parameters. It is important to notice that the simulation times in Table 4 (for the DVHNN) correspond only to the duration of the usage phases (training not considered). The duration of the training phase of DVHNN (for each of the scenarios listed in Table 4) was not constant; however, it was always greater than 5 seconds.
|In this paper, the concepts have been all implemented in Matlab on a standard PC.|
The results (in Table 4) show that the NAOP concept developed in this work leads to a better computing performance than the DVHNN concept (for both small and large values of the costs of edges). Further, the computing performance of the DVHNN degrades (i.e., becomes the worst) with increasing costs of edges. When the costs of edges are very high (e.g., greater than thousand (1000)), the DVHNN cannot converge. On the other hand, the accuracy of the NAOP SPP concept is preserved even for very high costs of edges.
Regarding the test for robustness, we have considered various critical and challenging cases where several paths (from source to destination) are depicted with the same minimum cost (i.e., paths with equivalent/identical costs equal to the minimum cost). As it appears in Table 4, the DVHNN cannot efficiently analyze some of the various scenarios envisaged. Further, the robustness of the DVHNN degrades with increasing values of the costs of edges. This is observed (in Table 4) by cases of nonconvergence (lack of convergence) of the VDHNN for shortest path finding. However, the results in Table 4 reveal the good robustness of the NAOP SPP simulator for shortest path finding. This robustness is characterized by successful and straightforward convergence to a routing path (i.e., a correct path, which corresponds to the path having the minimum cost) for the sample scenarios envisaged.
In Table 5, we do summarize the essential comparison between DVHNN and NAOP SPP concept while using the most relevant criteria. It is clear from the table that the NAOP SPP concept can overcome some limitations of the DVHNN.
|In this paper, the concepts have been all implemented in Matlab on a standard PC.|
The work presented in this paper has developed and validated a general theoretical concept based on nonlinear adaptive optimization (NAOP) for the efficient and robust solving of SPP problem cases in reconfigurable network graphs. The developed concept has been demonstrated as being capable of efficiently handling shortest path problems in both directed and undirected graphs, even in the case of large values of the edge costs. Further, it has been demonstrated that the developed concept can efficiently detect SP even in the particular cases where there exist several paths (SP) with identical minimum total cost. This result does significantly contribute to the enrichment of the relevant state of the art regarding shortest path problems since the traditional SP concepts and algorithms (e.g., the traditional neural network architectures, PSO, GA, etc.) cannot efficiently tackle such complexity or always ensure convergence . The efficiency in this context is related to core performance criteria such as computing speed, robustness, and convergence. One of the most important conclusions to be underscored in this paper is that the NAOP based SPP solver concept developed can also be used, after appropriate adaptations, to efficiently model and handle further graph theoretical problems (e.g., traveler salesman problem, minimum spanning tree problem, and vehicle routing problem). The latter cited problems are being addressed in some subsequent works.
As proof of concept, extensive benchmarking has been carried out. The performance of the NAOP SPP concept developed has been sufficiently compared with the performances of two concepts taken from the literature, namely, the DVHNN in  and the DNN in . The outcome of the benchmarking has confirmed the effectiveness of the novel NAOP based concept developed to efficiently handle (or tackle) SP problems.
A series of ongoing works under consideration do relate amongst others to (a) the extension of the concept developed (in this work) to the case where the cost of a path is a nonlinear function of the arc weights (e.g., for cases where the driving energy on a path must be minimized) and (b) the extension of the concept developed (in this work) to the case of stochastic graphs. These graphs are characterized by arc weights that are stochastic processes. Thus, extending the concept developed in this work to both nonlinear and stochastic graphs models is possible. Regarding the latter ones, this can be achieved by considering the key parameters, that is, the th moments of the distributions characterizing the stochastic dynamics of each edge of the network/graph. The extension of the concept developed in this work to stochastic graphs will lead to the mathematical modeling of various stochastics-prone systems, for example, of sensor data with straightforward interesting applications in areas like intelligent transportation systems and many others.
Furthermore, a series of extremely interesting problem cases (for practical applications) for which the core concept of this paper will be extended in future works are (a) network reliability problems in case of both dependent and independent edges; (b) optimal flow problems under multiple source-destination scenarios; (c) optimal flow problems under multiple source-destination scenarios and under stochastic arc’s weights conditions; (d) traveler salesman problem (TSP); (e) TSP with stochastic edge’s weights; (f) resource allocation problems under stochastic conditions; (g) vehicle routing problems (VRP) under stochastic conditions; (h) scheduling problems under stochastic conditions, and so forth. All these problems are extremely challenging and do face a huge computational complexity that can be significantly alleviated by the involvement of the novel concept developed in this work.
This appendix provides full detail of the stability analysis carried out in order to guarantee fast convergence (to the exact solution) of the NAOP platform developed in (10a) and (10b) for solving shortest path problems (SPP). The analysis is systematically conducted and this led to the derivation of analytical conditions (expressed by a formula; see (A.38)) under which the NAOP system model in (10a) and (10b) always provides converging solutions. As already mentioned, these solutions, for example, the decision variables (or solutions ), express the exact/desired shortest path.
The attention devoted to the convergence/stability analysis is justified by the fact that a fundamental analytical relationship is derived under which straightforward convergence to the exact solution (i.e., exact shortest path) of the NAOP model (in (10a) and (10b)) for solving shortest path problems (SPP) is always observed regardless of the following key metrics: (a) the graph structure (directed, undirected, and/or mix of directed and undirected edges), (b) the graph size, (c) the costs of edges (high or small costs values), (d) the sign of costs (negative, positive, and/or mix of positive and negative costs), and (e) the magnitude of the graph. This latter metric is related to the scalability issue. This issue is systematically addressed here and a key analytical relationship is established analytically to ensure fast convergence even in case of graphs with huge amount of nodes. Specifically it is demonstrated that the parameters and can be monitored (or varied) systematically and that a judicious choice of these parameters ensures fast convergence to the desired/exact solution (i.e., shortest path).
The functions , , and are expressed in terms of . Therefore, the total differentiation of these functions is expressed as follows:Equations (A.1) can be written in the following forms:where the single overdot denotes the first derivative. Thus, (A.2) can be expressed in the following compact form:The Lagrange function in (4) can be substituted into (5a) to obtain the following expression:Applying the first derivative to (A.6) leads toSimilarly, the Lagrange function in (4) can be substituted into (5b) to obtain the following expression of the first derivative of :An important remark is that can be written in the following expanded form:and, finally,Further, since , , and do not depend on .
Substituting (A.10) and (A.11) into (A.8) leads to the following expression:Similar to the process leading to (A.12), it can be shown thatSubstituting (A.3), (A.4), (A.5), (A.12), and (A.13) into (A.7) leads to the following mathematical expression:Expanding the summation on the indexes and in (A.14) leads toEquation (A.15) can be rewritten as follows by permuting the summations on the indexes and and also and :Expression (A.16) can be transformed as follows: By grouping in (A.17) the expressions involving summations on the index , the following new expression is obtained:Applying the partial derivative (in the direction ) to the products terms in (A.18) leads to the following expression:Further, since the variables and are pairwise independent, the following expression can be derived:Substituting (A.20) into (A.19) leads to the following simplified expression:Expression (A.21) can be written in the following simplified form:Therefore, grouping the terms of (A.22) leads to the following fundamental expression describing the temporal evolution of the decision variables (solutions of (10a) and (10b)) and outputs of the NAOP simulator in Figures 1 and 3:Equation (A.23a) is a second-order ordinary differential equation with a single and dissipative coefficient denoted by . This coefficient represents the damped mass matrix defined according to (A.22) as follows:In (A.23a) is an internal force producing the potential energy into the system. This force is expressed according to (A.22) as follows:The total energy () of the system is expressed as the sum of kinetic energy () and potential energy () by the following expression: According to (A.23a), (A.23b), and (A.23c), is the state variable. Therefore, the energies (i.e., and ) are expressed in terms of the state variable (solution of (A.23a), (A.23b), and (A.23c)) as follows:Substituting (A.25) into (A.24) leads to the following expression of the total energy:Thus, substituting (A.23c) into (A.26) leads to the following expression of the total energy:Equation (A.27) can be written in the following simplified form:In (A.28), the integral operator can be permuted with the summation operator to obtain the following expression:Finally, an evaluation of the integral term in (A.29) leads to the following expression:Using (A.30), the first derivative of the total energy is expressed as follows:Using (A.4) and (A.5), the terms and are expressed in the direction as follows:Substituting (A.32) into (A.31) leads to the following expression of the first derivative of the total energy:The summation on the index can be permuted with the summations on the indexes and , respectively. This algebraic manipulation leads to the following expression: