Research Article  Open Access
A New Cooperative Dual-Level Game Approach for Operator-Controlled Multihop D2D Communications
Abstract
With the development of wireless communications and the increasing intelligence of mobile devices, device-to-device (D2D) communications are considered a standard part of future 5G networks. This new paradigm can provide better user experiences while improving system performance measures such as network throughput, latency, fairness, and energy efficiency. In this study, we investigate a new dual-level D2D communication scheme consisting of multiple D2D operators and a group of mobile devices. To model the interaction among D2D operators and devices, we adopt two cooperative game approaches based on the incentive mechanism design and the r-egalitarian Shapley value. At the upper level, routing paths and incentive payments for multihop relay services are decided using the incentive mechanism. At the lower level, mobile devices share the given incentive based on the r-egalitarian Shapley value. The control procedures at both levels depend on each other through proper coordination and collaboration. Owing to the main features of the two cooperative game models, the proposed scheme obtains various benefits in a fair-efficient way. Simulation results verify the superiority of our proposed scheme compared with the existing protocols. Finally, we propose further challenges and future opportunities in the research area of operator-controlled multihop D2D communications.
1. Introduction
The recent widespread use of intelligent mobile devices with smart applications has led to an explosive growth in mobile data traffic. This dramatic traffic growth places a massive burden on wireless networks. To provide a higher peak data rate and a better network capacity, the development of the fifth generation (5G) mobile communications technology, far beyond the current 4G systems, is necessary. By 2023, more than 20 percent of mobile data traffic worldwide is expected to be carried by 5G networks. This is 1.5 times more than the total 4G/3G/2G traffic today. This remarkable growth of network traffic will stimulate and promote research on novel cooperative communication techniques, mainly due to the emerging need for connecting mobile devices in a ubiquitous manner [1].
Recently, device-to-device (D2D) communication has become a potential candidate technology to handle the 5G network capacity and coverage problems. Most D2D communication studies have focused on the case of natural disasters, when the infrastructure-based network services are either partially or totally unavailable. Usually, the D2D technology extends the traditional wireless communication system. It enables two mobile devices that are geographically close to each other to directly establish a wireless link without any backbone network structure. However, single-hop D2D communications are usually limited to a specific geographic area. Therefore, the advantages of D2D communication can be fully realized only in the multihop communication scenario. Nowadays, multihop D2D service operated in an ad hoc manner is a new communication model for the 5G network technology [2, 3].
Commonly, multihop D2D communications refer to direct data exchanges among multiple devices without the involvement of wireless operators (WOs). Therefore, the conventional multihop D2D approach cannot efficiently provide data traffic services that ensure quality of service (QoS). Nowadays, there is a new trend towards an operator-controlled D2D communication paradigm that maximizes the profit of the system manager and provides a better QoS experience for devices. In this new paradigm, WOs should support the D2D communication function in their networks. Specifically, each WO is responsible for monitoring the devices in its coverage area and establishes routing paths in concert with other WOs. To induce selfish devices to participate in data relay transmissions in multihop D2D communications, WOs should provide adaptive incentives to the corresponding devices [4, 5].
In the operator-controlled D2D communication system, individual WOs need to cooperate with each other to improve the total system performance. In addition, mobile devices covered by a specific WO also need to reach a collaborative agreement on the incentive distribution. Because both sides seek mutual advantages, this interrelated operator-device cooperation can be modeled as a hierarchical cooperation problem. To solve this problem, the major questions to be answered are (i) what is the adaptable incentive payment to perform the multihop D2D communications and (ii) how to effectively distribute the given incentive payment to relaying devices. For these two issues, two different strategies are necessary [4, 6].
Over the past decade, various noncooperative and cooperative game models have been extensively applied to analyze the interactive decision making of network agents. However, traditional noncooperative games suffer from many shortcomings, which render them inadequate for operator-controlled D2D communications. In particular, the major arguments against using noncooperative game models include, but are not limited to, (i) the immense overhead caused by information acquisition, (ii) the slow convergence to equilibrium, (iii) the inefficiency of equilibrium in terms of social welfare, and (iv) the theoretical complexity of characterizing the equilibrium set [7, 8]. In contrast to noncooperative games, cooperative game models fit the characteristics of multihop D2D systems more appropriately. To this end, the cooperative game approach is chosen to design a novel operator-controlled D2D communication scheme.
In this paper, we adopt two different cooperative game concepts to solve the hierarchical cooperation problem for operator-controlled D2D communications. By considering the mutually interactive relationship between WOs and mobile devices, a new dual-level game model is formulated based on two cooperative game solutions, i.e., incentive mechanism design (IMD) and the r-egalitarian Shapley value. In the upper-level process, the IMD is used to decide the incentive payment for the WOs that cover the routing path of multihop D2D communications. In the lower-level process, each WO distributes the obtained payment to incentivize its corresponding mobile devices based on the r-egalitarian Shapley value. With the desired properties of these cooperative game concepts, we attempt to reach an outcome that meets our design goals while remaining advantageous under asymmetric information situations.
1.1. Related Work
As a promising driving force for the improvement of 5G network capacity, multihop D2D communications are attracting growing attention from industry and academia. Recently, various protocols have been published that address the key techniques and challenges of multihop D2D communications from new perspectives. The Coalitional Perspective D2D (CPD2D) scheme is a cooperative game approach to perform a multipath routing mechanism. This scheme enables the WOs and mobile devices to improve their payoffs by working collaboratively [4]. To model the interactions among game players, i.e., WOs and devices, this scheme designs a cooperative game-theoretic algorithm and proposes a layered coalitional game model to address the decision-making problems among players. By using the extended recursive core coalition approach, the cooperative devices establish links among each other to form a stable network structure for the multipath routing. Finally, simulation results have shown that the CPD2D scheme yields notable performance gains relative to the noncooperative approach and achieves good convergence speed [4].
The Relay-Assisted D2D (RAD2D) scheme is designed to improve the QoS of D2D communications while enlarging the communication range [9]. By taking user selfishness and mobility into consideration, this scheme formulates the throughput maximization problem for multihop D2D communications and develops the theoretical foundation of the spectrum reuse set partitioning. In particular, the RAD2D scheme is a relay-assisted D2D communication protocol that addresses the challenges in enabling multihop D2D communications with relay incentives; a cheat-avoidance incentive mechanism is developed with lightweight overheads to incentivize users to relay data. Under dynamic scenarios, extensive simulation results show that the RAD2D scheme can improve the system throughput and the user access rate as compared with baseline schemes [9].
The Centralized Adaptive D2D (CAD2D) scheme focuses on an analysis of state-of-the-art routing algorithms that enable intelligent D2D communications [10]. Based on centralized adaptive routing, this scheme develops a new route discovery mechanism that greatly reduces the routing overhead. Depending upon network conditions, such as varying device density and traffic load, the CAD2D scheme periodically updates the D2D communication path while adapting between reactive and proactive routing strategies. By gathering information from all mobile devices, the protocol proposed in [10] has a number of features, including energy and load awareness, special route requests, and the avoidance of any kind of flooding. The main contribution of the CAD2D scheme is to dramatically reduce the routing overhead in multihop D2D communications [10].
The earlier studies [4, 9, 10] have attracted considerable attention while introducing unique challenges in handling multihop D2D communication problems. In this paper, we compare our proposed scheme with the existing CPD2D [4], RAD2D [9], and CAD2D [10] schemes and demonstrate that our dual-level cooperative game-based D2D control approach can significantly outperform these existing schemes.
1.2. Contribution
In this paper, we model the interactions between WOs and mobile devices and design a new dual-level hierarchical cooperative game. In the upper-level game, WOs are game players, and the incentive payments for relay WOs are calculated based on the IMD. In the lower-level game, data-relaying mobile devices are game players, and they share the incentive payment of their corresponding WO based on the r-egalitarian Shapley value. To leverage the full synergy of our dual-level approach, we comprehensively take into account the control issues and the relevant practical factors in operator-controlled multihop D2D communications. In summary, the contributions of this study are as follows:
(i) Dual-level game model: motivated by a hierarchically dependent situation, we introduce a new dual-level game model that captures the interactive relationship between WOs and mobile devices. Our dual-level game approach is generic and applicable to operator-controlled multihop D2D communications.
(ii) IMD for the upper-level game: incentive payments for WOs are assigned to compensate for the cost of D2D communication relay services. According to the IMD, we properly compensate the devices' relaying cost through their representative WO.
(iii) r-egalitarian Shapley value for the lower-level game: mobile devices share the incentive payment of their corresponding WO. According to the r-egalitarian Shapley value, it can be shared in a fair-efficient way. Therefore, our method can effectively induce mobile devices to participate in multihop D2D data relay services.
(iv) The synergy of the two combined game models: we explore the interaction of two different game approaches and jointly design an integrated scheme to leverage their synergistic and complementary features. The main idea of our dual-level game lies in its responsiveness to the reciprocal combination of two different cooperative game solutions for operator-controlled D2D communications.
(v) Solution concept: under dynamic D2D communication environments, traditional noncooperative game solutions suffer from uncertainty and impractical assumptions. The main goal of this study is to investigate the potential benefit gained from practically implemented cooperative game methods and to obtain the finest solution based on a step-by-step interactive feedback process.
(vi) Conclusions: the numerical study shows that our dual-level game approach can improve the system throughput and the fairness of WOs and mobile devices by 10% to 40% under different D2D service request rates, compared with the existing CPD2D [4], RAD2D [9], and CAD2D [10] schemes.
1.3. Organization
The rest of this paper proceeds as follows. Section 2 presents the infrastructure of the operator-controlled multihop D2D communication system and gives some basic mathematical concepts about the IMD and the r-egalitarian Shapley value. The details of our proposed scheme, based on the novel dual-level game method, are also covered in this section. Experimental results from the simulation analysis are provided in Section 3. Finally, Section 4 summarizes the whole work and concludes this study with suggestions for future work.
2. Proposed Operator-Controlled D2D Control Scheme
In this section, we present several concepts related to the IMD and the r-egalitarian Shapley value; they are needed in the rest of the paper. Then, we briefly introduce the formulation of our dual-level game approach and explain in detail the proposed operator-controlled D2D communication scheme. Finally, the scheme is described as a nine-step procedure.
2.1. Operator-Controlled D2D Communication Infrastructure
In this study, we consider an operator-controlled D2D communication system consisting of a number of devices belonging to multiple operators. Mobile devices are randomly deployed within a large coverage area. In the operator-controlled D2D infrastructure, the geographic coverage area is subdivided and served by operators. Operators are connected to each other through high-speed wired links to transfer the system control information, and mobile devices are connected to their corresponding operators through wireless links. Due to the limited transmission power of each device, multihop relaying is adopted to route data flows from source mobile devices to destination mobile devices [4, 11]. Denote the set of operators as $\mathbb{O}$, and let $\mathbb{D}_i$ be the set of devices under the operator $O_i \in \mathbb{O}$. $O_i$ is responsible for the devices in the set $\mathbb{D}_i$; each device $d_j \in \mathbb{D}_i$ reports its status information to its own operator $O_i$ and is connected to neighboring devices for multihop D2D communications. For the multiple access at every hop, we consider an OFDMA-based transmission, and the communication capacity at each hop link is fixed [4, 11, 12]. The general infrastructure of the operator-controlled D2D system is shown in Figure 1, and Table 1 lists the notations used in this paper.

During the D2D system operations, system agents, i.e., operators and mobile devices, make decisions individually. In this situation, the main issue for each agent is how to perform well while considering the mutual-interaction relationship. To formulate this relationship, we design a new dual-level game model. At the upper level, operators are game players, and they establish routing paths while receiving the relay service payment based on the IMD. At the lower level, mobile devices are game players, and they share the service payment given by their corresponding operator according to the r-egalitarian Shapley value. For implementation practicality, our lower-level games are carried out in an entirely distributed and parallel fashion. During the dual-level interaction, operators and mobile devices work together toward an appropriate system performance.
Let $l_{j,k}$ denote the wireless link between two neighboring devices $d_j$ and $d_k$, and let $\mathbb{N}_j$ denote the set of $d_j$'s neighboring devices, where $d_k \in \mathbb{N}_j$. $C(l_{j,k})$ is the communication cost degree of link $l_{j,k}$ for D2D communications [4]. It is determined by the interference parameter of $d_j$, the cardinality of $\mathbb{N}_j$, the path loss exponent $\alpha$, the interference factor $\beta$, and the distance between $d_j$ and $d_k$. Let $F$ be a multihop communication flow whose source and destination devices are represented as $s_F$ and $t_F$, respectively. $F$ consists of multiple links, and the mobile devices for these links can be covered by different operators [4]. Mobile devices estimate the $C(\cdot)$ values of all available connection links and report this information to their corresponding operators. Therefore, each operator can recognize the device topology of its coverage area; operators interact with each other to configure the large area, which is partially covered by other operators.
If an operator covers the source (or destination) device of a flow, it is called a source (or destination) operator. Between the source operator and the destination operator, there can be some relay operators. Each relay operator reveals the total sum of the communication cost degree values of the relay links in its corresponding area; this sum can be interpreted as the relay cost of that operator. In this study, the source operator is responsible for collecting all this information and configures a routing path through relay operators to reach the destination operator. Usually, multihop D2D communication algorithms are designed on the assumption that devices under relay operators are willing to act as relay nodes in the multihop routing path. However, devices acting as relay nodes have to sacrifice their resources to forward data packets. Therefore, it is necessary to stimulate collaborative actions of relay devices toward a socially optimal outcome. During the upper-level game operation, we develop an incentive payment mechanism to guide selfish relay devices. Based on the IMD, the source device pays appropriate incentives to the relay operators. Then, each relay operator redistributes the given incentive to its corresponding relay devices. During the lower-level game operation, this incentive sharing problem is solved according to the r-egalitarian Shapley value.
2.2. Incentive Mechanism Design Based Upper-Level Game Model
In the upper-level game procedure, the main issue is to calculate the incentive payment. To develop an incentive payment algorithm for D2D communications, the key concern is how much a relay operator should be paid for its participation in relay services. In this paper, the basic concept of IMD is adopted to calculate the incentive payment for each relay operator. The IMD, also called reverse game theory, is a field in economics and game theory that takes an engineering approach toward desired objectives, where players act rationally. The main feature of IMD is that a game designer, who is interested in the game's outcome, chooses the game structure to reach a social optimum. For a class of private-information games, IMD studies solution concepts with broad applications, from economics and politics to network system management. However, the IMD has enjoyed much success only in static settings; it does not easily translate into an optimal mechanism for dynamic settings. In addition, the classic IMD literature largely ignores computational considerations [13, 14].
From the viewpoint of strategic players, one natural objective in dynamic environments is maximizing the long-term social welfare of all players (optimality). With regard to optimal mechanisms in a dynamic setting, there are elegant extensions [13]. As a special case of traditional IMD, the Vickrey–Clarke–Groves (VCG) mechanism is a generic truthful mechanism for achieving a socially optimal solution while being applicable to quite general dynamic settings. In particular, the VCG mechanism is strategy-proof, in the sense that truthfully reporting a player's preference is always a dominant strategy. This property can provide a normative guide for the outcome and has better computational properties than the classical IMD approach [14].
In the upper-level game model, the VCG mechanism is used to define a strategic situation in which the D2D system exhibits better performance when independent operators pursue self-interested strategies. Let M be our payment mechanism for the upper-level game, and let $\Omega$ denote the set of possible outcomes based on inputs from relay operators. $\mathbb{R}$ represents the set of relay operators for the traffic relay service of a flow $F$. Each relay operator $O_i \in \mathbb{R}$ has its valuation function $v_i(\cdot)$, which quantifies $O_i$'s value for a specific outcome $o \in \Omega$. Usually, $v_i(\cdot)$ maps $o$ to a positive real number. In this study, this number represents $O_i$'s real contribution to the relay service, and $o$ is a set of relay operators that establishes a routing path. Motivated by the basic idea of the Dijkstra routing algorithm, $o$ is chosen to establish the routing path while minimizing the total sum of the $v_i(o)$ values, where $O_i \in o$. For the relay service of $F$, $O_i$'s valuation function is defined over the set of relay mobile devices under $O_i$ for the service. For each such relay device and its next relay device on the path, it accounts for the energy that the relay device consumes to connect to the next relay device. With the control parameter $\gamma$, the value increases polynomially with the communication cost degree between the two devices.
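Under M, each relay operator $O_i$ is asked to report its valuation function, and M decides an outcome by selecting relay operators to set up a multihop D2D route. At this moment, each relay operator can submit a function $\hat{v}_i(\cdot)$, which may or may not equal its true valuation function $v_i(\cdot)$; $\hat{v}_i \neq v_i$ represents that the operator $O_i$ misreports its true valuation function. If M is the VCG mechanism, the outcome selected by M is

$$o^{*} = \arg\min_{o \in \Omega} \sum_{O_j \in o} \hat{v}_j(o), \quad (3)$$

where $o \in \Omega$ means that $o$ consists of relay operators that relay the service from the source device to the destination device. The incentive payment of $O_i$, i.e., $p_i$, is defined as the profit that its presence causes others with respect to the reported valuation functions [8]; formally,

$$p_i = \sum_{O_j \in o^{*}_{-i}} \hat{v}_j\left(o^{*}_{-i}\right) - \sum_{O_j \in o^{*},\, j \neq i} \hat{v}_j\left(o^{*}\right), \quad (4)$$

where $o^{*}_{-i}$ is the outcome that M selects when $O_i$ is absent.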
In equation (4), the first term is the total reported value the other operators would obtain when $O_i$ is absent, and the second term is the total reported value the others obtain when $O_i$ is present. If $O_i$'s dominant strategy is to report its valuation truthfully, i.e., $\hat{v}_i = v_i$, we say that M is truthful [8]. Formally, it can be expressed as follows:

$$u_i\left(v_i, \hat{v}_{-i}\right) \geq u_i\left(\hat{v}_i, \hat{v}_{-i}\right), \quad \text{where } u_i = p_i - c_i, \quad (5)$$

where $u_i$ is $O_i$'s utility function, which is decided based on $O_i$'s own report and $\hat{v}_{-i}$. $\hat{v}_{-i}$ is the vector of reported valuation functions of all relay operators except $O_i$, and $c_i$ is the real cost function of $O_i$. In this study, $c_i$ is defined in the same manner as the communication cost degree. If the incentive payment for each relay operator is given according to (4), M is a truthful mechanism [8].
Theorem 1. M is a truthful mechanism for all relay operators.
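Proof. Fix the reports $\hat{v}_{-i}$ of all relay operators except $O_i$. Suppose that $O_i$'s true valuation function is $v_i$; it can report its valuation function truthfully, i.e., $\hat{v}_i = v_i$, or untruthfully, i.e., $\hat{v}_i \neq v_i$. For an outcome $o$, let $\hat{V}_{-i}(o) = \sum_{O_j \in o,\, j \neq i} \hat{v}_j(o)$ denote the total reported value of the relay operators other than $O_i$, and let $o^{*}_{-i}$ denote the outcome selected when $O_i$ is absent. The real cost of $O_i$ is taken to be its true valuation, i.e., $c_i(o) = v_i(o)$, with $v_i(o) = 0$ when $O_i \notin o$.
Case (I). If $O_i$ reports a false valuation function $\hat{v}_i$, then the outcome $\hat{o}$ of M is given by (3) and $O_i$'s incentive payment $\hat{p}_i$ is given by (4). Therefore, $O_i$'s utility function can be defined as follows:

$$u_i\left(\hat{v}_i, \hat{v}_{-i}\right) = \hat{p}_i - v_i(\hat{o}) = \hat{V}_{-i}\left(o^{*}_{-i}\right) - \hat{V}_{-i}(\hat{o}) - v_i(\hat{o}). \quad (6)$$

Case (II). If $O_i$ truthfully reports its valuation function, i.e., $\hat{v}_i = v_i$, then the outcome of M is given by

$$o^{*} = \arg\min_{o \in \Omega} \left( v_i(o) + \hat{V}_{-i}(o) \right), \quad (7)$$

$O_i$'s incentive payment is

$$p_i = \hat{V}_{-i}\left(o^{*}_{-i}\right) - \hat{V}_{-i}\left(o^{*}\right), \quad (8)$$

and $O_i$'s utility function can be defined as follows:

$$u_i\left(v_i, \hat{v}_{-i}\right) = p_i - v_i\left(o^{*}\right) = \hat{V}_{-i}\left(o^{*}_{-i}\right) - \hat{V}_{-i}\left(o^{*}\right) - v_i\left(o^{*}\right). \quad (9)$$

Finally,

$$u_i\left(v_i, \hat{v}_{-i}\right) - u_i\left(\hat{v}_i, \hat{v}_{-i}\right) = \left[ \hat{V}_{-i}\left(o^{*}_{-i}\right) - \hat{V}_{-i}\left(o^{*}\right) - v_i\left(o^{*}\right) \right] - \left[ \hat{V}_{-i}\left(o^{*}_{-i}\right) - \hat{V}_{-i}(\hat{o}) - v_i(\hat{o}) \right]. \quad (10)$$

$O_i$'s report has no influence on $o^{*}_{-i}$ and $\hat{V}_{-i}(o^{*}_{-i})$. Therefore, they are the same constant from the viewpoint of the player $O_i$, and equation (10) can be simplified as follows:

$$u_i\left(v_i, \hat{v}_{-i}\right) - u_i\left(\hat{v}_i, \hat{v}_{-i}\right) = \left( v_i(\hat{o}) + \hat{V}_{-i}(\hat{o}) \right) - \left( v_i\left(o^{*}\right) + \hat{V}_{-i}\left(o^{*}\right) \right) \geq 0, \quad (11)$$

where the inequality holds because $o^{*}$ minimizes $v_i(o) + \hat{V}_{-i}(o)$ over $\Omega$ by (7). In conclusion, the value of $u_i(v_i, \hat{v}_{-i})$ is never lower than the value of $u_i(\hat{v}_i, \hat{v}_{-i})$ for every $v_i$, $\hat{v}_i$, and $\hat{v}_{-i}$.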
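The upper-level procedure can be illustrated with a small numerical sketch. The code below is not the authors' implementation; it assumes a hypothetical operator graph in which each relay operator reports a single scalar relay cost, the source and destination operators cost nothing, and the route is the least-cost path found with Dijkstra's algorithm. The names `least_cost_route`, `vcg_payments`, `utility`, and the toy topology are all illustrative assumptions. Each selected relay operator is paid the cost of the best route without it minus the reported cost of the other operators on the chosen route.

```python
import heapq
import math

def least_cost_route(adj, cost, src, dst, banned=frozenset()):
    """Dijkstra over operators; entering a relay operator adds its reported cost."""
    best = {src: 0.0}
    prev = {}
    heap = [(0.0, src)]
    while heap:
        dist, u = heapq.heappop(heap)
        if dist > best[u]:
            continue  # stale heap entry
        if u == dst:
            break
        for w in adj[u]:
            if w in banned:
                continue
            nd = dist + (0.0 if w == dst else cost[w])
            if nd < best.get(w, math.inf):
                best[w], prev[w] = nd, u
                heapq.heappush(heap, (nd, w))
    if dst not in best:
        return math.inf, []
    route = [dst]
    while route[-1] != src:
        route.append(prev[route[-1]])
    return best[dst], route[::-1]

def vcg_payments(adj, reported, src, dst):
    """Return the chosen route and the VCG payment of every relay operator on it."""
    total, route = least_cost_route(adj, reported, src, dst)
    payments = {}
    for op in route[1:-1]:
        # Total reported cost of the others when op is absent ...
        without_op, _ = least_cost_route(adj, reported, src, dst, banned={op})
        # ... minus the total reported cost of the others when op is present.
        payments[op] = without_op - (total - reported[op])
    return route, payments

def utility(adj, true_cost, op, report, src, dst):
    """Payment minus true relay cost of op when it reports `report`."""
    reported = dict(true_cost)
    reported[op] = report
    _, payments = vcg_payments(adj, reported, src, dst)
    return payments[op] - true_cost[op] if op in payments else 0.0

# Hypothetical topology: source operator S, destination operator T, relays A and B.
adj = {"S": ["A", "B"], "A": ["S", "T"], "B": ["S", "T"], "T": ["A", "B"]}
true_cost = {"S": 0.0, "A": 4.0, "B": 6.0, "T": 0.0}
route, payments = vcg_payments(adj, true_cost, "S", "T")
```

In this toy case operator A is selected and receives the cost of the alternative route through B. Misreporting cannot raise A's utility: a lower or slightly higher report leaves the payment unchanged, and a report above B's cost removes A from the route. If removing an operator disconnects the source from the destination, the sketch returns an infinite payment, which signals that the operator is a monopoly relay.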
2.3. Lower-Level Game Model Based on the r-egalitarian Shapley Value
In the upper-level game, the incentives are paid by the source device to relay operators. In the lower-level game, each individual relay operator redistributes its given incentive to the corresponding relay devices to compensate them for their losses. Naturally, mobile devices under a relay operator enter into a binding agreement to form a coalition if all relay devices are able to improve their individual payoffs. Because some relay devices may contribute more to the coalition than others, the given incentive should be shared fairly and optimally among the relay devices. Therefore, the main concern in the lower-level game is to maintain the overall cooperation of relay devices while sharing the given incentive in a fair-efficient way. In the proposed scheme, we adopt another novel cooperative game solution to answer this question.
In 1953, L. Shapley characterized a solution concept for canonical coalition games. This solution is known as the Shapley value. Under superadditivity, it assigns a unique distribution among the players of the total surplus generated by the coalition of all players. Shapley also proved that the Shapley value satisfies four axioms: (i) efficiency, (ii) symmetry, (iii) dummy, and (iv) additivity. Axioms (i), (ii), and (iii) are self-explanatory. To motivate axiom (iv), imagine that the same players engage in two consecutive games. This axiom states that the outcome in one game should not affect the other, and thus, in the combined game, the allocation to a player is the sum of his allocations in the component games [7, 8].
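The four axioms can be checked numerically on a small example. The following is a minimal sketch rather than part of the proposed scheme; the helper name `shapley` and the two three-player games `v1` and `v2` are hypothetical and chosen only for illustration. In `v2`, players `a` and `b` are symmetric and player `c` is a dummy.

```python
from itertools import combinations
from math import factorial, isclose

def shapley(players, v):
    """Shapley value of the coalition game v, a function on frozensets of players."""
    n = len(players)
    phi = {}
    for i in players:
        others = [p for p in players if p != i]
        total = 0.0
        for k in range(n):
            # Probability that exactly this size-k subset precedes i
            # in a uniformly random ordering of the players.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                total += weight * (v(s | {i}) - v(s))
        phi[i] = total
    return phi

players = ["a", "b", "c"]

def v1(s):
    # Worth grows with the square of the coalition size.
    return float(len(s) ** 2)

def v2(s):
    # Only the pair {a, b} creates value; c never contributes.
    return 10.0 if {"a", "b"} <= s else 0.0

def v_sum(s):
    # The combined game of two consecutive games.
    return v1(s) + v2(s)

phi1 = shapley(players, v1)
phi2 = shapley(players, v2)
phi_sum = shapley(players, v_sum)
```

Comparing `phi_sum` with the player-wise sum of `phi1` and `phi2` illustrates additivity, while `phi2` illustrates symmetry and the dummy axiom.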
In 2018, Yokote et al. modified the concept of the Shapley value and introduced the r-egalitarian Shapley value ($\phi^{r}$). It is characterized by axioms that retain the advantages of the original Shapley value. The solution $\phi^{r}$ satisfies the efficiency, weak covariance, and balanced contributions property for equal contributors axioms. To explain the axioms, we introduce some notation. A game with transferable utility is a pair $(N, v)$ consisting of a set of players $N$ and a characteristic (coalition) function $v$, which assigns a real number $v(S)$ to every coalition $S \in 2^{N} \setminus \{\emptyset\}$, i.e., the set of nonempty subsets of $N$. For $S \subseteq N$, let $(S, v)$ denote the game in which the domain of $v$ is restricted from $2^{N}$ to $2^{S}$ and the player set is $S$ [15].
(i) Efficiency. For all $(N, v)$, $\sum_{i \in N} \phi^{r}_{i}(N, v) = v(N)$; it divides the total payoff $v(N)$.
(ii) Weak covariance. A transformed game is defined from $(N, v)$ through real-valued parameters, and the outcome of the transformed game is required to be determined linearly from the outcome of $(N, v)$; the formal statement is given in [15].
(iii) Balanced contributions property for equal contributors. For all $(N, v)$ and $i, j \in N$, if $v(N \setminus \{i\}) = v(N \setminus \{j\})$, then $\phi^{r}_{i}(N, v) - \phi^{r}_{i}(N \setminus \{j\}, v) = \phi^{r}_{j}(N, v) - \phi^{r}_{j}(N \setminus \{i\}, v)$.
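Given that the grand coalition forms, $\phi^{r}$ addresses the problem of how to distribute the total payoff $v(N)$ among players fairly. The solution $\phi^{r}$ assigns a payoff vector to each game $(N, v)$. To define $\phi^{r}$, the idea of rescaling the worth of coalitions is necessary [15]. Let $\mathcal{R}$ denote the set of finite sequences of real numbers:

$$\mathcal{R} = \left\{ r = (r_{s})_{s=1}^{n} : r_{s} \in \mathbb{R} \right\}, \quad (12)$$

where $n$ represents the cardinality of $N$. According to (12) and $r \in \mathcal{R}$, $v^{r}$ is defined as

$$v^{r}(S) = r_{|S|} \cdot v(S) \quad \text{for all nonempty } S \subseteq N, \quad (13)$$

with $v^{r}(\emptyset) = 0$.
In the game $(N, v^{r})$, the worth of each coalition $S$ is rescaled by multiplying it by the $s$th entry of the sequence vector $r$, where $s$ is the size of coalition $S$. This type of rescaling is often discussed in the context of the per-capita measure or discounting; $v^{r}$ can be interpreted as generalizing these ideas by allowing for any sequence of real numbers [15]. Finally, $\phi^{r}$ is defined as follows:

$$\phi^{r}_{i}(N, v) = \sum_{S \subseteq N,\, i \in S} \frac{(|S|-1)!\,(n-|S|)!}{n!} \left( v^{r}(S) - v^{r}(S \setminus \{i\}) \right) + \frac{v(N) - v^{r}(N)}{n}, \quad (14)$$

where the factorial weight can be interpreted as the probability of a coalition containing player $i$ with the size of $|S|$, and $v^{r}(S) - v^{r}(S \setminus \{i\})$ is the payoff difference between the coalitions with and without player $i$, which measures the contribution of $i$ to the coalition. The first term of (14) is the original Shapley value of the game $(N, v^{r})$ [7, 16]. Therefore, we can interpret $\phi^{r}$ as follows: (i) the worths of coalitions are rescaled based on the sequence vector $r$, (ii) an imaginary game $(N, v^{r})$ is constructed, (iii) the Shapley value idea is applied to this game, and (iv) the gap between $v(N)$ and $v^{r}(N)$ is equally divided among players [15].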
In the lower-level game process, the given incentive of each relay operator is shared by its corresponding relay devices. Therefore, relay devices are game players, and the relay devices under each relay operator form a coalition. In this study, the characteristic function for a coalition is defined based on the bankruptcy problem in [17]. It is analogous to a distribution or entitlement problem involving the allocation of a given amount of a perfectly divisible good. Therefore, we can effectively estimate the characteristic function values for all possible coalitions. Based on the bankruptcy problem and the r-egalitarian Shapley value, relay devices can share their incentive payment using equation (14); it is a fair-efficient solution that satisfies the above axioms.
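The lower-level sharing rule can be sketched numerically as follows. This is an illustration under stated assumptions, not the authors' code: the characteristic function uses the common bankruptcy-game form, in which a coalition is worth what remains of the estate after all outsiders' claims are paid in full, and the exact form used in [17] may differ. The device names, claims, incentive amount, and rescaling sequence `r` are hypothetical. The r-egalitarian Shapley value is computed as the Shapley value of the rescaled game plus an equal split of the gap between the original and rescaled worth of the grand coalition.

```python
from itertools import combinations
from math import factorial, isclose

def shapley(players, v):
    """Shapley value of the coalition game v, a function on frozensets of players."""
    n = len(players)
    phi = {}
    for i in players:
        others = [p for p in players if p != i]
        total = 0.0
        for k in range(n):
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                total += weight * (v(s | {i}) - v(s))
        phi[i] = total
    return phi

def bankruptcy_game(estate, claims):
    """Assumed form: v(S) = max(0, estate - sum of claims of devices outside S)."""
    def v(s):
        if not s:
            return 0.0
        outside = sum(c for p, c in claims.items() if p not in s)
        return max(0.0, estate - outside)
    return v

def r_egalitarian_shapley(players, v, r):
    """Shapley value of the rescaled game plus an equal share of the remaining gap."""
    n = len(players)
    grand = frozenset(players)

    def v_r(s):
        # Rescale the worth of a coalition by the entry of r indexed by its size.
        return r[len(s) - 1] * v(s) if s else 0.0

    base = shapley(players, v_r)
    gap = (v(grand) - v_r(grand)) / n
    return {p: base[p] + gap for p in players}

# Hypothetical relay devices under one relay operator and their relay-cost claims.
claims = {"d1": 6.0, "d2": 4.0, "d3": 8.0}
incentive = 12.0  # incentive payment received by the relay operator
devices = list(claims)
v = bankruptcy_game(incentive, claims)
shares = r_egalitarian_shapley(devices, v, r=[1.0, 0.5, 0.5])
```

With `r` set to all ones the rule reduces to the original Shapley value, and with `r` set to all zeros it reduces to an equal division of the incentive; intermediate sequences trade off contribution-based and egalitarian sharing.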
2.4. Main Steps of Proposed Dual-Level D2D Communication Scheme
To effectively operate operator-controlled multihop D2D communications, the interactive relationship between operators and mobile devices is an important research topic and should be considered in the design of the control scheme. In this study, we provide the main D2D communication control method, which is modeled based on two cooperative game solutions, i.e., the IMD and the r-egalitarian Shapley value. Owing to our dual-level game model, the upper-level and lower-level game processes are applied hierarchically, and we can obtain a fair-efficient system performance by combining both solution approaches. Our dual-level game-based D2D control method is operated periodically for each multihop communication service. The principal novelties of this study are the judicious mixture of two cooperative game solutions and the feasible self-adaptability of each D2D network agent, i.e., operators and mobile devices, in real-world multihop D2D system operations.
Usually, conventional optimization methods such as Lagrangian or dynamic programming require global objective functions with exponential time complexity; they are impractical to implement for realistic system operations. However, our dual-level game approach can significantly reduce computational complexity through its distributed lower-level game operations; this is an important feature of the proposed scheme. The main steps of the proposed scheme are described as follows.
Step 1. System factors and control parameters are determined by the simulation scenario (see simulation assumptions in Section 3).
Step 2. All operators announce their valuation function values to connect with their neighboring operators. Owing to the features of the VCG mechanism, relay operators truthfully announce their values.
Step 3. A multihop D2D communication service is generated from a source device. At this time, the source operator, which covers the source device, identifies the destination operator, which covers the destination device, while figuring out all possible relay operators.
Step 4. The source operator establishes the multihop D2D communication route, which can consist of multiple relay operators. According to the Dijkstra routing algorithm, this route is decided so as to minimize the total sum of all relay operators' valuation function values.
Step 5. During the upper-level game process, the source device pays the incentive payments to relay operators using equation (4).
Step 6. During the lower-level game process, mobile relay devices under each individual relay operator on the route share the given incentive payment using equation (14). Owing to the features of the r-egalitarian Shapley value, the payment is shared in a fair-efficient way among relay mobile devices.
Step 7. In a distributed fashion, all relay operators execute their lower-level games in parallel. Therefore, we can significantly reduce the computational complexity of calculating the r-egalitarian Shapley values. This approach is suitable for practical implementation.
Step 8. Based on the dual-level game model, relay operators and mobile devices are hierarchically interconnected and interact with one another to operate multihop D2D communications.
Step 9. Whenever a new multihop D2D communication service is generated from another source device, the procedure returns to Step 3 for the next dual-level game procedure.
3. Simulation Results and Discussion
In this section, we perform simulations to examine the performance of our proposed protocol and compare it with that of the CPD2D [4], RAD2D [9], and CAD2D [10] schemes. To ensure a fair comparison, we have considered the following assumptions and scenarios.
(i) The simulated operator-controlled D2D communication system covers a cellular area of 500 m × 500 m.
(ii) There are 10 operators; each covers a 150-meter radius, and they are laid out in a regular pattern.
(iii) There are 100 mobile devices; they are randomly located in the cellular area.
(iv) Multihop communication service requests arrive according to a Poisson process; the offered rate is varied from 0 to 3.0.
(v) We assume that there are no physical obstacles in the experiments and that each mobile device has enough bandwidth capacity for relay services.
(vi) Network performance measures obtained on the basis of 100 simulation runs are plotted as functions of the offered multihop service request rate.
(vii) The path loss exponent, the interference factor for a wireless link, and the control parameter that polynomially increases the cost degree are set to fixed values in this simulation study.
(viii) The performance criteria obtained through simulation are the system throughput, the fairness among operators, and the fairness among mobile devices; these simulation metrics are evaluated mainly to demonstrate the validity of our proposed method.
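The scenario above can be set up in a few lines. The sketch below is only one possible reading of the assumptions and not the authors' simulator: the 2 × 5 grid used for the "regular pattern" of operators, the nearest-operator association rule, the time unit of the arrival rate, and all function names are hypothetical choices made for illustration.

```python
import math
import random

AREA = 500.0     # side length of the square area in meters
RADIUS = 150.0   # coverage radius of each operator in meters
N_DEVICES = 100

def operator_grid(rows=2, cols=5):
    """Assumed regular layout: operators at the cell centers of a rows x cols grid."""
    return [((c + 0.5) * AREA / cols, (r + 0.5) * AREA / rows)
            for r in range(rows) for c in range(cols)]

def deploy_devices(rng, n=N_DEVICES):
    """Place n devices uniformly at random in the square area."""
    return [(rng.uniform(0.0, AREA), rng.uniform(0.0, AREA)) for _ in range(n)]

def assign(devices, operators):
    """Index of the nearest operator for each device, or None if out of coverage."""
    owners = []
    for dev in devices:
        dist, idx = min((math.dist(dev, op), k) for k, op in enumerate(operators))
        owners.append(idx if dist <= RADIUS else None)
    return owners

def poisson_arrivals(rng, rate, horizon):
    """Arrival times of a Poisson process with the given rate on (0, horizon]."""
    times = []
    if rate <= 0.0:
        return times
    t = rng.expovariate(rate)
    while t <= horizon:
        times.append(t)
        t += rng.expovariate(rate)  # exponential inter-arrival times
    return times

rng = random.Random(1)  # fixed seed so that a run can be repeated
operators = operator_grid()
devices = deploy_devices(rng)
owners = assign(devices, operators)
arrivals = poisson_arrivals(rng, rate=1.5, horizon=100.0)
```

With this grid, every point of the area lies within about 135 m of an operator, so each device is covered by at least one operator; a different regular layout would need its own coverage check. Repeating the run with 100 different seeds and averaging the metrics mirrors assumption (vi).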
The throughput comparison for multihop D2D systems is shown in Figure 2. In this study, system throughput is the ratio of successful data delivery over multihop D2D communications. We measure this metric to determine whether our dual-level game approach can orchestrate the operator-controlled D2D communication infrastructure well enough to maximize the system performance. As expected, the system throughput of each scheme tends to increase as the D2D communication service request rate increases. The resulting curves show that our proposed scheme achieves a higher system throughput than the existing schemes. It is therefore worth noting that, under different service request rate conditions, our dual-level game-based self-controlled management policies consistently maintain this performance advantage.
Figure 3 plots the fairness comparison among operators in the D2D system. To characterize the notion of fairness, we follow the main concept of Raj Jain’s fairness index, which ranges from 0 to 1, where 1 is the best case for fairness. It is given by [18]
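For reference, the standard form of Jain's index for n allocations x_1, ..., x_n is the squared sum of the allocations divided by n times the sum of their squares. The sketch below computes this standard form; the notation of equation (15) may differ, and treating an all-zero allocation as perfectly fair is an assumption made here only to avoid a division by zero.

```python
def jain_fairness_index(allocations):
    """Standard Jain's fairness index: (sum x)^2 / (n * sum x^2)."""
    values = list(allocations)
    if not values:
        raise ValueError("at least one allocation is required")
    total = sum(values)
    sum_of_squares = sum(v * v for v in values)
    if sum_of_squares == 0:
        # Assumption: identical all-zero allocations count as perfectly fair.
        return 1.0
    return (total * total) / (len(values) * sum_of_squares)

print(jain_fairness_index([1, 1, 1, 1]))  # equal shares -> 1.0
print(jain_fairness_index([1, 0, 0, 0]))  # one of four gets everything -> 0.25
```

For n nonnegative allocations that are not all zero, the standard index lies between 1/n and 1, so the lower end approaches 0 only as the number of participants grows.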
Following the main features of IMD, our upper-level game procedure balances the ratio of D2D relay contribution to incentive payment well for each operator. Therefore, under diversified service request conditions, the proposed scheme maintains significantly higher fairness values than the CPD2D, RAD2D, and CAD2D schemes. This is a highly desirable property for multihop D2D communication operations. To our knowledge, such a result has not been achieved without explicitly adopting a truthful incentive mechanism for relay operators.
Figure 4 depicts the fairness comparison among mobile devices in each relay operator. It is also estimated based on the fairness index in equation (15). As can be seen, the fairness among mobile devices follows a trend very similar to that in Figure 3. In the proposed scheme, the mobile devices in each relay operator share the given incentive payment according to the r-egalitarian Shapley value. If fairness is not explicitly considered at the design stage of the lower-level game process, the incentive shares of the mobile devices differ significantly, which results in lower fairness values. The simulation results clearly show that our proposed scheme effectively assigns the incentive payments to relay mobile devices while ensuring fairness among them. In particular, the r-egalitarian Shapley value compensates the actual contributions of relay devices through the axiom of the balanced contributions property for equal contributors. Therefore, we attain a higher fairness for mobile devices than the other existing schemes.
4. Summary and Conclusions
To meet the growing demands of traffic services, a constant need to increase the network capacity has led to the evolution of D2D communications in 5G networks. In a conventional D2D communication system, devices are not allowed to communicate with each other through multihop connections. This paper proposes a novel operator-assisted multihop D2D communication scheme. The role of the operators is to coordinate mobile devices in a distributed manner while receiving the incentive payment from the source device. To induce selfish mobile devices to participate in multihop D2D communications, we adopt two cooperative game solutions: IMD and the r-egalitarian Shapley value. These two solution methods interact with each other in our dual-level game model, and together they create a win-win situation for multihop D2D communication services. Therefore, in the proposed scheme, operators and mobile devices work together reciprocally toward an appropriate system performance. Based on the analysis of the simulation results, we demonstrate that our dual-level game approach is effective and efficient compared with the existing CPD2D, RAD2D, and CAD2D schemes. As directions for future research, we aim to investigate the privacy and energy issues of multihop D2D communications. In addition, we plan to develop a new mechanism design with theoretical analysis, which is another possible extension of this work.
Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The author declares that there are no conflicts of interest regarding the publication of this paper.
Acknowledgments
This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2019-2018-0-01799) supervised by the IITP (Institute for Information and Communications Technology Planning and Evaluation), and by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2018R1D1A1A09081759).
References
[1] M. Z. Shakir, M. Ismail, X. Wang, K. A. Qaraqe, and E. Serpedin, “From D2D to Ds2D: prolonging the battery life of mobile devices via Ds2D communications,” IEEE Wireless Communications, vol. 24, no. 4, pp. 55–63, 2017.
[2] Y. Desale, K. Bakade, and S. Wagh, “Optimum protocol for route discovery in device to device communication,” in Proceedings of the IEEE International Conference on Signal Processing, pp. 1–4, Phoenix, AZ, USA, September 2016.
[3] F. S. Shaikh and W. Roland, “Routing in multihop cellular device-to-device (D2D) networks: a survey,” in Proceedings of the IEEE Communications Surveys & Tutorials, 2018.
[4] X. Lu, P. Wang, and D. Niyato, “Hierarchical cooperation for operator-controlled device-to-device communications: a layered coalitional game approach,” in Proceedings of the IEEE Wireless Communications & Networking Conference, pp. 2056–2061, New Orleans, LA, USA, March 2015.
[5] E.-C. Park, D.-Y. Kim, C.-H. Choi, and J. So, “Improving quality of service and assuring fairness in WLAN access networks,” IEEE Transactions on Mobile Computing, vol. 6, no. 4, pp. 337–350, 2007.
[6] J. So and N. H. Vaidya, “Load-balancing routing in multichannel hybrid wireless networks with single network interface,” IEEE Transactions on Vehicular Technology, vol. 56, no. 1, pp. 342–348, 2007.
[7] S. Kim, Game Theory Applications in Network Design, IGI Global, Hershey, PA, USA, 2014.
[8] A. R. Karlin and Y. Peres, Game Theory, Alive, American Mathematical Society, Providence, RI, USA, 2017.
[9] T. Liu, J. C. S. Lui, X. Ma, H. Jiang, and H. Jiang, “Enabling relay-assisted D2D communication for cellular networks: algorithm and protocols,” IEEE Internet of Things Journal, vol. 5, no. 4, pp. 3136–3150, 2018.
[10] F. S. Shaikh and W. Roland, “Centralized adaptive routing in multihop cellular D2D communications,” in Proceedings of the International Conference on Communication and Computing Systems, pp. 158–162, Krakow, Poland, 2017.
[11] H. Zhang, L. Song, and Y. J. Zhang, “Load balancing for 5G ultra-dense networks using device-to-device communications,” IEEE Transactions on Wireless Communications, vol. 17, no. 6, pp. 4039–4050, 2018.
[12] H. Byun and J. So, “Node scheduling control inspired by epidemic theory for data dissemination in wireless sensor-actuator networks with delay constraints,” IEEE Transactions on Wireless Communications, vol. 15, no. 3, pp. 1794–1807, 2016.
[13] S. M. Kakade, I. Lobel, and H. Nazerzadeh, “Optimal dynamic mechanism design via a virtual VCG mechanism,” ACM SIGecom Exchanges, vol. 10, no. 1, pp. 27–30, 2011.
[14] R. K. Dash, N. R. Jennings, and D. C. Parkes, “Computational-mechanism design: a call to arms,” IEEE Intelligent Systems, vol. 18, no. 6, pp. 40–47, 2003.
[15] K. Yokote, T. Kongo, and Y. Funaki, “The balanced contributions property for equal contributors,” Games and Economic Behavior, vol. 108, pp. 113–124, 2018.
[16] J. Cai and U. Pooch, “Allocate fair payoff for cooperation in wireless ad hoc networks using Shapley value,” in Proceedings of the IEEE International Parallel & Distributed Processing Symposium, pp. 1–8, Anchorage, AK, USA, 2004.
[17] D. Niyato and E. Hossain, “A cooperative game framework for bandwidth allocation in 4G heterogeneous wireless networks,” in Proceedings of the IEEE International Conference on Communications, pp. 4357–4362, Istanbul, Turkey, June 2006.
[18] M. Dianati, X. Shen, and S. Naik, “A new fairness index for radio resource allocation in wireless networks,” in Proceedings of the IEEE Wireless Communications & Networking Conference, vol. 2, pp. 712–715, New Orleans, LA, USA, March 2005.
Copyright
Copyright © 2019 Sungwook Kim. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.