Abstract

From social contracts to climate agreements, individuals engage in groups that must collectively reach decisions with varying levels of equality and fairness. These dilemmas also pervade distributed artificial intelligence, in domains such as automated negotiation, conflict resolution, or resource allocation, which aim to engineer self-organized group behaviors. As evidenced by the well-known Ultimatum Game, where a Proposer has to divide a resource with a Responder, payoff-maximizing outcomes are frequently at odds with fairness. Eliciting equality in populations of self-regarding agents requires judicious interventions. Here, we use knowledge about agents’ social networks to implement fairness mechanisms, in the context of Multiplayer Ultimatum Games. We focus on network-based role assignment and show that attributing the role of Proposer to low-connected nodes increases the fairness levels in a population. We evaluate the effectiveness of low-degree Proposer assignment considering networks with different average connectivities, group sizes, and group voting rules when accepting proposals (e.g., majority or unanimity). We further show that low-degree Proposer assignment is efficient, in optimizing not only individuals’ offers but also the average payoff level in the population. Finally, we show that stricter voting rules (i.e., imposing an accepting consensus as a requirement for collectives to accept a proposal) attenuate the unfairness that results from situations where high-degree nodes (hubs) play as Proposers. Our results suggest new routes to use role assignment and voting mechanisms to prevent unfair behaviors from spreading on complex networks.

1. Introduction

Fairness has a profound impact on human decision-making and individuals often prefer fair outcomes over payoff-maximizing ones [1]. This has been evidenced through behavioral experiments, frequently employing the celebrated Ultimatum Game (UG) [2]. In the UG, one Proposer decides how to divide a given resource with a Responder. The game only yields payoffs associated with the proposed resource allocation to the participants if the Responder accepts the proposal. Human Proposers tend to sacrifice some of their share by offering high proposals, and Responders often prefer to earn nothing rather than accepting unfair divisions. These counterintuitive results motivated several lab experiments and theoretical models that aimed at justifying, mathematically and empirically, the emergence and maintenance of fair intentions in human behavior [37].

Most of these works, however, have neglected the fact that, in many situations, offers are made in the context of groups, instead of simpler pairwise interactions. This is the case in the negotiation of collective work contracts, environmental coalitions and policy making [8], human rights conventions, collective insurance [9], adoption of regulatory frameworks (e.g., in the use of technology [10]), exchange of flexibilities between local energy communities [11], or the simple act of scheduling a meeting with several participants, among other possible scenarios. Fairness and bargaining dilemmas occur within groups, in which group decisions emerge from the combination of each individual’s assessment of what is perceived as a fair offer. Similarly, in engineering applications grounded on artificial intelligence and multiagent systems, fairness concerns are important in domains that go beyond pairwise interactions. Autonomous agents have to take part in group interactions that must decide between outcomes that may each favour a different part of the group. Examples of such domains are automated bargaining [12], conflict resolution [13], or multiplayer resource allocation [14].

To capture some of the dilemmas associated with fairness versus payoff maximization in these group interactions, one may resort to multiplayer extensions of the Ultimatum Game [15] (MUG) (see Figure 1). Here, a proposal is made by a Proposer to a group of Responders that, collectively, decide to accept or reject it. As in the pairwise UG, the strategy of a Proposer, , is the fraction of resource offered to the Responders; the strategy of each Responder , , is the personal threshold used to decide between acceptance and rejection [5, 6]. Groups decide to accept and reject a proposal through functions of the individual acceptance thresholds, . Group acceptance depends on a decision rule: if the fraction of acceptances equals or exceeds a minimum fraction of accepting Responders, , the proposal is accepted by the group. In that case, the Proposer keeps what she did not offer () and the offer is divided by the Responders—each receiving . If the fraction of acceptances remains below , the proposal is rejected by the group and no one earns anything. As in the UG, the sub-game perfect equilibrium of MUG consists of a very low value of proposal and very low values of threshold [16].

Previous studies with the UG [47, 17] and the MUG [15, 18, 19] assume that the roles of the Proposer and Responder are attributed following uniform probability distributions: each agent has the same probability of being selected to play as the Proposer. These assumptions are naturally at odds with reality. In real-life Ultimatum Games, being the Proposer or the Responder depends on particular agents’ characteristics. Proposers, such as employers, investors, auction first-movers, and rich countries, are in the privileged position of having the material resources to decide upon which proposals to offer. This advantageous role is notorious if, again, one considers the theoretical prediction of payoff division in the UG (sub-game perfect equilibrium) posing that Proposers will keep the largest share of the resource being divided. The benefits of Proposers are more evident when proposals are made to groups, as Responders need to divide the offers—thus increasing the gap in gains between the single Proposer and the Responders. In this multiplayer context, punishing Proposers becomes harder: any attempt to punish unfair offers is only effective if there is a successful collective agreement—amongst Responders—to sacrifice individual gains and reject an offer. Asserting that these two roles are asymmetric, so should be the criteria to assign them, leading us to two main questions:(i)How should a Proposer be selected within a group, in Multiplayer Ultimatum Games, to guarantee efficiency and fairness?(ii)The fact that individuals are often embedded in networks makes it important to understand to which extent the network ties and the way groups are assembled influence overall fairness, exchanges, and cooperation. Given this networked context, which network-based role-assignment criteria can be used to maximize long-term efficiency and fairness?

Here, we introduce a model, based on evolutionary game theory (EGT) [20, 21] and complex networks, to approach the previous questions. We analyze multiplayer ultimatum games in heterogeneous complex networks through the network centrality-based role assignment. The fact that networks are heterogeneous allows us to test several node properties and centrality measures as base criteria for defining how to select Proposers in a group. We focus on degree centrality. We find that selecting low-degree Proposers elicits fairer offers and increases the overall fitness (average payoff) in a population.

1.1. Related Work

The questions we address in this work—and the model proposed to tackle them—lay on the interface between mechanisms for fairness elicitation in multiagent systems, multilayer bargaining interactions, dynamics on complex networks, and network interventions to sustain socially desirable outcomes.

Some of the most challenging contexts to elicit fairness involve the tradeoff between payoff-maximizing outcomes and fair outcomes. As stated, the UG [2] has been a fundamental interaction paradigm to study such dilemmas. In this context, reputations [5] and stochastic effects [7] were identified as mechanisms that justify fair behaviors. Page et al. found that, in a spatial setting, fairer proposals emerge as clusters of individuals proposing high offers are able to grow [6]. Also, in the realm of interaction networks, De Jong et al. concluded that scale-free networks allow agents to achieve fairer agreements; rewiring links also enhances the agents’ ability to achieve fair outcomes [4]. A game similar to the UG assumes that Responders are unable to reject any proposal and Proposers unilaterally decide about a resource division. This leads to the so-called Dictator Game. In this context, reputations and mechanisms based on partner choice were also identified as drivers of fair proposals [22].

The previous works assume that all agents have the same probability of playing the role of Proposer or Responder. Going from well-mixed (i.e., all individuals are free to interact with everyone else) to complex networks, however, provides the opportunity to implement network-based role assignment that considers network measures. In this context, Wu et al. studied the pairwise UG in scale-free networks, with roles being attributed based on network degrees. The authors show that attributing the role of Proposer to high-degree nodes leads to unfair scenarios [23]. Likewise, Deng et al. studied the role assignment based on degree, concluding that the effect of degree-based role assignment depends on the mechanism of strategy update [24]. When considering a pairwise comparison based on accumulated payoffs and social learning (as we do in the present work), the levels of contribution in the population increase if lower-degree individuals have a higher probability of being the Dictators. Both works consider the pairwise Ultimatum Game.

In this work, we use a multiplayer version of the UG (MUG) proposed in [15]. Other forms of Multiplayer Ultimatum Games can be found in [19, 25, 26]. Santos et al. studied this game in the context of complex networks, showing that fairness is augmented whenever the networks, upon which the game is played, allow agents to exert a sufficient level of influence over each other, by repeatedly participating in each other’s interaction groups. The authors also find that stricter group decision rules (i.e., high in MUG) allow for fairer strategies to evolve under MUG. Here, we use networks to define group formation as suggested in the previously mentioned work (originally in [27]) and as exemplified in Figure 2.

Departing from previous works that study degree-based role assignment in pairwise Ultimatum Games [23, 24], we focus on a multiplayer game. As mentioned, this version highlights the asymmetries between the Proposer and Responder roles. By comparison with the UG, MUG Proposers are likely to receive an even higher share of payoffs than each Responder as the latter must divide any accepted offer between themselves. Moreover, in order to punish unfair MUG Proposers, Responders must act as a group which may naturally call for extra coordination mechanisms. Also, in contrast with [23, 24], here we combine the study of network-based role assignment with different voting mechanisms; we show that, whenever highly connected nodes are the natural candidates to play the role of Proposer, stricter voting rules (i.e., imposing an accepting consensus as requirement for collectives to accept a proposal) attenuate the emergent level of inequality.

Finally, the approach we follow in this work is akin to testing network interventions for social good. Several works study social dilemmas on top of complex networks and stress the conditions leading, in this context, to socially desirable outcomes [2831]. In this realm, we shall underline a recent work that employs EGT—as we do in the present paper—to study interventions that aim to sustain cooperation in complex networks [32]. The authors conclude that local interventions, i.e., based on information about the neighborhood of the affected node, outperform global ones. A similar conclusion is presented in [28].

2. Materials and Methods

Here, we detail the proposed evolutionary game theoretical model to evaluate the effect of degree-based role assignment on fairness under MUG. We start by providing details on the payoff calculation under MUG.

2.1. Multiplayer Ultimatum Games

In the 2-player UG, a Proposer has a resource and is required to propose a division with a Responder. The game only yields payoff to the participants if the Responder accepts the proposal [2]. Given a Proposer with strategy and a Responder with strategy , the payoff for the Proposer yieldsand for the Responder,

In the MUG, proposals are made by one Proposer to the remaining Responders, who must individually reject or accept them [15, 18]. Since individuals may act both as Proposers and Responders (with a probability that will depend on node characteristics), we assume that each individual adopts a strategy (, ). When playing as the Proposer, individuals offer to the Responders. Responders will individually accept or reject the offer having their as a threshold: if the share of an offer is equal or larger than (i.e., ), the individual accepts the proposal. Otherwise, the Responder rejects that proposal. We can regard as the minimum fraction that an individual is willing to accept, relative to the maximum to be earned as a Responder in a group of a certain size. Alternatively, we could assume that individuals ignore the group size and as such, when faced with a proposal, they must judge the absolute value of that proposal (an interpretation that also holds if we assume that individuals care about the whole group payoff).

Overall group acceptance will depend upon , the minimum fraction of Responders that must accept the offer before it is valid. Consequently, if the fraction of individual acceptances stands below , the offer will be rejected. Otherwise, the offer will be accepted. In this case, the Proposer will keep to himself and the group will share the remainder; that is, each Responder gets . If the proposal is rejected, no one earns anything. All together, in a group with size composed of 1 Proposer with strategy and Responders with strategies , the payoff of the Proposer is given bywhere is the Heaviside step function that evaluates to 1 when and evaluates to 0 when . The payoff of any Responder in the group yields

We assume that MUG interactions are played on a complex network, in which individuals are assigned nodes and links define who can interact with whom. Following [27, 33], every neighborhood characterizes a N-person game, such that the individual fitness (or success) of an individual is determined by the payoffs resulting from the game centered on herself plus the games centered on her direct neighbors. We provide a visual representation of such group formation in Figure 2. Degree heterogeneity will create several forms of diversity, as individuals face a different number of collective dilemmas depending on their degree (and social position); groups where games are played may also have different sizes. Such diversity is introduced by considering two types of scale-free networks. One is generated with the Barabási–Albert algorithm (BA) of growth and preferential attachment [34] leading to a power-law degree distribution, high correlation in the degrees of neighboring nodes, and a low clustering coefficient. The clustering coefficient offers a measure of the likelihood of finding triangular motifs or, in a social setting, the likelihood that two friends of a given node are also friends of each other, a topological property of relevance in the context of fairness and N-person games [33]. In the second case, we consider the Dorogovtsev–Mendes–Samukhin (DMS) model [35], exhibiting the same power-law degree distributions, yet with large values of the clustering coefficient.

2.2. Networks Generated

In the BA model [34], at each time step, the network grows by adding a new node and connecting it to m other nodes already in the network. These connections are probabilistic, depending on the degree of the nodes to be connected with: having a higher degree increases the probability of gaining a new connection. This process results in heterogeneous degree distributions, in which older nodes become highly connected (creating the so-called hubs). This is the combination of two processes–growth and preferential attachment. In the DMS model [35], at each time step, a node is added; instead of choosing other nodes to connect with, it chooses one existing edge randomly and connects to both ends of the edge. The networks generated by the DMS model have higher clustering coefficient than those with the BA model, combining the high clustering and high heterogeneity that characterize real-world social networks.

2.3. Network-Based Role Selection

Previous works show that anchoring the probability of nodes being selected for the role of Proposer or Responder on their degree has a sizable and nontrivial effect on the evolving magnitude of proposals in traditional two-person Ultimatum Games [23, 24]. Considering multiplayer ultimatum games, however, opens space to study the interplay between group characteristics (such as group sizes) and network-based criteria to select Proposers in completely unexplored directions. So far, we assume that nodes are selected to be Proposers based on their degree. As such, in a group with individuals, where each individual has degree , the probability that is selected as the Proposer is given by , where controls the influence of degree on role selection. One node is selected as the Proposer, and the remaining play as Responders.

2.4. Evolutionary Dynamics

We simulate the evolution of and in a population of size , much larger than the group size . Initially, each individual has values of and drawn from a discretized uniform probability distribution in the interval . The fitness of an individual of degree is determined by the payoffs resulting from the game instances occurring in groups: one centered on herself plus others centered on each of her neighbors (see Figure 2). Values of and evolve as individuals tend to imitate (i.e., copy and ) the neighbors that obtain higher fitness values.

The numerical results presented in the following were obtained for structured populations of size . Similar results were obtained for . As already mentioned, we consider networks generated with both BA and DMS algorithms, with average degree . Simulations take place for generations, considering that, in each generation, all the individuals have (on average) the opportunity to revise their strategy through imitation once.

At every (discrete and asynchronous) time step, two individuals and (neighbors) are selected from the population. Given the group setting of the MUG, is chosen from one of the neighbors of . Their individual fitness is computed as the accumulated payoff in all possible groups for each one, provided by the underlying structure (in each group, the role of Proposer or Responder is selected following the results); subsequently, copies the strategy of with a probability that is a monotonic increasing function of the fitness difference , following the pairwise comparison update rule: [36].

The parameter specifies the selection pressure ( represents neutral drift, and represents a purely deterministic imitation dynamics). Imitation is myopic: the value of and copied will suffer a perturbation due to errors in perception, such that the new parameters will be given by and , where and are uniformly distributed random variables drawn from the interval . This feature not only (i) models a slight blur in perception but also (ii) helps to avoid the random extinction of strategies and (iii) ensures a complete exploration of the strategy spectrum. To guarantee that new and are not lower than 0 or higher than 1, we implement reflecting boundaries at 0 and 1. Alternative options of mutation operators to test in the future include drawing mutations from normal distributions considering absorbing boundaries [37].

Furthermore, with probability , imitation will not occur and the individual will adopt random values of and , drawn from a uniform distribution over . This can either represent the adoption of a random strategy by an individual or a low rate of existing players being replaced by new naive players. We use , , and throughout this work. The effect of varying is similar to the one verified when changing : an overall increase of randomness leads to higher chances of fairer offers (as in [7, 15]). For each combination of parameters, the simulations are repeated 100 times (10 times using 10 different networks from each class studied), whereas each simulation starts from a population where individuals are assigned random values of and drawn uniformly from . We provide a summary of the algorithm used to revise agents’ strategies in Algorithm 1. The average values of p, q, and f (denoted by , , and ) are obtained as a time and ensemble average, taken over all the runs (considering the last generations, disregarding an initial transient period).

Initialize all
For To do Main cycle of interaction and strategy update:
 For To Select agent to update:
  /∗ Sample two neighbors of the population
   (agent to update)
   (agent to be imitated)
  if then Mutation:
   
   
  else Imitation:
    fitness()
    fitness()
   
   If then
     + imitation error
     + imitation error

3. Results and Discussion

We run the proposed model and record the average strategies played by the agents over time and over different runs (starting from different initial conditions, see Materials and Methods). We find that attributing the role of Proposer to low-degree nodes (or low-degree Proposer assignment) increases the average level of proposal, , adopted in the population of adaptive agents. This means that the payoff gap between Proposers and Responders is alleviated. Figure 3 shows that, for low (), we obtain higher levels of average proposal when considering both BA (low clustering coefficient) and DMS (high clustering coefficient) networks. We observe a steep decline in average proposals when the role of Proposer and Responder is attributed, regardless the degree of individuals (). The low-proposal tendency is maintained if the role of Proposer is assigned to high-degree nodes ().

We also confirm that the high-degree Proposer assignment leads to unequal (unfair) results within a population. Figure 4 depicts the average payoff gains for individuals with a certain degree. We can observe that, for , high-degree nodes obtain much higher values of payoff than low-degree nodes. This situation is ameliorated if individuals with a lower degree are given a higher chance of becoming Proposers (lower ) and, to a lower extent, if more Responders are required to accept a proposal in order for it to be accepted (higher , (d–f) in Figure 4).

We can further verify the effect of on fairness through the so-called Lorenz curves [38], often used to compute the Gini coefficients [39] that quantify income inequality. In Figure 5, we represent the Lorenz curves associated with different role-assignment rules () and voting rules, . Each curve is generated by ordering individuals by increasing the value of income plotting the corresponding cumulative distribution. A curve closer to the perfect equality line () represents a more egalitarian distribution of resources and a lower Gini coefficient. As we verify in Figure 5, the most unequal outcomes (higher Gini) are obtained for higher . We further verify that, when fixing , having stricter voting rules (high ; in this case, ) attenuates the unfairness associated with having hubs as the Proposers.

Not only does the low-degree Proposer assignment reduce unfairness, it also sustains more efficient outcomes—taken as higher values of average fitness observed in the population. In Figure 6, we confirm that low values of maximize the average fitness of populations. This occurs when considering heterogeneous networks with different average degrees () and group decision rules (). This effect is more evident when considering less-strict group decision rules (that is, lower , meaning that less number of Responders are required to accept a proposal for the group to accept it) and networks with higher .

Finally, we confirm that the low-degree Proposer assignment maximizes the average proposal played in the population (and thus fairness) when considering networks with higher and, as a result, larger average group sizes. As Figure 7 conveys, the higher values of average proposal, , are obtained for . Notwithstanding, we are able to find parameter spaces where the dependence of on is seemingly affected by (i) the average connectivity of the network—and thus the average size of the groups in which MUG are played—and (ii) particular values of . Also, we confirm that increasing increases for all values of . Our results suggest that offering the first move to low-degree nodes balances the natural power of highly connected nodes in scale-free networks, leading to a significant increase in the global levels of fairness. Interestingly, we also find that particular voting rules () are able to attenuate the negative effect of high (i.e., privileged high-degree nodes being selected to be Proposers) on fairness.

One can reach an additional intuition for the increase of fairness through the attribution of the role of Proposer to low-degree nodes if we approximate scale-free networks to a collection of heterogeneous starlike structures [27]. For simplicity, let us consider two hubs ( and , both with degree ) at the center of two stars, each with two prevalent values (high in the green star, around , and low in the blue star, around ) (see Figure 8). Under this configuration, we may ask which strategy ( or ) will prevail. For that, we note that, within each star, the strategies of the hubs are likely to locally prevail and thereby we focus on strategy invasion along the edge connecting both hubs (red/thick link); we further assume that is characterized by a higher value of than (). The question is will impact the total payoff of and ( and , respectively) such that under high is likely to be imitated by () and under low is likely to be imitated by ()? The answer is yes, if high-degree nodes are preferentially selected as Proposers (high ), the total payoffs of and decrease with the value of characterizing their stars, and , respectively. As a result, if , it is likely that gets imitated by which contributes to decrease in the average value of in both stars (Box 1). Conversely, if high-degree nodes are preferentially selected as Responders (low ), the fitness of and will increase with and , respectively. As a result, the hub associated with the star revealing a higher p is likely to be imitated which, if , implies that will tend to be imitated by ; the average value of p in both stars thereby increases (Box 2). This intuition hinges on the assumption that all offers are accepted, which is only true for or for all nodes. If increases, it is harder for low proposals to get accepted as they require a higher number of accepting Responders to be validated which contributes for to increase overall and to fairer proposals [15, 33]. This intuition remains valid if such heterogeneous structures portray a high clustering coefficient (e.g., when leaves of each starlike community are linked to each other), offering an additional intuition on why the DMS and the BA network models offer similar results.

4. Conclusion

In this paper, we address the general problem of (1) deciding how to attribute bargaining roles in a social network and, in particular, (2) understanding the impact of different criteria on the emerging levels of fairness in Multiplayer Ultimatum Games. We verified that attributing the role of Proposer to low-degree nodes boost both fairness and overall fitness. This conclusion remains valid for different network structures (BA and DMS networks with average degrees ranging from 4 to 16) and interaction scenarios (in terms of group sizes and group decision rules).

We also find that the perils of having high-degree Proposers can be softened with strict group decision rules. This means that, whenever is high, can be default, and cannot be lowered (e.g., hubs by having the needed resources to be the first movers in a bargaining situation), unfairness can be reduced by imposing that proposals need to be validated by a large fraction of Responders. The effect of on eliciting fairer offers is similar to that found in the recent literature [15, 18]. By considering a higher , more accepting Responders are required in order for a proposal to be accepted; as a result, it is harder for unfair Proposers (i.e., adopting lower ) to have their proposals accepted and increase their payoffs by keeping the largest sum of the initial endowment to themselves. As a result, there is a tendency for the average offer in the population, , to increase. Also, our results are in line with works showing that selecting low-degree Proposers maximizes fairness in the context of pairwise Ultimatum Games [23] and Dictator Games [24]. Here, we confirm that the mechanisms contributing for fairness through adaptive role assignment in the pairwise UG are likely to extend to Multiplayer Ultimatum Games, a setting where (as discussed in Related Work) the asymmetry in payoffs between Proposers and Responders is exacerbated; extending the analysis of role assignment in MUG allows one to cover N-person interactions and, importantly, to test how voting mechanisms can curb the effects of particular role assignments.

This work can underlie several extensions of interest for social and engineering sciences. Here, we consider that role assignment is endogenously imposed. In reality, the tendency for certain nodes to be allocated with particular roles is likely to evolve side-by-side with individual strategies, being another self-organized property of the system, like fairness and wealth distributions. Other sources of heterogeneity known to influence the propensity to be fair, such as cultural [40] and socioeconomic [41] settings or individuals’ engagement in institutions [42], may further influence how roles and power dependencies [43] are assigned. Moreover, the fact that network-based role assignment elicits fairness in rather complicated scenarios—as multiplayer bargaining games—suggests that such an approach could also be used within the broader context of active interventions aiming at fostering fairness in hybrid populations comprising humans and machines [18, 4446]. In this context, it would be relevant to assess—both experimentally and through numerical simulations—the impact on human decision-making of having virtual regulators dynamically deciding the role to adopt by their group peers, depending on their position in the interaction structure.

Finally, we note that, while here we consider static networks, it is likely that dynamic networks [4750] can offer extra means for degree and roles to become correlated over time. For example, if fair Proposers (or lenient Responders) attract a higher number of neighbors, their degree will increase as a by-product of their role and strategy, which may imply that effective values of may emerge from the coevolution of strategies and social ties.

Despite these open questions, our present work already suggests that carefully selecting the role of each agent within a group—depending on their social position and without limiting their available options—can offer a long-term social benefit, both in terms of the overall levels of fairness, wealth inequality, and global wealth of a population comprised of self-regarding agents.

Data Availability

The methods to produce the data that support the findings of this study are included within the article.

Disclosure

A preliminary version of this work was presented in Abstracts from the Proc. of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021).

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was partially supported by FCT-Portugal (UIDB/50021/2020, PTDC/MAT-APL/6804/2020, and PTDC/CCI-INF/7366/2020). F. P. S. acknowledges support from the James S. McDonnell Foundation Postdoctoral Fellowship Award.