Role of delay-based reward in the spatial cooperation

https://doi.org/10.1016/j.physa.2016.08.014Get rights and content

Highlights

  • We introduce delay-based reward into spatial cooperation, in which individuals can select various reward amounts on the basis of delay.

  • We find that the intermediate differences of reward among kinds of delays are most important contributions to cooperation.

  • We show that the individuals who select the fastest and smallest reward in evolving have strongest competition.

Abstract

Strategy selection in games, a typical decision making, usually brings noticeable reward for players which have discounted value if the delay appears. The discounted value is measure: earning sooner with a small reward or later with a delayed larger reward. Here, we investigate effects of delayed rewards on the cooperation in structured population. It is found that delayed reward supports the spreading of cooperation in square lattice, small-world and random networks. In particular, intermediate reward differences between delays impel the highest cooperation level. Interestingly, cooperative individuals with the same delay time steps form clusters to resist the invasion of defects, and cooperative individuals with lowest delay reward survive because they form the largest clusters in the lattice.

Introduction

Game theory that seeks to understand the strategy selection of individuals to maximize the payoffs in specific condition, has attracted more ​attention  [1], [2]. Various researches about evolutionary games have focused on strategy selection mechanisms in repeated games  [3], [4]. Individual within evolutionary games framework, selects the strategy of the next round by referring to the past payoff and opponent’s strategy, such as cooperation or defection. For instance, tit-for-tat [5], [6], [7], [8], [9], win–stay and lose–shift  [10], [11] have been explored. Games in structured population, it is found that spatial topology favors cooperation, where individual interacts with its direct neighbors and gains a payoff  [12], [13], [14], [15], [16]. By means of strategy adoption rules, such as deterministic  [17], [18] and pairwise-comparison  [19], [20], [21], [22], [23], individual adjusts its strategy by comparing the performance of neighbors’ and itself. Some features, including memory  [24], reputation  [25], [26], diversity  [27], [28], [29], [30] have been incorporated into spatial games. In addition, punishment and reward  [31], [32], including antisocial pool reward  [33], rewarding with links  [34] and adaptive rewarding  [35] have also been examined from both experimental and theoretical aspects  [36], [37]. Previous results also show that incorporating of punishment and reward groups into the original game model can produce interesting phase diagram  [38].

Because individuals are driven by payoffs during the decision making process, the emerging of delayed and uncertain rewards may affect their strategies selection  [39], [40], [41], [42]. For example, when an individual is required to choose between a smaller reward available sooner and a larger reward available later, where the decision usually involves reward amount which is so called temporal discounting  [43]. That is, individual discounts the value of reward based on the delay. The general formula of discounting in neurobiological and economic science is exponential, while the alternative form in psychology experiments is a hyperbolic V=1/(1+Kd), where V is the subject value for delay d, and K marks the discounting factor  [44].

Players participating in evolutionary games, are also required to make decisions in each round, such as cooperation or defection. Especially, when cooperative players are rewarded on the basis of cooperative rounds. Then, from the perspective of temporal discounting in neurobiological and economic science, incorporating of delay will lead to varied strategy selections  [45], [46], such as cooperating once to obtain the soonest but smallest reward, or largest but delayed reward by cooperating with several rounds. And how this decision making affects the spatial cooperation, especially when individuals can adopt the strategy and reward selection of neighbors simultaneously. To address this problem, we explore the role of delay-based reward for decision making in spatial cooperation. Here, individuals can select either the larger but available later or smaller and sooner reward based on cooperative rounds. We assume individuals select the reward choices and strategies of neighbors at the same time. We show that, delay-based reward spreads of cooperation on lattice, random and small-world networks. The results prove that, temporal discounting impels the intermediate reward variation for different cooperative rounds to be most important contribution to the cooperation. We find that, cooperators who select the fastest and lowest reward have the strongest competition in the system, rather than who choose delayed and larger reward. This paper is organized as follows: Section  2 presents the model, Section  3 demonstrates numerical results, and the discussion is presented in Section  4.

Section snippets

Prisoner’s dilemma game with delay-based reward

In the spatial prisoner’s dilemma game (PDG), individuals locate on the sites of networks (Newman–Watts small-world networks and square lattice networks with periodic boundary conditions). Initially, each individual is designed either as a cooperator (sx=C) or a defector (sx=D) with equal probability, and it plays PDG with its nearest neighbors. For each individual, its payoff is composed of the total payoffs acquired from the neighbors in games, and cooperators can obtain an extra reward

Results

Fig. 1 shows cooperation level of PDG accompanied with delay-based reward in decision making, for square lattice networks. We see that, cooperation level with delay-based reward is greatly improved, compared with the original PDG. Fig. 1 depicts that, even for large enough b, i.e. b=2, cooperators can still resist the invasion of defectors. It can be noted that cooperation level fC extremely relies on the discounting factor K for small b. However, for b>1.5,fC is almost independent of K,

Summary

Evolutionary games on spatial networks have attracted increasing attention for past years  [53], [54]. Reward is a common mechanism used to improve the cooperation level. So far, the general method to explore the reward effect on cooperation is incorporating reward cooperator into original two species games. It assumes that reward cooperators are willing to extract some of their payoffs, and share these payoffs with cooperative neighbors. It has been proven incorporating reward cooperators can

References (54)

  • H. Gintis, Game Theory Evolving: A Problem-centered Introduction to Modeling Strategic Interaction, Princeton, NJ:...
  • J. Hofbauer et al.

    Evolutionary Games and Population Dynamics

    (1998)
  • J.M. Smith

    Evolution and the Theory of Games

    (1982)
  • M. Nowak et al.

    A strategy of win-stay, lose-shift that outperforms tit-for-tat in the prisoner’s dilemma game

    Nature

    (1993)
  • C. Wedekind et al.

    Human cooperation in the simultaneous and the alternating prisoner’s dilemma: Pavlov versus generous tit-for-tat

    Proc. Natl. Acad. Sci. USA

    (1996)
  • M.A. Nowak

    Five rules for the evolution of cooperation

    Science

    (2006)
  • Y. Liu et al.

    Win-stay-lose-learn promotes cooperation in the spatial prisoner’s dilemma game

    PLoS One

    (2012)
  • M.A. Nowak et al.

    Evolutionary games and spatial chaos

    Nature

    (1992)
  • Z. Wang et al.

    Interdependent network reciprocity in evolutionary games

    Sci. Rep.

    (2013)
  • M. Perc et al.

    Heterogeneous aspirations promote cooperation in the prisoner’s dilemma game

    PLoS One

    (2010)
  • C. Hauert et al.

    Volunteering as red queen mechanism for cooperation in public goods games

    Science

    (2002)
  • A. Szolnoki et al.

    Selection of noise level in strategy adoption for spatial social dilemmas

    Phys. Rev. E

    (2009)
  • M.A. Nowak et al.

    The spatial dilemmas of evolution

    Int. J. Bifurcation Chaos

    (1993)
  • C. Hauert et al.

    Spatial structure often inhibits the evolution of cooperation in the snowdrift game

    Nature

    (2004)
  • G. Szabó et al.

    Phase transitions and volunteering in spatial public goods games

    Phys. Rev. Lett.

    (2002)
  • J. Vukov et al.

    Evolutionary prisoner’s dilemma game on newman-watts networks

    Phys. Rev. E

    (2008)
  • L.-L. Jiang et al.

    Role of adaptive migration in promoting cooperation in spatial games

    Phys. Rev. E

    (2010)
  • Cited by (32)

    • The influence of aggressive behavior on cooperation evolution in social dilemma

      2023, Physica A: Statistical Mechanics and its Applications
    • Based on reputation consistent strategy times promotes cooperation in spatial prisoner's dilemma game

      2023, Applied Mathematics and Computation
      Citation Excerpt :

      Among them, network reciprocity is a very important evolutionary rule that has been widely studied by scholars. To facilitate evolution and research, the corresponding evolution carrier is mainly based on the spatial square network with periodic boundary conditions [11,14,16,20,21,25,29–31,33,35–43]. To solve dilemmas brought by game, related scholars have proposed many evolutionary game incentive mechanisms, such as exclusion [16–20], punishment [21–28], reward [29–31], tax [32], aspiration [33,34], memory effect [35–37], and reputation [38–43], etc.

    View all citing articles on Scopus
    View full text