idw – Informationsdienst Wissenschaft

10/27/2022 09:29

Why do we learn to reward cooperation?

Michael Hesse, Press and Public Relations
Max Planck Institute for Evolutionary Biology

    Researchers at the Max Planck Institute in Plön show that reputation plays a key role in determining which rewarding policies people adopt. Using game theory, they explain why individuals learn to use rewards to specifically promote good behaviour.

    Often, we use positive incentives like rewards to promote cooperative behaviour. But why do we predominantly reward cooperation? Why is defection rarely rewarded? Or more generally, why do we bother to engage in any form of rewarding in the first place? Theoretical work done by researchers Saptarshi Pal and Dr. Christian Hilbe at the Max Planck Research Group ‘Dynamics of Social Behaviour’ suggests that reputation effects can explain why individuals learn to reward socially.

    With tools from evolutionary game theory, the researchers construct a model where individuals in a population (the players) can adopt different strategies of cooperation and rewarding over time. In this model, the players’ reputation is a key element. The players know, with a degree of certainty (characterized by the information transmissibility of the population), how their interaction partners are going to react to their behaviour (that is, which behaviours they deem worthy of rewards). If the information transmissibility is sufficiently high, players learn to reward cooperation. In contrast, without sufficient information about peers, players refrain from using rewards. The researchers show that these effects of reputation also play out in a similar way when individuals interact in groups with more than two individuals.

    Antisocial rewarding

    In addition to highlighting the role of reputation in catalyzing cooperation and social rewarding, the scientists identify two scenarios in which antisocial rewarding may evolve. Antisocial rewarding requires either that populations be assorted or that rewards benefit both the recipient and the provider of the reward. “These conditions under which people may learn to reward defection are, however, a bit restrictive, since they additionally require information to be scarce,” adds Saptarshi Pal.

    The results from this study suggest that rewards are only effective in promoting cooperation when they can sway individuals to act opportunistically. These opportunistic players only cooperate when they anticipate a reward for their cooperation. A higher information transmissibility increases both the incentive to reward others for cooperating and the incentive to cooperate in the first place. Overall, the model suggests that when people reward cooperation in an environment where information transmissibility is high, they ultimately benefit themselves. This interpretation takes the altruism out of social rewarding: people may use rewards not to enhance others’ welfare, but to help themselves.
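
    The self-interest argument above can be illustrated with a toy calculation. The sketch below is not the model from the publication; it is a simplified illustration under stated assumptions. The parameter names (lam for information transmissibility, b for the benefit of receiving cooperation, gamma for the cost of giving a reward, q for the chance that an uninformed partner cooperates anyway) and all numerical values are hypothetical. It compares the expected payoff of someone who rewards cooperation with that of someone who never rewards, when both face an opportunistic partner.

```python
# Toy illustration only: NOT the model from the original publication.
# All parameter names and default values are hypothetical assumptions.

def rewarder_payoff(lam, b, gamma, q):
    """Expected payoff of a player who rewards cooperation.

    Assumption: with probability lam the opportunistic partner knows a
    reward is coming and cooperates; otherwise the partner cooperates
    with default probability q. Each act of cooperation yields b to the
    rewarder, who then pays gamma for the reward.
    """
    p_coop = lam + (1 - lam) * q
    return p_coop * (b - gamma)


def non_rewarder_payoff(lam, b, q):
    """Expected payoff of a player who never rewards.

    Assumption: an informed opportunistic partner (probability lam)
    defects; an uninformed one cooperates with default probability q.
    """
    p_coop = (1 - lam) * q
    return p_coop * b


def rewarding_pays(lam, b=4.0, gamma=1.0, q=1.0):
    """True if rewarding cooperation gives the higher expected payoff."""
    return rewarder_payoff(lam, b, gamma, q) > non_rewarder_payoff(lam, b, q)


if __name__ == "__main__":
    for lam in (0.1, 0.5, 0.9):
        print(lam, rewarding_pays(lam))
```

    In this toy setting with q = 1, rewarding is the better choice for the rewarder exactly when lam exceeds gamma / b (0.25 for the values above), which mirrors the qualitative claim that rewarding cooperation becomes self-serving once information is sufficiently available.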


    Contact for scientific information:

    Christian Hilbe
    Max Planck Research Group Dynamics of Social Behavior,
    Max Planck Institute for Evolutionary Biology, 24306, Plön, Germany


    Original publication:

    https://www.nature.com/articles/s41467-022-33551-y


    Images

    Results from evolutionary simulations display the co-evolution of cooperation and social rewarding in a population. At low information transmissibility, most population members learn not to reward others.




     
