Introduction Model Results Discussion

Observations on Cooperation

Yuval Heller (Bar Ilan) and Erik Mohlin (Lund)

Erice 2017

Heller & Mohlin

Observations on Cooperation


Motivating Example

Alice interacts with a remote trader, Bob.
Both agents have opportunities to shirk/cheat.
Alice obtains anecdotal evidence about Bob’s actions in a couple of past interactions.
Alice considers this information when deciding how to act.
Alice is unlikely to interact with Bob again.
Future partners may ask Bob about Alice’s behavior.

Research question: Can cooperation be sustained in such environments?

Underlying Game: The Prisoner’s Dilemma (PD)

Payoffs (row player, column player):

            c             d
  c       1, 1        −l, 1+g
  d     1+g, −l         0, 0

g > 0: gain of a greedy player.
l > 0: loss if the partner defects.
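The payoff matrix above can be written down directly in code. The following is a minimal sketch, not part of the talk: the function names `pd_payoffs` and `defection_is_dominant` are hypothetical, and the only inputs are the two parameters g and l from the slide. It also checks the defining property of a Prisoner’s Dilemma, namely that d strictly dominates c.

```python
from typing import Dict, Tuple

Profile = Tuple[str, str]


def pd_payoffs(g: float, l: float) -> Dict[Profile, Tuple[float, float]]:
    """Payoffs (row player, column player) for each action profile."""
    if g <= 0 or l <= 0:
        raise ValueError("the slide assumes g > 0 and l > 0")
    return {
        ("c", "c"): (1.0, 1.0),
        ("c", "d"): (-l, 1.0 + g),
        ("d", "c"): (1.0 + g, -l),
        ("d", "d"): (0.0, 0.0),
    }


def defection_is_dominant(g: float, l: float) -> bool:
    """True if d gives the row player strictly more than c against both actions."""
    u = pd_payoffs(g, l)
    return all(u[("d", other)][0] > u[("c", other)][0] for other in ("c", "d"))
```

For any g > 0 and l > 0 the dominance check holds, since 1+g > 1 and 0 > −l.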

Brief Summary of Results

1. Novel behavior supports stable cooperation (uniqueness in the restricted set of stationary strategies).
2. Stable cooperation requires observation of 2+ interactions.
3. Observation of partner’s past actions: g > l: only defection is stable; g < l: cooperation is stable (and robust to any noise).
4. Observation of action profiles: cooperation is stable iff g < (l+1)/2.
5. Optimal feedback: observing partner’s actions against cooperation.


Observation Structure and Environment Strategies and Steady States Solution Concept

Observation Structure and Environment

Basic stationary model (focus of the presentation):
Each player privately observes a sample of k actions played by his partner (against other opponents).
Agents are restricted to stationary strategies.
IID sampling from the partner’s stationary behavior.
Set of possible signals: m ∈ {0, 1, 2, ..., k} (interpreted as the number of observed defections).

Alternative model: unrestricted set of strategies, observing the last k actions. All the results hold except uniqueness.

Stationary Strategies

Definition (Strategy, s : {0, ..., k} → ∆({c, d}))
A mapping that assigns a mixed action to each possible observation.
Interpretation: the agent’s behavior conditional on the observed signal.

Strategy distribution: a distribution σ over the set of strategies (with finite support).
Interpretation: heterogeneous population.

Example of a Strategy Distribution

supp(σ) = {s_u, s_1, s_2}
s_u ≡ 50%
s_1(m) = c if m = 0, d if m ≥ 1
s_2(m) = c if m ≤ 1, d if m ≥ 2
σ(s_u) = ε, σ(s_1) = (1−ε)/6, σ(s_2) = 5·(1−ε)/6

Consistent Signal Profile

Definition (Signal profile, θ : supp(σ) → ∆(M))
θ(s) is interpreted as the distribution of signals observed by agents who are matched with a partner who plays strategy s.

Definition (Consistent signal profile)
A signal profile θ and a strategy distribution σ jointly induce a behavior profile: a distribution of actions for each strategy.
The behavior profile induces a signal profile of observed actions.
Consistency: the induced signal profile is θ.

A strategy distribution may admit multiple consistent signal profiles. This multiplicity is ignored in this presentation.
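The consistency requirement is a fixed-point condition, and for the example distribution (s_u, s_1, s_2) it can be explored numerically. The sketch below is an illustration under stated assumptions, not the authors’ procedure: it takes k = 2 and ε = 0.1 as arbitrary values, reads s_u ≡ 50% as "defect with probability 0.5 after every signal", represents a strategy as a function from the signal m to a defection probability, and uses plain iteration from full cooperation, which is not guaranteed to converge for arbitrary strategies. All function names are hypothetical.

```python
from math import comb
from typing import Callable, Dict

Strategy = Callable[[int], float]  # signal m -> probability of defecting


def binomial_pmf(m: int, k: int, p: float) -> float:
    """Probability of observing m defections in k IID draws with rate p."""
    return comb(k, m) * p**m * (1.0 - p) ** (k - m)


def induced_rates(strategies: Dict[str, Strategy], sigma: Dict[str, float],
                  rates: Dict[str, float], k: int) -> Dict[str, float]:
    """Map each strategy's defection rate to the rate its behavior induces."""
    new_rates = {}
    for name, strategy in strategies.items():
        total = 0.0
        for partner, weight in sigma.items():
            # The signal about the partner is Binomial(k, partner's rate).
            total += weight * sum(
                binomial_pmf(m, k, rates[partner]) * strategy(m)
                for m in range(k + 1)
            )
        new_rates[name] = total
    return new_rates


def consistent_rates(strategies, sigma, k, max_iter=10_000, tol=1e-12):
    """Iterate the induced-rate map, starting from full cooperation."""
    rates = {name: 0.0 for name in strategies}
    for _ in range(max_iter):
        updated = induced_rates(strategies, sigma, rates, k)
        if max(abs(updated[n] - rates[n]) for n in rates) < tol:
            return updated
        rates = updated
    return rates


EPS, K = 0.1, 2  # assumed values, chosen only for illustration
strategies = {
    "s_u": lambda m: 0.5,
    "s_1": lambda m: 0.0 if m == 0 else 1.0,
    "s_2": lambda m: 0.0 if m <= 1 else 1.0,
}
sigma = {"s_u": EPS, "s_1": (1 - EPS) / 6, "s_2": 5 * (1 - EPS) / 6}
rates = consistent_rates(strategies, sigma, K)
# theta(s): distribution over signals m = 0..K when the partner plays s.
signal_profile = {
    name: [binomial_pmf(m, K, p) for m in range(K + 1)]
    for name, p in rates.items()
}
```

Starting the iteration from a different initial behavior may lead to a different consistent profile, which is the multiplicity mentioned on the slide.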

Commitment Strategies (“Crazy” Agents)

We refine our solution concept by requiring robustness to the presence of a few “crazy” agents (à la Kreps et al., 1982).

Definition (Distribution of Commitments, (Sc, λ))
Sc is a finite set of commitment strategies, and λ ∈ ∆(Sc) is a distribution over these strategies.
We assume that at least one of the commitment strategies is totally mixed.

Nash in Perturbed Environment

Definition (Perturbed environment)
A fraction ε of the agents are committed and play strategies drawn according to λ ∈ ∆(Sc).

Definition (Nash equilibrium in a perturbed environment)
π(σ*) ≥ π_s(σ*) for every strategy s, where π(σ*) denotes the mean payoff of the 1 − ε “normal” agents, and π_s(σ*) denotes the payoff to strategy s.
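A perturbed environment can be pictured as a mixture: a share 1 − ε of the population follows the normal agents’ distribution σ and a share ε follows λ. The sketch below builds that mixture. The function name `perturbed_population` and the dictionary representation are hypothetical. The usage example assumes, as one possible reading, that s_u in the earlier example is the single commitment strategy.

```python
from typing import Dict


def perturbed_population(sigma: Dict[str, float], lam: Dict[str, float],
                         eps: float) -> Dict[str, float]:
    """Overall strategy shares: (1 - eps) * sigma + eps * lam."""
    if not 0.0 <= eps <= 1.0:
        raise ValueError("eps must be a share in [0, 1]")
    mixed: Dict[str, float] = {}
    for name, weight in sigma.items():
        mixed[name] = mixed.get(name, 0.0) + (1.0 - eps) * weight
    for name, weight in lam.items():
        mixed[name] = mixed.get(name, 0.0) + eps * weight
    return mixed


# Normal agents split 1/6 : 5/6 between s_1 and s_2; committed agents play s_u.
population = perturbed_population(
    sigma={"s_1": 1 / 6, "s_2": 5 / 6}, lam={"s_u": 1.0}, eps=0.1
)
```

With these inputs the shares match the example distribution: ε for s_u, (1−ε)/6 for s_1, and 5·(1−ε)/6 for s_2.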


Perfect Equilibrium

Definition (Perfect equilibrium σ*)
The limit of Nash equilibria in some converging sequence of perturbed environments.

Definition (Strictly perfect action a*)
The limit behavior of Nash equilibria in any converging sequence of perturbed environments.


Taxonomy of PDs Observation of Actions Other Observation Structures

Results

Prisoner’s Dilemma - Taxonomy

Offensive (submodular; Takahashi, 2010) PD, l < g: stronger incentive to defect against a cooperative partner than against a defecting partner.
Defensive (supermodular) PD, l > g.
Acute PD, g > (l+1)/2: defection against a cooperator gives more than half of what the opponent loses.
Mild (mildly tempting) PD, g < (l+1)/2.
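The two classifications are independent of each other, so any PD falls into one cell of a 2×2 grid. The following sketch applies the inequalities from the taxonomy. The function name `classify_pd` is hypothetical, and labelling the knife-edge cases g = l and g = (l+1)/2 as "boundary" is an assumption, since the slide states only strict inequalities.

```python
from typing import Tuple


def classify_pd(g: float, l: float) -> Tuple[str, str]:
    """Return (offensive/defensive, acute/mild) for a PD with parameters g, l."""
    if g <= 0 or l <= 0:
        raise ValueError("a PD requires g > 0 and l > 0")
    if l < g:
        modularity = "offensive"
    elif l > g:
        modularity = "defensive"
    else:
        modularity = "boundary"  # g == l is not covered by the strict inequalities
    threshold = (l + 1.0) / 2.0
    if g > threshold:
        severity = "acute"
    elif g < threshold:
        severity = "mild"
    else:
        severity = "boundary"  # g == (l+1)/2 is not covered either
    return modularity, severity
```

For example, g = 2.5 and l = 3 gives a defensive but acute PD, while g = 0.6 and l = 0.5 gives an offensive but mild one, so all four combinations can occur.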

Stable Defection in any PD

Claim
Defection is a strictly perfect equilibrium action in any PD.


Defection is the Unique Outcome in Offensive PDs

Proposition
Assume an offensive PD (l < g) with observation of any number of actions. If σ* is a perfect equilibrium, then everyone defects.

Intuition:
Assume to the contrary that σ* ≢ d.
The direct gain from defecting decreases in the partner’s probability of defection.
The indirect loss is independent of the current partner’s behavior.
⇒ Incumbents are less likely to defect when observing more defections.
⇒ If Alice always defects, she outperforms the incumbents.

Remark (Alternative model with unrestricted set of strategies)
A weaker result: full cooperation isn’t a perfect equilibrium.
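The first step of the intuition can be checked with the stage-game payoffs. If the partner defects with probability p, defecting yields (1−p)(1+g) and cooperating yields (1−p) − p·l, so the direct gain from defecting is (1−p)·g + p·l. This decreases in p exactly when l < g. The sketch below computes the gain from the payoffs. The function name is hypothetical.

```python
def direct_gain_from_defecting(p: float, g: float, l: float) -> float:
    """Expected stage-payoff gain of d over c when the partner defects w.p. p."""
    payoff_defect = (1.0 - p) * (1.0 + g) + p * 0.0
    payoff_cooperate = (1.0 - p) * 1.0 + p * (-l)
    return payoff_defect - payoff_cooperate
```

In an offensive PD such as g = 2, l = 1 the gain falls from 2 at p = 0 to 1 at p = 1. In a defensive PD such as g = 1, l = 2 it rises from 1 to 2.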

Stable Cooperation in Defensive PD

Proposition
Assume g ≤ l and observation of k ≥ 2 actions. Cooperation is strictly perfect. Moreover, there is essentially a unique strategy distribution that supports cooperation (uniqueness relies on the restriction to stationary strategies).

Essentially Unique Stable State
Everyone cooperates when observing no defections.
Everyone defects when observing ≥ 2 defections.
A share 0 < q < 1/k of the incumbents defect when observing 1 defection (i.e., q of the agents follow s_1, and the remaining 1−q follow s_2).
The value of q depends on the commitment strategies.

Other Observation Structures

What happens if the signal about the partner also depends on the behavior of other opponents against her?
We study three observation structures:
1. The entire action profile. Signals: {CC, DC, CD, DD}.
2. Mutual cooperation or not (= conflict). Signals: {CC, not-CC}.
3. Observing actions against cooperation. Signals: {CC, DC, ?D}.

Stable Cooperation when Observing Mutual Cooperation

Proposition
If players observe conflicts (i.e., CC or not) in at least two interactions, then cooperation is a perfect equilibrium iff the PD is mild (g < (l+1)/2).

Intuition
Mild PDs: the perfect state is similar to the previous results. Players condition their play on the number of observed conflicts.
Acute PDs: involvement in a conflict has to be punished with probability of at least 1/2. Because both players have to be punished, each conflict induces at least one additional conflict ⇒ conflicts are “contagious”.

Stable Cooperation when Observing Action Profiles

Proposition
If players observe action profiles in at least two interactions, then cooperation is a perfect equilibrium iff the PD is mild (g < (l+1)/2).

Intuition
The high frequency of punishments required in acute PDs implies that the partner is more likely to defect when observing mutual defection (relative to observing the partner to be the sole defector).
⇒ Agents “punish” partners who defect against another defector ⇒ this destabilizes cooperation.
In mild PDs, cooperation is stable (with essentially the same unique supporting behavior).

Observing Actions Against Cooperation

Proposition
If players observe actions against cooperation (i.e., {CC, DC, ?D}) in at least two interactions, then cooperation is perfect in any Prisoner’s Dilemma.

Intuition
No indirect loss of defecting against a defector, since it is not observed.
This makes it easier to incentivize agents to deter defection.
Providing more information to agents may harm cooperation.


Related Literature and Contribution Conclusion

Related Literature (Partial List): Community Enforcement

1. Contagious equilibria (e.g., Kandori, 1992; Ellison, 1994).
2. Applications of belief-free equilibria (Takahashi, 2010; Deb, 2012).
3. Image scoring (e.g., Nowak & Sigmund, 1998).
4. Exogenous reputation mechanisms (e.g., Sugden, 1986; Kandori, 1992).
5. Structured populations (Cooper & Wallace, 2004; Alger & Weibull, 2013).
6. Observation of preferences (e.g., Dekel et al., 2007; Herold, 2012).

Our Main Methodological Contributions
Robustness to a few crazy agents.

Companion Projects and Directions for Future Research

Companion working papers:
1. “When is Social Learning Path-Dependent?”: When does a distribution of stationary strategies uniquely determine the consistent behavior?
2. “Coevolution of deception and preferences”: Players can deceive others about their preferences and intentions.

Directions for future research:
An experiment to test the theoretical predictions.
A realistic, yet tractable, model of online feedback.
Studying non-negligible noise levels.

Conclusion

Introducing robustness against a few crazy agents into the setup of community enforcement (Prisoner’s Dilemma with random matching).
1. Unique novel behavior supports stable cooperation.
2. Stable cooperation requires observation of 2+ interactions.
3. Observation of partner’s past actions: g > l: only defection is stable; g < l: cooperation is stable.
4. Observation of action profiles: cooperation is stable iff g < (l+1)/2.
5. Optimal feedback: observing actions against cooperation.

Summary of Results - When is Cooperation Stable?

Observation structure         | Mild (g < (l+1)/2)      | Acute (g > (l+1)/2)
                              | Defensive | Offensive   | Defensive | Offensive
Actions                       | Y         | N           | Y         | N
Conflicts                     | Y         | Y           | N         | N
Action profiles               | Y         | Y           | N         | N
Actions against cooperation   | Y         | Y           | Y         | Y

Stable cooperation requires observation of 2+ interactions.
Observing a single interaction: cooperation is not stable if g > 1.

Backup Slides

Influence of Cheap Talk

Introducing cheap talk with an unrestricted language destabilizes the perfect equilibrium in which everyone defects.
Experimenting agents use a secret handshake to cooperate among themselves (Robson, 1990).
Implications (observation of actions + cheap talk):
Defensive PD: only the cooperative equilibrium is stable.
Offensive PD: no stable equilibrium. The population state cycles between the defective and the cooperative equilibrium (as in the one-shot PD; see Wiseman & Yilankaya, 2001).


Steady States and Payoffs - Details


Fact (standard fixed point argument) Each strategy distribution admits a consistent behavior (not necessarily unique).

Example (k = 3; each agent plays the mode, i.e., the most frequently observed action.) There are 3 consistent behaviors: full cooperation, no cooperation, and uniform mixing.
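The three consistent behaviors in this example can be verified directly. If everyone defects with probability p and plays the mode of k = 3 IID observations, the induced defection probability is the chance of seeing at least 2 defections, 3p²(1−p) + p³. A consistent behavior is a fixed point of this map. The sketch below is a minimal check. The function name is hypothetical, and the general-k version assumes odd k so that the mode is never tied.

```python
from math import comb


def mode_rule_defection_rate(p: float, k: int = 3) -> float:
    """Probability that defection is the majority among k IID observations."""
    majority = k // 2 + 1  # assumes odd k, so there are no ties
    return sum(
        comb(k, m) * p**m * (1.0 - p) ** (k - m) for m in range(majority, k + 1)
    )


candidates = (0.0, 0.5, 1.0)
fixed_points = [
    p for p in candidates if abs(mode_rule_defection_rate(p) - p) < 1e-12
]
```

All three candidates pass the check, while an intermediate value such as p = 0.25 is mapped below itself and so is not a consistent behavior.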

The payoff of each incumbent strategy (s ∈ supp(σ)) and the average payoff in the population are defined in the standard way:
π_s(σ, η) = Σ_{s′} σ(s′) · π(η_s(s′), η_{s′}(s)),
π(σ, η) = Σ_{s ∈ supp(σ)} σ(s) · π_s(σ, η).
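The two payoff formulas translate into a pair of weighted sums. The sketch below is an illustration under an assumed reading: η_s(s′) is taken to be the probability that strategy s defects when matched with s′, stored as `eta[s][s_prime]`, and π(x, y) is the expected PD payoff when the player defects with probability x and the partner with probability y. All names are hypothetical.

```python
from typing import Dict

Behavior = Dict[str, Dict[str, float]]  # eta[s][t]: prob. that s defects against t


def stage_payoff(x: float, y: float, g: float, l: float) -> float:
    """Expected PD payoff when I defect w.p. x and my partner defects w.p. y."""
    return (
        (1 - x) * (1 - y) * 1.0
        + (1 - x) * y * (-l)
        + x * (1 - y) * (1.0 + g)
        + x * y * 0.0
    )


def strategy_payoff(s: str, sigma: Dict[str, float], eta: Behavior,
                    g: float, l: float) -> float:
    """pi_s: average stage payoff of s against a partner drawn from sigma."""
    return sum(
        weight * stage_payoff(eta[s][t], eta[t][s], g, l)
        for t, weight in sigma.items()
    )


def mean_payoff(sigma: Dict[str, float], eta: Behavior, g: float, l: float) -> float:
    """pi: population average of the strategy payoffs."""
    return sum(
        weight * strategy_payoff(s, sigma, eta, g, l) for s, weight in sigma.items()
    )


# Half the population always cooperates ("a"), half always defects ("b").
sigma = {"a": 0.5, "b": 0.5}
eta = {"a": {"a": 0.0, "b": 0.0}, "b": {"a": 1.0, "b": 1.0}}
```

With g = 0.5 and l = 1, the cooperators earn 0.5·1 + 0.5·(−1) = 0, the defectors earn 0.5·1.5 + 0.5·0 = 0.75, and the population average is 0.375.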


Illustration of Stable Cooperation in Defensive PD


Illustration of Unstable Cooperation in Offensive PD

