Introduction Model Results Discussion

Observations on Cooperation

Yuval Heller (Bar Ilan) and Erik Mohlin (Lund)

PhD Workshop, BIU, January, 2018

Heller & Mohlin

Observations on Cooperation

1 / 20

Introduction Model Results Discussion

Motivating Example

Alice interacts with a remote trader, Bob. Both agents have opportunities to shirk/cheat. Alice obtains anecdotal evidence about Bob's past behavior. Alice considers this information when deciding how to act. Alice is unlikely to interact with Bob again. Future partners may ask Bob about Alice's behavior.

Heller & Mohlin

Observations on Cooperation

2 / 20

Introduction Model Results Discussion

Motivating Example

Alice interacts with a remote trader, Bob. Both agents have opportunities to shirk/cheat. Alice obtains anecdotal evidence about Bob's past behavior. Alice considers this information when deciding how to act. Alice is unlikely to interact with Bob again. Future partners may ask Bob about Alice's behavior.

Important economic interactions:

e.g, Greif, 94)), modern face-to-face interactions (e.g., Bernstein, 1992; Dixit 2003), the sharing economy & online trade (e.g., ebay, Uber, Airbnb). Hunter-gatherer societies, medieval trade (

Heller & Mohlin

Observations on Cooperation

2 / 20

Introduction Model Results Discussion

Underlying Game: The Prisoner's Dilemma (PD)

c c

1

d

1+g

d 1

−l

−l

Heller & Mohlin

0

g > 0 - gain of a greedy player.

1+g

l > 0 - loss if the partner defects.

0

Observations on Cooperation

3 / 20

Introduction Model Results Discussion

Brief Summary of Results

1

Novel simple behavior supports stable cooperation. (uniqueness in the restricted set of stationary strategies).

2

Stable cooperation requires observation of 2+ of interactions

3

Observation of partner's past actions:

g > l : Only defection is stable. g < l : Cooperation is stable (and robust to any noise).

Heller & Mohlin

Observations on Cooperation

4 / 20

Introduction Model Results Discussion

Brief Summary of Results

1

Novel simple behavior supports stable cooperation. (uniqueness in the restricted set of stationary strategies).

2

Stable cooperation requires observation of 2+ of interactions

3

Observation of partner's past actions:

g > l : Only defection is stable. g < l : Cooperation is stable (and robust to any noise). 4

Observation of action proles: Cooperation is stable i g < l +21 .

5

Optimal feedback: Observing partner's actions against cooperation. Heller & Mohlin

Observations on Cooperation

4 / 20

Introduction Model Results Discussion

Observation Structure and Environment Strategies and Steady States Solution Concept

Observation Structure

Basic stationary model (focus of the presentation):

Each player privately observes a sample of k actions played by his partner (against other opponents). Agents are restricted to stationary strategies. IID sampling from the partner's stationary behavior. Set of possible signals - m ∈ {0, 1, 2, ..., k} (interpreted as the number of observed defections).

Heller & Mohlin

Observations on Cooperation

5 / 20

Introduction Model Results Discussion

Observation Structure and Environment Strategies and Steady States Solution Concept

Observation Structure

Basic stationary model (focus of the presentation):

Each player privately observes a sample of k actions played by his partner (against other opponents). Agents are restricted to stationary strategies. IID sampling from the partner's stationary behavior. Set of possible signals - m ∈ {0, 1, 2, ..., k} (interpreted as the number of observed defections).

Alternative model (in the paper): observing the last

k

Unrestricted set of strategies,

actions. All the results hold except uniqueness.

Heller & Mohlin

Observations on Cooperation

5 / 20

Stationary Strategies Denition (Strategy -

s : {0, ..., k} → ∆ ({c, d}))

Mapping assigning a mixed action for each possible observation. Interpretation: The agent's behavior conditional on the observed signal.

Strategy distribution 

Distribution

σ

over the set of strategies (with

a nite support). Interpretation: Heterogeneous population.

Stationary Strategies Denition (Strategy -

s : {0, ..., k} → ∆ ({c, d}))

Mapping assigning a mixed action for each possible observation. Interpretation: The agent's behavior conditional on the observed signal.

Strategy distribution 

Distribution

σ

over the set of strategies (with

a nite support). Interpretation: Heterogeneous population.

Example of a Strategy Distribution

supp (σ ) = {su , s1 , s2 } σ (su ) = ε, su ≡ 50%,

σ (s1 ) = s1 (m) =

1−ε

,  6   c m = 0   d

m ≥ 1,

σ (s2 ) = s2 (m) =

5·(1−ε )

 6   c m ≤ 1   d

m≥2

Consistent Signal Prole

Denition (Signal

θ (s)

prole - θ : supp (σ ) −→ ∆ (M))

is interpreted as the distribution of signals observed by agents who are

matched with a partner who plays strategy

Denition (Consistent Signal prole

θ

signal prole

s.

)

and strategy distribution

σ

jointly induce a behavior

prole: a distribution of actions for each strategy. The behavior prole induce a signal prole of observed actions.

Consistency: The induced signal prole is Denition (Steady

θ.

state  (σ , θ ))

Pair consisting of a strategy distribution and a consistent signal prole.

Commitment Strategies (Crazy Agents)

As argued by Ellison (1994), the assumption that all agents are rational and, in equilibrium, best reply to what everyone else is doing is fairly implausible in large populations. We rene our solution concept by requiring robustness to the presence of a few crazy agents (` a la Kreps et al., 1982).

Commitment Strategies (Crazy Agents)

As argued by Ellison (1994), the assumption that all agents are rational and, in equilibrium, best reply to what everyone else is doing is fairly implausible in large populations. We rene our solution concept by requiring robustness to the presence of a few crazy agents (` a la Kreps et al., 1982).

Denition (Distribution of Commitments 

Sc

(Sc , λ ))

is a nite set of commitment strategies, and

λ ∈ ∆ (Sc )

is a

distribution over these strategies. We assume that at least one of the commitment strategies is totally mixed.

Introduction Model Results Discussion

Observation Structure and Environment Strategies and Steady States Solution Concept

Nash in Perturbed Environment

Denition (Perturbed Environment A fraction

εn

((SC , λ ) , εn ))

of committed agents plays a strategy according to

λ ∈ ∆ (SC ).

Heller & Mohlin

Observations on Cooperation

9 / 20

Introduction Model Results Discussion

Observation Structure and Environment Strategies and Steady States Solution Concept

Nash in Perturbed Environment

Denition (Perturbed Environment A fraction

εn

((SC , λ ) , εn ))

of committed agents plays a strategy according to

λ ∈ ∆ (SC ).

Nash equilibrium in a perturbed environment - (σn∗ , θn∗ )) agent cannot improve his payo by deviating. Formally:

Denition ( A normal

π (σn∗ , θn∗ ) ≥ πs (σn∗ , θn∗ )

for every strategy

payo of the 1 − εn  normal agents, and strategy

s,

s,

where

π (σn∗ , θn∗ )

πs (σn∗ , θn∗ )

denotes the payo to

against a population who plays according to

Heller & Mohlin

denotes the mean

(σn∗ , θn∗ ).

Observations on Cooperation

9 / 20

Introduction Model Results Discussion

Observation Structure and Environment Strategies and Steady States Solution Concept

Perfect Equilibrium

Denition (Perfect equilibrium

(σ ∗ , θ ∗ ))

The limit of Nash equilibria in a converging sequence of perturbed environments.

(i.e., ∃ (SC , λ , (εn )n ), s.t. ((σn∗ , θn∗ ) →εn →0 (σ ∗ , θ ∗ )))

Heller & Mohlin

Observations on Cooperation

10 / 20

Introduction Model Results Discussion

Taxonomy of PDs Observation of Actions Other Observation Structures

Heller & Mohlin

Observations on Cooperation

Results

11 / 20

Prisoner's Dilemma - Taxonomy

Oensive

(submodular, Takahashi, 2010) PD l < g :

stronger incentive to

defect against a cooperative partner than against a defective partner. Defensive (supermodular) PD

l > g.

c c

1

d

1+g

d 1

−l

−l 0

g > 0 - gain of a greedy player.

1+g 0

l > 0 - loss if the partner defects.

Prisoner's Dilemma - Taxonomy

Oensive

(submodular, Takahashi, 2010) PD l < g :

stronger incentive to

defect against a cooperative partner than against a defective partner. Defensive (supermodular) PD

l > g.

c c

1

d

1+g

d 1

−l

−l 0

g > 0 - gain of a greedy player.

1+g 0

l > 0 - loss if the partner defects.

Introduction Model Results Discussion

Taxonomy of PDs Observation of Actions Other Observation Structures

Stable Defection in any PD

Claim

Defection is perfect equilibrium outcome in any PD.

Heller & Mohlin

Observations on Cooperation

13 / 20

Defection is the Unique Outcome in Oensive PDs

Proposition

Assume an oensive PD (l < g ) with observation of any number of actions. If (σ ∗ , θ ∗ ) is a perfect equilibrium then everyone defects.

Defection is the Unique Outcome in Oensive PDs

Proposition

Assume an oensive PD (l < g ) with observation of any number of actions. If (σ ∗ , θ ∗ ) is a perfect equilibrium then everyone defects. Intuition: Assume to the contrary that normal agents sometimes cooperate. Direct gain from defecting decreases in the partner's prob. of defection. The indirect loss is independent of the current partner's behavior.



Incumbents are less likely to defect when observing more defections.



If Alice always defects, she outperforms the incumbents.

Stable Cooperation in Defensive PD Proposition

Assume g ≤ l and observing k ≥ 2 actions. Cooperation is strictly perfect. Moreover there is essentially a unique strategy distribution that supports cooperation.

Stable Cooperation in Defensive PD Proposition

Assume g ≤ l and observing k ≥ 2 actions. Cooperation is strictly perfect. Moreover there is essentially a unique strategy distribution that supports cooperation. Essentially Unique Stable State

Sketch of proof

Everyone cooperates when observing no defections. Everyone defects when observing

0

< q < k1

(i.e.,

q

≥2

defections.

of the incumbents defect when observing 1 defection

of the agents follow

s 1,

and the remaining 1−q follow

s1

s 2.

The value of

q

balances the payo of

The value of

q

depends on the commitment strategies.

and

s 2 ).

Introduction Model Results Discussion

Taxonomy of PDs Observation of Actions Other Observation Structures

Other Observation Structures

What happens if the signal about the partner depends also on the behavior of other opponents against her? We study three observation structures: action prole.

1

The entire

2

Mutual cooperation or not (=conict).

3

Observing

Signals:{CC , DC , CD, DD}.

actions against cooperation.

Signals:{CC ,

not − CC }.

Signals:{CC , DC , ?D}.

(1)+(2): Cooperation is a perfect equilibrium outcome i

g<

l +1 2 .

(3): Cooperation is a perfect equilibrium outcome for all

Heller & Mohlin

Observations on Cooperation

16 / 20

Introduction Model Results Discussion

Equilibrium Renements Related Literature and Contribution Conclusion

Equilibrium Renements

In the paper we show that all of our equilibria satisfy two additional requirement:

1

Strict perfection - the equilibrium holds regardless of how the crazy agents behave.

2

Evolutionary stability (a la Maynard-Smith & Price, 1973). Small group of agents who deviate together are outperformed.

3

Robustness to small perturbations of the signal prole. The population converges back to steady state from any nearby (inconsistent) state

Heller & Mohlin

(σ ∗ , θ ∗ ),

(σ ∗ , θ ).

Observations on Cooperation

17 / 20

Related literature (Partial List): Community Enforcement

1

Contagious equilibria (e.g., Kandori 1992; Ellison, 1994).

2

Applications of belief-free equilibria (Takahashi, 10; Deb, 12).

3

Image scoring (e.g., Nowak & Sigmund, 98).

4

Exogenous reputation mechanisms (e.g., Sugden, 86; Kandori, 92).

5

Structured populations (Cooper & Wallace, 04; Alger & Weibull, 13).

6

Observation of preferences (e.g., Dekel et al., 07; Herold, 12).

Related literature (Partial List): Community Enforcement

1

Contagious equilibria (e.g., Kandori 1992; Ellison, 1994).

2

Applications of belief-free equilibria (Takahashi, 10; Deb, 12).

3

Image scoring (e.g., Nowak & Sigmund, 98).

4

Exogenous reputation mechanisms (e.g., Sugden, 86; Kandori, 92).

5

Structured populations (Cooper & Wallace, 04; Alger & Weibull, 13).

6

Observation of preferences (e.g., Dekel et al., 07; Herold, 12).

We show that

a plausible requirement of robustness to a few crazy

agents qualitatively changes the analysis.

Introduction Model Results Discussion

Equilibrium Renements Related Literature and Contribution Conclusion

Directions for Future Research

Experiment to test the model's predictions. Realistic, yet tractable, model of online feedback. Applying the methodology to other underlying games.

Heller & Mohlin

Observations on Cooperation

19 / 20

Introduction Model Results Discussion

Equilibrium Renements Related Literature and Contribution Conclusion

Conclusion

Introducing robustness against few crazy agents into the setup of community enforcement (Prisoner's Dilemma with random matching).

1

Unique novel simple behavior supports stable cooperation.

2

Stable cooperation requires observation of 2+ interactions

3

Observation of partner's past actions:

g > l:

Only defection is stable.

g < l:

Cooperation is stable.

4

Observation of action proles: Cooperation is stable i

5

Providing more information may harm cooperation.

Heller & Mohlin

Observations on Cooperation

g < l+21 .

20 / 20

Summary of Results - When is Cooperation Stable?

Category of PD

Mild

Defen.

(g < l+21 )

Oen.

Acute

Defen.

(g > l+21 )

Oen.

Actions

Y N Y N

Conicts

Action proles

Y

Y

N

N

Stable cooperation requires observation of 2+ interactions. Observing a single interaction: Cooperation is not stable if g > 1.

Action against Coop.

Y

Introduction Model Results Discussion

Equilibrium Renements Related Literature and Contribution Conclusion

Backup Slides

Heller & Mohlin

Observations on Cooperation

22 / 20

Sketch of Proof.

η ∗ ≡ c ⇒ Everyone cooperates when observing no defections.

Sketch of Proof.

η ∗ ≡ c ⇒ Everyone cooperates when observing no defections.

0 < q < 1 of the incumbents cooperate when observing 1 defection: If everyone cooperates when with probability

δ << 1

m = 1:

If Alice deviates and defects

when observing

m = 0,

outperforms the incumbents (indirect loss is If everyone defects when

η∗ ≡ d

m = 1:

(each defection induces

then she

 O δ 2 ).

The unique consistent behavior is

k ≥1

additional defections).

⇒ Both actions must be best replies when m = 1.

Sketch of Proof.

η ∗ ≡ c ⇒ Everyone cooperates when observing no defections.

0 < q < 1 of the incumbents cooperate when observing 1 defection: If everyone cooperates when with probability

δ << 1

m = 1:

If Alice deviates and defects

when observing

m = 0,

outperforms the incumbents (indirect loss is If everyone defects when

η∗ ≡ d

m = 1:

(each defection induces

then she

 O δ 2 ).

The unique consistent behavior is

k ≥1

additional defections).

⇒ Both actions must be best replies when m = 1.

Everyone defects when observing m ≥ 2: The gain from defecting is increasing in the probability that the partner is going to defect. ⇒ Defection is the unique best reply when observing m ≥ 2.

Sketch of Proof (continued).

Back

Let

δ << 1

Let

Pr (d|m = 1)

be the average probability of observing

m = 1.

be the probability that the partner is going to

defect conditional on observing

m = 1.

The direct gain from defecting when observing

m=1

δ · ((l · Pr (d|m = 1)) + g · Pr (c|m = 1)) < δ · (l + 1) .

is:

Sketch of Proof (continued).

Back

Let

δ << 1

Let

Pr (d|m = 1)

be the average probability of observing

m = 1.

be the probability that the partner is going to

defect conditional on observing

m = 1.

The direct gain from defecting when observing

m=1

is:

δ · ((l · Pr (d|m = 1)) + g · Pr (c|m = 1)) < δ · (l + 1) . The indirect loss from defecting when

q · (k · δ ) · (l + 1) + O δ ⇒

Unique 0

< q < k1

2

m=1

is:

.

balances the gain and the indirect loss.

q is robust to perturbations (like the mixed eq. in Hawk-Dove).

Inuence of Cheap Talk

Introducing cheap-talk with unrestricted language destabilize the perfect equilibrium in which everyone defects. Experimenting agents use a secret handshake to cooperate among themselves (Robson, 1990). Implications (observation of actions + cheap-talk): Defensive PD - Only the cooperative equilibrium is stable. Oensive PD - No stable equilibrium. The population state cycles between the defective and the cooperative equilibrium .

(as in the one-shot PD, see Wiseman & Yilankaya, 2001)

Introduction Model Results Discussion

Equilibrium Renements Related Literature and Contribution Conclusion

Back

Steady States and Payos - Details

standard xed point argument)

Fact (

Each strategy distribution admits a consistent behavior (not necessarily unique). Example (k

= 3; Each agent plays the mode

.

(frequently observed action) )

3 consistent behaviors: full cooperation, no cooperation, uniform mixing.

The Payo of each incumbent strategy (s ∈ supp (σ )) and the average payo in the population are dened in a standard way. πs (σ , η) = ∑s 0 σ (s 0 ) · π (ηs (s 0 ) , ηs 0 (s)) π (σ , η) = ∑s∈supp(σ ) σ (s) · πs (σ , η) Heller & Mohlin

.

, Observations on Cooperation

26 / 20

Introduction Model Results Discussion

Equilibrium Renements Related Literature and Contribution Conclusion

Illustration of Stable Cooperation in Defensive PD

Heller & Mohlin

Observations on Cooperation

27 / 20

Introduction Model Results Discussion

Equilibrium Renements Related Literature and Contribution Conclusion

Illustration of Unstable Cooperation in Oensive PD

Heller & Mohlin

Observations on Cooperation

28 / 20

Observations on Cooperation

Model. Results. Discussion. Observations on Cooperation. Yuval Heller (Bar Ilan) and Erik Mohlin (Lund). PhD Workshop, BIU, January, 2018. Heller & Mohlin. Observations .... Consistency: The induced signal profile is θ. Definition (Steady state (σ,θ)). Pair consisting of a strategy distribution and a consistent signal profile.

3MB Sizes 0 Downloads 283 Views

Recommend Documents

Observations on Cooperation
Introduction. Model. Results. Discussion. Observations on Cooperation. Yuval Heller (Bar Ilan) and Erik Mohlin (Lund). Erice 2017. Heller & Mohlin. Observations on Cooperation. 1 / 22 ... Consistency: The induced signal profile is θ. A strategy dist

Observations on Cooperation
Jun 26, 2017 - case Bob acts opportunistically, is restricted. The effectiveness of .... Summary of Results We start with a simple result (Prop. 1) that shows that ...

Supplementary Material for ``Observations on ...
Nov 19, 2017 - C.2 Steady State in a Perturbed Environment. In this subsection we formally adapt the definitions of a consistent signal profile and of a steady state to perturbed environments. Let f((1−ϵ)·σ+ϵ·λ) : OS → OS be the mapping bet

Process Observations on Neuroscience & Mindfulness
These subjective, experiencing events combine to create our. Copyright ..... By way of concluding, I want to share a comment about my own work as a pastoral ...

pdf-389\a-different-perspective-an-entrepreneurs-observations-on ...
Page 1 of 10. A DIFFERENT PERSPECTIVE: AN. ENTREPRENEUR'S OBSERVATIONS ON. OPTOMETRY, BUSINESS, AND LIFE BY. ALAN CLEINMAN. DOWNLOAD EBOOK : A DIFFERENT PERSPECTIVE: AN ENTREPRENEUR'S. OBSERVATIONS ON OPTOMETRY, BUSINESS, AND LIFE BY ALAN. CLEINMAN P

pdf-1443\geological-observations-on-volcanic-islands-illustrated ...
... apps below to open or edit this item. pdf-1443\geological-observations-on-volcanic-islands-i ... -autobiography-of-charles-darwin-by-charles-darwin.pdf.

Some critical observations on the science of complexity
master tool with which to seize the truth of the world. More than one century ..... possesses the high degree of complexity involved in deterministic chaos. It would be an .... possibility if gives of accounting for “emergent properties”. However

Observations on the histology of carcinomata and the ...
3 This tuniour was fixed in sublimate and hardened in alcohol ; and I quite agree with. Ruffer that this method is iiot always the best for displaying the parasites.

Sculpture-Some-Observations-On-Shape-And-Form-From ...
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item.

Observations on Social Audit in Koraput -
format. • Particular observations may vary in degrees from location to location, but in the context of social audit each small ... meetings prior to SA for awareness.

[PDF Download] The Merely Personal: Observations on ...
workings of game theory and chess machines, and raise big questions: If German scientists had succeeded in ... God in fact play dice with the universe?

Some Observations on the Early History of Equilibrium ...
ing contributions of Michael Fisher to statistical mechanics and critical phe- nomena. ... tributed an account(1) of Michael's activities at King's College London,.

Observations on incidental catch of cetaceans in three landing centres ...
reported in India is vast (Lal Mohan, 1985; Mahadevan et al., 1990; Kasim et al., 1993; Satya Rao &. Chandrasekar, 1994; Thiagarajan et al., 2000). According to Lal Mohan (1994), the annual cetacean mortality caused by the Indian gill net fishery is