Order manipulation attacks such as frontrunning and sandwiching have become an increasing concern in blockchain applications. Enforcing strict transaction ordering in a block is one of the many proposals for reducing MEV in blockchains. The paper Breaking the Chains of Rationality: Understanding the Limitations to and Obtaining Order Policy Enforcement explores the impossibility conditions for Order Policy Enforcement under the assumption of rational actors. The paper also proposes AnimaguSwap, a DEX that uses rationally binding transactions to enforce the stakers and flippers to have conflicting interests by constructing a game with Pareto efficient outcomes in an inefficient Nash equilibrium. It further goes on to formally prove how this would not work with binding side contracts for stakers(exactly why governments enforce anti-collusion laws). This article dives deeper into the flipper game, how it is constructed, and why I believe this is not a strong enough rational binding for the flipper to not collude. This article uses a game theoretical approach and analyzes how a rational flipper would behave under different strategies employed by a set of colluding stakers.
The Flipper Game involves a set of stakers, with one staker elected as the flipper in each round. For simplicity, let us assume that this leader is elected in a round-robin fashion(can be stake-weighted random as well) and all the stakers have equal weights. Every epoch has rounds such that each staker has been elected as the flipper at least once.
Let us understand the terminologies involved in the mechanism before we go ahead:
Let us discuss this game under the lens of an AMM-based DEX that leverages the game to prevent MEV extraction. We know sandwiching transactions is a common strategy to extract MEV and won't delve into it. The paper linked in the introduction explains what sandwiching is in detail and can be used for reference. The game proceeds as follows:
The utility for the flipper and the stakers individually is illustrated below:
(, ) | Attack | Defer |
---|---|---|
Collude | (k, k) | (0, 0) |
Trick | (z, -z) | (0, 0) |
Because is a shared utility from MEV, is almost always holds good. This shows why the flipper has the incentive to switch from colluding to tricking the rest of the stakers.
In the previous section, we modeled the game such that we consider the game to be played by a staker only when it is elected as a flipper. The reality is, MEV is extracted in every block and not just when is the flipper. Now let us try modeling the game with utilities associated with a time frame of one epoch:
(, ) | Attack | Defer |
---|---|---|
Collude | (, ) | (0, 0) |
Trick | (, ) | (0, 0) |
Every epoch has rounds and can choose to be part of the colluding network or not. It is fair to assume that if is not part of the colluding network the MEV proceeds from other rounds will not be shared with .
When we model the game for every epoch it enforces a stronger condition for to not collude with the rational stakers.
It was quite straightforward to move from a single-block game to a single-epoch game. Things get trickier when we model the game to exist over multiple epochs. Let us analyze the multi-epoch game when the stakers employ different strategies.
The stakers following this strategy will permanently defect following any instance of the flipper tricking the stakers.
In such a setting, the utility when decides to trick is as follows:
The utilities for the stakers and flipper over epochs is illustrated below:
(, ) | Attack | Defer |
---|---|---|
Collude | (, ) | (0, 0) |
Trick | (, ) | (0, 0) |
For large enough we know .
When employs a Grim Trigger strategy it is evident that it is in the interest of to collude with .
The stakers following this strategy will start with cooperating with and then mirroring the previous action of .
In such a setting, the utility when decides to trick is as follows:
receives a reward of in every alternate epoch.
(, ) | Attack | Defer |
---|---|---|
Collude | (, ) | (0, 0) |
Trick | (, ) | (0, 0) |
When the stakers employ a Tit-for-Tat strategy it enforces a stronger condition for to not collude with the rational stakers.
can employ any other strategy like Reputation-based Strategies, Minimax Strategy, Maximin Strategy etc., and a multi-epoch game can be modeled to understand the utility functions for each actor in the system.
Rational stakers will also try maximising their gains and the Grim Trigger strategy ensures maximal gains for the colluding stakers which enforces a stable Nash equilibrium with colluding and attacking.
We can now try to model a multi-epoch game such that engages in a Sybil attack, that is, every epoch removes stake and restakes as a different entity.
This is very similar to the One game every epoch
model with an added cost of using something like tornado cash to anonymously transfer staked assets to a different address.
Assuming bears an additional cost for doing this, the game is illustrated below:
(, ) | Attack | Defer |
---|---|---|
Collude | (, ) | (0, 0) |
Trick | (, ) | (0, 0) |
This enforces a stronger condition when compared to the One game every epoch
model enforcing for to not collude with the rational stakers.
The main takeaways are:
Modeling games with gaining when incurs losses does not guarantee a stable Nash equilibrium of not colluding. A game where is directly incentivized by the user will enforce a stable Nash equilibrium for not colluding. However, this can only be achieved when the user is ready to pay the flipper the full value of the MEV.
This article focuses on a single flipper playing against colluding actors . In a subsquent article I intend formally prove how this can be extended to a multiple non-colluding flipper scenario(if this holds good :P).