This paper is exploring the effect of possible implementation of EIP-7251: Increase the MAX_EFFECTIVE_BALANCE from the scope of how possible consolidation of Validators for up to 2048 ETH can change slashing risks.
This paper is based on excellent research provided in MaxEB Slashing risks and based on the same assumptions:
The most common cause of slashing due to operator error is running the same key in separate instances that could have different views of the chain. Specifically a slashing offence may occur when the
AttestationData
produced by each node diverges.
An attestation contains the following subjective data for an attestation at slot:
- LMD Ghost head vote = what's the head of the chain at slot
- FFG vote target = what's the checkpoint's of head chain at slot
- FFG vote source = what's the latest justified checkpoint
The head vote can diverge if at the 4 second mark each node has a different view of what's the head. In healthy network conditions that happens when blocks are produced late, and some node receives the block latter than others.
Therefore slashing doesn't happen for all slots with double attestation, but requires also winning a "lottery" of the slot being divergent, with observed frequency of such slots (divergent rate) ~1-4% (~once ever 20 minutes)
In case of misconfiguration (double attesting) Node Operator react for this only after first slashing happens
And share of validators slashed within that first slashing is a random variable, based on what part of all misconfigurated validators was assigned to
slot that "wins" the divergent chain views lottery, signing conflicting messages and getting slashed
Example: If Node operator is running 32 misconfigurated validators, first slot that would lead to slashings could contain from 1 to 32 of that validators, depending on how many of that 32 validators were assigned specific divergent slot
There is a response on slashing alert, mitigating possible slashings of misconfigurated validators that are still didn't win the divergent chain view lottery
In response to the alerts, some action is done to correct the operational error: such as a human operator stoping a docker container.
Based on those assumption this paper research how concentration (number of validators for the same amount of staked ETH) impacts expected losses on double attestation slashing incidents within different network parameters (divergent rate) and reaction time for incident
Initial loss - loss associated with validators slashed in initial observed slot (first divergent slot with double attestations)
Valuation of expected loss and it's variation is based on calculation properties of random variable: Share of validators in the first divergent slot - which has binomial distribution with condition that variable is greater than zero (representing the idea that slashing should actually be reported)
Expected value | Variation |
---|---|
Image Not Showing
Possible Reasons
|
Image Not Showing
Possible Reasons
|
We can observe through the model that after ~100 indexes decrease in Expected loss is almost insignificant, which is backed up by the properties of initial random variable (X - number of validators slashed, M - total number of validators on the cluster)
With E(X) = 1/32 * M
And P(X=0) = (31/32)^M - responsible for the form of the function with increasing consolidation (and therefore reducing M).
As observed from the function form it starts from 1 (100% of validators slashed with M=1) and tends asymptotically to 1/32 in terms of validators share
Decrease in Variance still persists with increasing number of validators (indexes) with a maximum value at 14 validators with around 2% of indexes as standard deviation.
Observing cumulative distribution functions for different consolidation parameters also illustrates that with a major shift in share of ETH slashed for (2-128 indexes interval) and reduction in variance (less "stairy" form with increased number of indexes)
Valuating total slashing losses requires estimation of share of ETH slashed after initial slashing happens during reaction time, therefore:
Additional assumption:
7. Reaction time on incident: 10 epochs (1 hour)
An estimation of possible losses is based on 1000 simulations for different number of indexes and divergent rates.
Low rate | High rate |
---|---|
Image Not Showing
Possible Reasons
|
Image Not Showing
Possible Reasons
|
Primary effect: Greater divergent rate -> more brutal slashings (as more expected to be slashed in reaction time)
Secondary effect: Greater divergent rate -> less importance of consolidation
In terms of variance there is still a persistent effect of lower variance with greater number of validators:
With an interesting twist - as variance increases with divergent rate to some point and than decreases back with rate reaching up 20% - representing overall greater losses, but more predictable due to higher frequency of divergent slots
The effect on Expected losses is most persistent on low amount of indexes on one host.
Considering up to 256 ETH of capital, the risks on different options between running 1 to 8 validators could be valuated:
Illustrating that running more validators on one host rather than consolidating them would lead to lower expected losses in case of double attestation slashing. With most effect achieved with lower reaction time and low divergent rate within network.