Costs to prove Ethereum blocks are down only. Competing zkVMs have drastically improved their performance over the past few years, with many predicting that proving Ethereum blocks in real time (under 12 seconds) will become a practical reality by 2025. However, if Ethereum is to rely on provers in the not-so-distant future, rigorous research is needed to determine whether certain EVM instructions (i.e., opcodes and precompiles) are particularly slow or computationally intensive to prove.
In this post, we use a data-driven approach to identify "prover killer" blocks with particularly long proving times, and then analyze the opcode-level gas consumption patterns to pinpoint the specific opcodes that contribute most to these performance bottlenecks.
These datasets were merged to obtain a unified dataset that links each block's proving performance with granular opcode-level gas usage metrics, enabling the identification and analysis of prover killer blocks.
To identify prover killer blocks, we used a regression-based outlier detection method. Blocks were first grouped by team and day, and a linear regression model was used to capture the relationship between gas usage and proving time. Blocks with residuals exceeding three stds from the regression line were flagged as prover killers. Each flagged block (N=256) was then matched with a non-prover killer block based on similar gas usage within a 10% tolerance to ensure accurate comparisons.
A Random Forest classifier was trained to predict prover killer blocks using the aggregated opcode features. Hyperparameters were tuned via grid search with stratified k-fold cross-validation. Feature importance was evaluated using both the classifier’s intrinsic measures and permutation-based tests.
Permutation-based feature importance analysis highlighted a subset of opcodes that played a critical role in the classifier’s performance. The top 10 opcodes, based on permutation importance are shown in the figure and table below.
op_code | perm_importance_mean | perm_importance_std |
---|---|---|
STATICCALL_gas | 0.029730 | 0.024771 |
STATICCALL_freq | 0.024865 | 0.022379 |
PUSH32_gas | 0.021081 | 0.014744 |
RETURN_gas | 0.021081 | 0.018041 |
PUSH10_freq | 0.018378 | 0.025203 |
MLOAD_freq | 0.018378 | 0.029079 |
SWAP2_freq | 0.015676 | 0.013610 |
SSTORE_freq | 0.015676 | 0.011130 |
SLOAD_gas | 0.015135 | 0.011669 |
DELEGATECALL_gas | 0.015135 | 0.017314 |
The suffix
_gas
indicates that the feature is associated with the gas consumed by an opcode, while_freq
indicates that the feature corresponds to the frequency at which an opcode is executed within a block.
or
or
By clicking below, you agree to our terms of service.
New to HackMD? Sign up
Syntax | Example | Reference | |
---|---|---|---|
# Header | Header | 基本排版 | |
- Unordered List |
|
||
1. Ordered List |
|
||
- [ ] Todo List |
|
||
> Blockquote | Blockquote |
||
**Bold font** | Bold font | ||
*Italics font* | Italics font | ||
~~Strikethrough~~ | |||
19^th^ | 19th | ||
H~2~O | H2O | ||
++Inserted text++ | Inserted text | ||
==Marked text== | Marked text | ||
[link text](https:// "title") | Link | ||
 | Image | ||
`Code` | Code |
在筆記中貼入程式碼 | |
```javascript var i = 0; ``` |
|
||
:smile: | ![]() |
Emoji list | |
{%youtube youtube_id %} | Externals | ||
$L^aT_eX$ | LaTeX | ||
:::info This is a alert area. ::: |
This is a alert area. |
On a scale of 0-10, how likely is it that you would recommend HackMD to your friends, family or business associates?
Please give us some advice and help us improve HackMD.
Syncing