feat(benchmark): add clz opcode test case #1845

LouisTsai-Csie · 2025-07-02T15:07:47Z

🗒️ Description

Add benchmark test for CLZ opcode.

🔗 Related Issues or PRs

Issue #1795

✅ Checklist

All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
```
uvx --with=tox-uv tox -e lint,typecheck,spellcheck,markdownlint
```
All: PR title adheres to the repo standard - it will be used as the squash commit message and should start type(scope):.
All: Considered adding an entry to CHANGELOG.md.
All: Considered updating the online docs in the ./docs/ directory.
All: Set appropriate labels for the changes (only maintainers can apply labels).
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.
Tests: For PRs implementing a missed test case, update the post-mortem document to add an entry the list.
Ported Tests: All converted JSON/YML tests from ethereum/tests or tests/static have been assigned @ported_from marker.

LouisTsai-Csie · 2025-07-02T15:19:17Z

tests/benchmark/test_worst_compute.py

+    attack_gas_limit = env.gas_limit
+    tx_gas_limit = fork.transaction_gas_limit_cap()
+
+    magic_value = 248  # CLZ(248) = 248


In this test case, each CLZ operation uses the same input. I discovered that CLZ(248) == 248, which makes it more efficient than the approach where we first push a value and then DUP it before calling CLZ.

To verify, you could try this script:

# This function is from https://github.com/ethereum/EIPs/blob/master/EIPS/eip-7939.md def clz(x): """Returns the number of zeros preceding the most significant one bit.""" if x < 0: raise ValueError("clz is undefined for negative numbers") if x > 2**256 - 1: raise ValueError("clz is undefined for numbers larger than 2**256 - 1") if x == 0: return 256 # Convert to binary string and remove any '0b' prefix. bin_str = bin(x).replace('0b', '') return 256 - len(bin_str) bits = 0 magic_value = 248 stack = [magic_value] for i in range(100): x = stack.pop() assert clz(x) == magic_value stack.append(clz(x))

LouisTsai-Csie · 2025-07-02T15:21:47Z

tests/benchmark/test_worst_compute.py

+        sender=pre.fund_eoa(),
+    )
+
+    if (tx_gas_limit is None) or (tx_gas_limit > attack_gas_limit):


This if-statement is for EIP-7852 for transaction limit cap, more details could be found in this issue.

We should have a wrapper for all the benchmark test cases, but i will leave it in a separate PR.

LouisTsai-Csie · 2025-07-02T15:24:24Z

tests/benchmark/test_worst_compute.py

+    code_seq = Bytecode()
+
+    for i in range(2 * iteration):
+        value = i if i % 2 else 2**256 - 1 - i


In the second test case, I aim to add some randomness. Using a simple range from 1 to iteration results in many values being too similar.

I get the idea but I would set this up differently. We have code_prefix and code_suffix so we know that we can add at most max_code_size - len(code_prefix) - len(code_suffix) bytes. I'd keep incrementing some number here and then let CLZ work on 2^256 - 1 >> i % 256. This i increments each round and will thus right shift the values such that CLZ yields a different value. Note that for each CLZ you add you should check Op.CLZ(value) + Op.POP if this length does not exceed the max size (else break). This length is not constant because it will depend on how many bytes we are pushing (to push 2^256-1 need PUSH32, and for 1 need PUSH1)

jochem-brouwer

Some comments 😄 👍

Need to also think on how to use these benchmarks, i.e. how to test these against clients and how to interpret the results. The end goal could be to determine the gas price of the opcode. For that we also need a baseline in order to interpret the results and thus also to price the opcode. This is non-trivial because these benchmarks use other opcodes as well, and we thus have to find a way to measure the costs for CLZ only (and not the cost of the other opcodes). Food for thought!

Left a comment on one of the tests. The 248 trick is beautiful 😄 👍

jochem-brouwer · 2025-07-07T09:49:12Z

tests/benchmark/test_worst_compute.py

+    attack_gas_limit = env.gas_limit
+    tx_gas_limit = fork.transaction_gas_limit_cap()
+
+    magic_value = 248  # CLZ(248) = 248


Heh, this is a nice trick.

jochem-brouwer · 2025-07-07T10:02:01Z

tests/benchmark/test_worst_compute.py

+        value = i if i % 2 else 2**256 - 1 - i
+        code_seq += Op.CLZ(value) + Op.POP
+
+    code_address = pre.deploy_contract(code=code_prefix + code_seq + code_suffix)


This will deploy a contract too large. Could you add the sanity check?

Suggested change

code_address = pre.deploy_contract(code=code_prefix + code_seq + code_suffix)

code_address = pre.deploy_contract(code=code_prefix + code_seq + code_suffix)

if len(attack_code) > max_contract_size:

raise ValueError(

f"Code size {len(attack_code)} exceeds maximum code size {max_contract_size}"

)

I might go with the following approach for consistency now, since the remaining benchmarks use it as well.

However, we could add this refactoring for all the assertions later in this issue

assert len(attack_code) <= max_code_size

jochem-brouwer · 2025-07-07T10:05:45Z

tests/benchmark/test_worst_compute.py

+    code_seq = Bytecode()
+
+    for i in range(2 * iteration):
+        value = i if i % 2 else 2**256 - 1 - i


I get the idea but I would set this up differently. We have code_prefix and code_suffix so we know that we can add at most max_code_size - len(code_prefix) - len(code_suffix) bytes. I'd keep incrementing some number here and then let CLZ work on 2^256 - 1 >> i % 256. This i increments each round and will thus right shift the values such that CLZ yields a different value. Note that for each CLZ you add you should check Op.CLZ(value) + Op.POP if this length does not exceed the max size (else break). This length is not constant because it will depend on how many bytes we are pushing (to push 2^256-1 need PUSH32, and for 1 need PUSH1)

LouisTsai-Csie · 2025-07-07T12:03:19Z

@jochem-brouwer Thanks for your review, I've updated accordingly! Please let me know if the change is good to you.

MariusVanDerWijden · 2025-07-17T06:38:05Z

tests/benchmark/test_worst_compute.py

+    code_prefix = Op.JUMPDEST
+    code_suffix = Op.PUSH0 + Op.JUMP
+
+    available_code_size = max_code_size - len(code_prefix) - len(code_suffix)
+
+    code_seq = Bytecode()
+
+    for i in range(available_code_size):
+        value = (2**256 - 1) >> (i % 256)
+        clz_op = Op.CLZ(value) + Op.POP
+        if len(code_seq) + len(clz_op) > available_code_size:
+            break
+        code_seq += clz_op
+
+    attack_code = code_prefix + code_seq + code_suffix
+    assert len(attack_code) <= max_code_size


I think this whole logic is very generalizable, you set up the stack, setup a jumpdest then do 24k - prefix - suffix times / inner code include the operations then add the suffix.
I think this could be refactored out into a helper cc @marioevz e.g.

func createBenchLoop(prefix, code, suffix []byte) []byte{ result := []byte result := append(result, prefix) dest := len(result) result := append(result, JUMPDEST) suffix := append(suffix, PUSH32) suffix := append(suffix, extend32(dest) suffixLen := len(suffix) available_code_size := max_contract_size - len(result) - suffixLen for i := range available_code_size / len(code) result := append(result, code) result := append(result, suffix) return result

This is part of our refactoring plan, we will create new test types: benchmark_test, and benchmark_state_test to handle the repetitive code, the transaction gas limit cap, and more. You could check more information in this issue.

LouisTsai-Csie commented Jul 2, 2025

View reviewed changes

LouisTsai-Csie mentioned this pull request Jul 1, 2025

feat(tests): additional test cases for eip-7939 #1795

Open

24 tasks

LouisTsai-Csie force-pushed the benchmark-clz-op branch from b731387 to 561a50e Compare July 3, 2025 11:43

LouisTsai-Csie mentioned this pull request Jul 3, 2025

Create blockchains that consume full gas on single operations. #1453

Open

LouisTsai-Csie marked this pull request as ready for review July 3, 2025 12:26

LouisTsai-Csie self-assigned this Jul 3, 2025

LouisTsai-Csie added feature:benchmark fork:osaka Osaka hardfork labels Jul 3, 2025

LouisTsai-Csie marked this pull request as draft July 4, 2025 14:45

LouisTsai-Csie marked this pull request as ready for review July 4, 2025 14:45

danceratopz requested a review from jochem-brouwer July 7, 2025 09:37

jochem-brouwer requested changes Jul 7, 2025

View reviewed changes

LouisTsai-Csie added 3 commits July 7, 2025 12:14

feat(benchmark): add clz benchmark with the same input

6c50dce

feat(benchmark): add clz benchmark with the diff input

92c30a8

refactor(benchmark): enhance clz benchmarks

055ecf9

LouisTsai-Csie force-pushed the benchmark-clz-op branch from 561a50e to 055ecf9 Compare July 7, 2025 10:46

LouisTsai-Csie requested a review from jochem-brouwer July 7, 2025 10:48

MariusVanDerWijden reviewed Jul 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(benchmark): add clz opcode test case #1845

feat(benchmark): add clz opcode test case #1845

LouisTsai-Csie commented Jul 2, 2025 •

edited

Loading

Uh oh!

LouisTsai-Csie Jul 2, 2025

Uh oh!

LouisTsai-Csie Jul 2, 2025 •

edited

Loading

Uh oh!

LouisTsai-Csie Jul 2, 2025

Uh oh!

jochem-brouwer Jul 7, 2025

Uh oh!

jochem-brouwer left a comment

Uh oh!

jochem-brouwer Jul 7, 2025

Uh oh!

jochem-brouwer Jul 7, 2025

Uh oh!

LouisTsai-Csie Jul 7, 2025

Uh oh!

jochem-brouwer Jul 7, 2025

Uh oh!

LouisTsai-Csie commented Jul 7, 2025

Uh oh!

MariusVanDerWijden Jul 17, 2025

Uh oh!

LouisTsai-Csie Jul 17, 2025

Uh oh!

Uh oh!

-    code_address = pre.deploy_contract(code=code_prefix + code_seq + code_suffix)
+    code_address = pre.deploy_contract(code=code_prefix + code_seq + code_suffix)
+    if len(attack_code) > max_contract_size:
+        raise ValueError(
+            f"Code size {len(attack_code)} exceeds maximum code size {max_contract_size}"
+        )

feat(benchmark): add clz opcode test case #1845

Are you sure you want to change the base?

feat(benchmark): add clz opcode test case #1845

Conversation

LouisTsai-Csie commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🗒️ Description

🔗 Related Issues or PRs

✅ Checklist

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LouisTsai-Csie Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jochem-brouwer left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LouisTsai-Csie commented Jul 7, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

LouisTsai-Csie commented Jul 2, 2025 •

edited

Loading

LouisTsai-Csie Jul 2, 2025 •

edited

Loading