
Memoize transaction-related hashes #14752

Open
georgeee opened this issue Dec 20, 2023 · 2 comments

georgeee commented Dec 20, 2023

While examining the performance of Staged_ledger.update_coinbase_stack_and_get_data, I noticed that even after the optimizations culminating in PR #14643, up to 50% of the time is spent in the first and second passes of transaction application.

This was measured on blocks of 128 transactions, each a 9-account-update zkapp (deploying 8 new accounts). No events or memos were used, which may have affected the results.

When I broke the work down into cost centers, the following pieces accounted for a significant share of the cost:

  1. Token id derivation (derive_token_id in Account_id)
  2. Events-related hashing (Zkapp_account.Event.hash and Zkapp_account.Make_events.push_hash; see the sketch after this list)
  3. Hashing of the zkapp URI (hash_zkapp_uri in Zkapp_account)
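
As context for item 2, the events commitment is, in general shape, a running hash folded over the transaction's own events, which is why it depends on nothing outside the transaction. A hedged sketch of that shape, with names and signatures that are illustrative rather than the actual Zkapp_account code:

```ocaml
(* Illustrative shape of the events fold: hash each event, then push
   it onto a running commitment. The result depends only on the
   transaction's own events, so it can be cached per transaction. *)
let events_commitment ~(empty : 'h) ~(hash_event : 'e -> 'h)
    ~(push_hash : 'h -> 'h -> 'h) (events : 'e list) : 'h =
  List.fold_left (fun acc e -> push_hash acc (hash_event e)) empty events
```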

These hashing routines (unlike the account hash and merge hash used in merkle tree building) depend solely on the transaction in question, and hence can be computed before block creation (with the hashes memoized).

This would reduce pressure on the block window duration, hopefully cutting the time of update_coinbase_stack_and_get_data roughly in half.
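
Below is a minimal sketch of the memoization idea, assuming a pure, transaction-only function such as derive_token_id; the Memo functor, the cache size, and the string-keyed usage are illustrative assumptions, not the actual Mina API:

```ocaml
(* Cache the results of a pure function keyed by its input, so each
   distinct key is hashed once; later lookups hit the cache. *)
module Memo (Key : Hashtbl.HashedType) = struct
  module Tbl = Hashtbl.Make (Key)

  let memoize (f : Key.t -> 'a) : Key.t -> 'a =
    let cache = Tbl.create 128 in
    fun key ->
      match Tbl.find_opt cache key with
      | Some v -> v
      | None ->
          let v = f key in
          Tbl.add cache key v;
          v
end

(* Hypothetical usage: memoize a pure derivation keyed by strings
   (String.length is a trivial stand-in for a real hash). *)
module String_memo = Memo (struct
  type t = string
  let equal = String.equal
  let hash = Hashtbl.hash
end)

let cached = String_memo.memoize String.length
```

Alternatively, the precomputed hashes could travel with the transaction itself (in the spirit of Mina's With_hash wrappers), computed once at validation time and reused by both application passes.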

georgeee (Member, Author) commented:

At this point I think it could shave off ~30%.

With an increased number of events it could be more than that (this still needs to be measured).

georgeee (Member, Author) commented:

Current state

A recent compatible build, with #15980 and o1-labs/proof-systems#2394 merged, was tested on o1.yak.finance.

Connected to mainnet, it took 4 minutes to load the frontier (down from 5m11s on the 3.0.1 release).

When tested with the test-apply tool, application took 1.7s per max-size block, which sums to roughly 8 minutes over a full frontier (≈290 blocks × 1.7s).
This is a lower bound for the catchup-from-scratch procedure, and also a limiting factor (less important than networking and proving) for reducing the block window.

A few more optimizations to reduce hashing could be performed (as described in this issue).

Alternatively, some smarter parallelization strategy could be employed: synchronize 2-3 ledgers instead of the single one we currently synchronize for bootstrap, and run 2-3 hashing procedures in parallel (see the sketch below).

With 6-core parallelism, the catchup lower bound could be cut down to roughly 90 seconds, which is acceptable.
We should also start keeping masks in the DB to speed up restarts (though for now this mainly serves as an easy way to benchmark performance).
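
One possible shape of that parallelization, reading the comment as staggering 2-3 synchronized ledger copies so each applies a disjoint slice of the frontier's blocks; the types, apply_block, and the use of OCaml 5 domains are illustrative assumptions (Mina's code is built on Async), not the actual implementation:

```ocaml
(* Hypothetical placeholders standing in for the real types. *)
type ledger = unit
type block = unit

let apply_block (_ : ledger) (_ : block) : unit = ()

(* Spawn one domain per synchronized ledger copy; each applies its
   own contiguous slice of the frontier, so the hashing work runs
   in parallel instead of serially on a single ledger. *)
let catchup_in_parallel (slices : (ledger * block list) list) : unit =
  slices
  |> List.map (fun (ledger, blocks) ->
         Domain.spawn (fun () -> List.iter (apply_block ledger) blocks))
  |> List.iter Domain.join
```

As a rough check on the numbers above: 8 minutes of serial application divided across 6 cores is about 80 seconds, consistent with the ~90-second lower bound once per-domain overhead is counted.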

georgeee changed the title from "Memorize transaction-related hashes" to "Memoize transaction-related hashes" on Sep 16, 2024