optimize get_proof_positions to not use sort #66

Davidson-Souza · 2025-02-04T15:44:59Z

Calling sort on every iteration for a method that's called at least once every block is a huge perf hit. This commit rework get_proof_positions to work without sorting.

The reason why we sort is that we always work upwards in the tree, starting at row 0 and going all the way to the top-most root. We always need a node and it's sibling (either from the proof or computed as we go). Searching for a sibling is also incredibly slow, given that we might have up to hundreds of thousands of nodes. We want to have siblings adjacent to each other inside our container. If all leaves were at row 0, that would always happen. But since leaves may not be at row 0, it is possible to have siblings being scattered around. To fix this, we might just sort the vector, siblings are numerically adjacent. However, sorting a few thousand nodes is also expensive.

This commit uses a BTreeMap instead of a vector. We trade-off some small performance on searching, but now we can add stuff in a self-sorting way that's much lighter.

Calling sort on every iteration for a method that's called at least once every block is a huge perf hit. This commit rework `get_proof_positions` to work without sorting. The reason why we sort is that we always work upwards in the tree, starting at row 0 and going all the way to the top-most root. We always need a node and it's sibling (either from the proof or computed as we go). Searching for a sibling is also incredibly slow, given that we might have up to hundreds of thousands of nodes. We want to have siblings adjacent to each other inside our container. If all leaves were at row 0, that would always happen. But since leaves may not be at row 0, it is possible to have siblings being scattered around. To fix this, we might just sort the vector, siblings are numerically adjacent. However, sorting a few thousand nodes is also expensive. This commit uses a BTreeMap instead of a vector. We trade-off some small performance on searching, but now we can add stuff in a self-sorting way that's much lighter.

Davidson-Souza force-pushed the proof-position-sortless branch from 395519e to 13567e7 Compare February 5, 2025 14:10

Davidson-Souza merged commit 2466fd0 into mit-dci:main Feb 5, 2025
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

optimize get_proof_positions to not use sort #66

optimize get_proof_positions to not use sort #66

Davidson-Souza commented Feb 4, 2025

optimize get_proof_positions to not use sort #66

optimize get_proof_positions to not use sort #66

Conversation

Davidson-Souza commented Feb 4, 2025