Path base Implementing #534

huyngopt1994 · 2024-08-19T12:47:45Z

This PR includes the following sub-PRs for the path-based implementation:

core,trie,eth,cmd: rework preimage store #533
trie, les, tests, core: implement trie tracer: Trie tracer is an aux… #552: initial implementation of the trie tracer.
all: introduce trie owner notion #553: introduce the trie owner notion.
core, trie: rework trie commiter #560: rework the trie committer.
all: rework genesis api #567: rework the logic in the genesis APIs and support rewinding the genesis state if it is missing from the stored data.
core/rawdb: the ancient store implementing is now exported in package… #571: export the freezer struct (the ancient store implementation) in package ethdb.
Change ancient chain segments from root ancient to sub folders #572: move the ancient chain data one folder level down, into a subfolder of the ancient directory.
all: move genesis init to blockchain #570: move genesis initialization to blockchain to avoid opening trie.Db multiple times.
Make db inspector for extending multiple ancient stores #574: Make db inspector for extending multiple ancient stores
rawdb,ethdb,eth: implement freezer tail deletion and use atomic refer… #577: implement freezer tail deletion
core,eth,tests,trie: abstract node scheme, and contruct database #578: abstract the node scheme, construct the trie database from a database interface instead of a key-value store so that reverse diff data can be stored in the ancient store, and track the path on insert in StackTrie.
cmd, core, eth, trie: track deleted nodes #576 : track deleted nodes
all: prep for path-based trie storage #582 : all: prep for path-based trie storage
trie: implement NodeBlob api for trie iterator #584: trie: implement NodeBlob api for trie iterator
trie: refactor tracer #581: Refactor trie tracer.
core, trie: rework trie database #585: rework the trie database.
eth/protocols/snap: fix batch writer when resuming an aborted sync (#… #587: eth/protocols/snap: fix batch writer when resuming an aborted sync
trie: add trie db wrapper; refactor trienode #588: trie: add trie db wrapper; refactor trienode
trie, core: track state changes in statedb #589: track state changes in statedb.
all:Remove trie cache journal period #595: all:Remove trie cache journal period
core, trie: Expose block number to statedb #593: core, trie: Expose block number to statedb
final implementing path base #591: final path-based implementation.
all: reworkNodeResolver for working with multiple state schemes with … #603: rework NodeResolver to work with multiple state schemes.
trie: fix issue insert wrong path in stack trie and remove the offset… #604: fix inserting the wrong path in stack trie.
all: enable pbss #600: enable pbss.
Fix missing passing scheme when init genesis and avoid referencing same object when passing parents in cosortium v1 #608: pass the state scheme when initializing genesis and avoid referencing the same object when passing parents in consortium v1.
core, eth/downloader: pbss fix release v1.13.1 #614: core, eth/downloader: pbss fix release v1.13.1
rlp, trie: faster trie node encoding (#24126) #606: rlp, trie: faster trie node encoding (#24126)
core, accounts, eth, trie: pbss fix release v1.13.2 #615: core, accounts, eth, trie: pbss fix release v1.13.2
trie/triedb/pathdb, core/rawdb: pbss fix release v1.13.5 (corner-cases in path scheme state management) #619: trie/triedb/pathdb, core/rawdb: pbss fix release v1.13.5 (corner-cases in path scheme state management)
cmd/ronin/chaincmd: open ancient freezer when init genesis #620: cmd/ronin/chaincmd: open ancient freezer when init genesis
[consortium-v1] Get list validators from genesis instead of statedb for snapsync. #624: [consortium-v1] Get list validators from genesis instead of statedb for snapsync.

* core,trie,eth,cmd: rework preimage store * ci: trigger unittest path-base-implementing

…liary tool to capture all deleted nodes which can't be captured by trie.Committer. The deleted nodes (#552) can be removed from the disk later. Implement traverse and rework init Trie

* cmd, core/state, light, trie, eth: add trie owner notion
* all: refactor
* tests: fix goimports
* core/state/snapshot: fix ineffasigns

Co-authored-by: rjl493456442 <[email protected]>
Co-authored-by: Martin Holst Swende <[email protected]>

* core, trie: rework trie committer

  Change the commit procedure and introduce a new struct called nodeSet that returns all dirty nodes of a trie. Multiple nodeSets are merged into a MergedNodeSet struct, which is then submitted to the in-memory database from block to block.
* trie,core: fix comments
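A minimal sketch of the nodeSet / MergedNodeSet idea from the commit message above. The type layout, field names, and the duplicate-owner check are assumptions made for illustration, not the code in this PR.

```go
package main

import (
	"errors"
	"fmt"
)

// NodeSet holds the dirty nodes of a single trie, keyed by node path.
// Hypothetical, simplified stand-in for the nodeSet described above.
type NodeSet struct {
	Owner string            // "" for the account trie, otherwise the storage trie owner
	Nodes map[string][]byte // node path -> encoded node
}

func NewNodeSet(owner string) *NodeSet {
	return &NodeSet{Owner: owner, Nodes: make(map[string][]byte)}
}

// MergedNodeSet groups the node sets of several tries by owner so they can be
// handed to the in-memory database in one step.
type MergedNodeSet struct {
	Sets map[string]*NodeSet
}

func NewMergedNodeSet() *MergedNodeSet {
	return &MergedNodeSet{Sets: make(map[string]*NodeSet)}
}

// Merge adds one trie's node set and rejects a second set for the same owner.
func (m *MergedNodeSet) Merge(set *NodeSet) error {
	if _, ok := m.Sets[set.Owner]; ok {
		return errors.New("duplicate node set for owner " + set.Owner)
	}
	m.Sets[set.Owner] = set
	return nil
}

func main() {
	account := NewNodeSet("")
	account.Nodes["0a"] = []byte{0x01}
	storage := NewNodeSet("owner-1")
	storage.Nodes["0b"] = []byte{0x02}

	merged := NewMergedNodeSet()
	for _, s := range []*NodeSet{account, storage} {
		if err := merged.Merge(s); err != nil {
			fmt.Println("merge failed:", err)
			return
		}
	}
	fmt.Println("tries merged:", len(merged.Sets))
}
```

Keying by owner keeps the account trie and each storage trie apart, which is what lets one block's changes be submitted as a single unit.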

* core: store genesis allocation and recommit them if necessary (#24460)
* core: store genesis allocation and recommit them if necessary
* core: recover predefined genesis allocation if possible
* all: cleanup the APIs for initializing genesis (#25473)
* all: polish tests
* core: apply feedback from Guillaume
* core: fix comment

---------

Co-authored-by: rjl493456442 <[email protected]>

core, eth, les, trie: rework snap sync Co-authored-by: rjl493456442 <[email protected]>

… ethdb, can be used independently of the chain database, reference by commit 1941c5e (#571)

* cmd, core, ethdb, node: rework ancient store folder reference by ethereum/go-ethereum@e44d655

* all: move genesis initialization to blockchain * all: fix test

* core: add blockchain test for failing create/destroy-case * core,state: some refactors * core/rawdb: refactor db inspector for extending multiple ancient store

…ence commit 538a868 (#577) fix up

* core,eth,tests,trie: abstract the node scheme, and construct the database from an interface instead of a key-value store to support storing reverse diff data in the ancient store
* stacktrie,core,eth: port the changes in stacktrie, track the path prefix of nodes on commit, and use ethdb.Database to construct trie.Database; this is not necessary yet, but the path-based scheme requires it to open the reverse diff freezer
* core,trie: add scheme and resolvepath logic

* trie: track deleted nodes * core: track deleted nodes

* all: prep for path-based trie storage * all: use rawdb.HasLegacyNode() to check for node existence instead of checking the length

* trie: implement NodeBlob API for trie iterator

  This functionality is needed in the new path-based storage scheme, but can be implemented in a separate PR. When an account is deleted, all of its storage slots should be removed from disk as well. In the hash-based storage scheme they are left on disk, but in the new scheme they are iterated and marked as deleted. Why is the NodeBlob API needed here? Because when a node is marked as deleted, its previous value must also be recorded to construct the reverse diff.
* fuzzers/stacktrie: enable test

---------

Co-authored-by: Gary Rong <[email protected]>
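To illustrate why the previous value matters, here is a rough sketch of recording a node's old blob on deletion and using it to roll back. The `reverseDiff` type and both helper functions are hypothetical; the real reverse diff in pathdb is more involved.

```go
package main

import "fmt"

// reverseDiff records, per node path, the blob a node had before the change.
// Hypothetical simplification of the reverse diff mentioned above.
type reverseDiff map[string][]byte

// deleteNode removes a node from the (in-memory) disk map and records its
// previous blob so the deletion can be undone later.
func deleteNode(disk map[string][]byte, diff reverseDiff, path string) {
	if prev, ok := disk[path]; ok {
		diff[path] = prev
		delete(disk, path)
	}
}

// rollback restores every node recorded in the diff to its previous blob.
func rollback(disk map[string][]byte, diff reverseDiff) {
	for path, prev := range diff {
		disk[path] = prev
	}
}

func main() {
	disk := map[string][]byte{"01": []byte("slot-a"), "02": []byte("slot-b")}
	diff := reverseDiff{}

	// Deleting an account means walking its storage nodes and marking each deleted.
	deleteNode(disk, diff, "01")
	deleteNode(disk, diff, "02")
	fmt.Println("nodes after delete:", len(disk))

	rollback(disk, diff)
	fmt.Println("nodes after rollback:", len(disk))
}
```

Without access to the old blob at iteration time, `deleteNode` would have nothing to store, and the rollback could not be built.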

* trie: refactor tracer * fix: add description

…27842) (#587) Co-authored-by: Péter Szilágyi <[email protected]>

* trie: add wrapper for database * trie: refactor trie node * all: fix test * rawdb, trie: fix comment trie: change name WithPrev => NodeWithPrev rawdb: add schema_test

* trie: triestate/Set to track changes
* core/state: track state changes
  journal.go: in resetObjectChange
  - add account in resetObjectChange (ref ethereum/go-ethereum#27339)
  - add prevAccount and prevStorage (ref ethereum/go-ethereum#27376)
  - add prevAccountOrigin and prevStorageOrigin to track changes
  state_object.go: add origin for tracking the original StateAccount before change
  statedb.go:
  - add accountsOrigin and storagesOrigin, same functions as above
  - stateObjectsDestruct now track the previous state before destruct
  - add functions for handle destructing old states
* all: apply changes to tests

* core/state: clean up: db already exist in stateObject * core, trie: statedb also commit the block number

* all: clean up overall structure, preparing for path-based (#594)
* trie/triedb/pathdb: init pathdb components
* core, trie: track state change with address instead of hash
  Reference: ethereum/go-ethereum@817553c
* trie: refactor
* rawdb: implement freezer resettable & state freezer (#596)
* rawdb: implement freezer resettable
* rawdb: implement state freezer
* rawdb: update description
* trie: path based scheme implementing (#598)
* core/state: move account definition to core/types
  Reference: ethereum/go-ethereum#27323
* trie: add path base utils
* triedb: implement history and add some test utils
* trie/triedb/pathdb: implement difflayer and disklayer
* Fix some issues related to history, and add a check for maxbyte being zero when retrieving ancient ranges
* trie/triedb/pathdb: implement database.go
* freezer: add unit test and docs for freezer reading with no size limit
* trie/triedb/pathdb: add database and difflayer tests
* triedb/pathdb: implement journal and add more comments

---------

Co-authored-by: Huy Ngo <[email protected]>

---------

Co-authored-by: Francesco4203 <[email protected]>

…pathdb

…calling ReadTrieNode underlying (#603)

… when inserting in stack trie reference by 86fe359 (#604)

* trie: enable pathdb: add path config and enable tests
* core/rawdb: now also inspect the state freezer in pathdb; rename
* cmd: working on cmd ronin
* core: refactor; add pathbase config; fix tests
  - all: fix and enable tests for pathbase
  - blockchain: open triedb explicitly in blockchain functions and close right after use, since diskLayer inside pathdb is a skeleton
  - blockchain: when writeBlockWithState, pathbase will skip the explicit garbage collector, which is only needed for hashbase
  - genesis.go: nit: change check genesis state, ref ethereum/go-ethereum@08bf8a6
* tests: enable path tests
* eth: enable path scheme
  - all: fix tests, enable path scheme tests
  - state_accessor: split function to retrieve statedb from block to hash scheme and path scheme
* light, miner, les, ethclient: clean up tests
* trie: refactor triereader, return err when state reader won't be created in hash and path
* trie: fix failed test in iterator and sync test tie
* trie,core: improve trie reader and add checking config nil when initing database
* trie: statedb instance is committed, then it's not usable, a new instance must be created based on new root updated database, reference by commit 6d2aeb4
* cmd,les,eth: fixed unittest and adding flag Parallel correctly
* core, eth: fix tests
* core: refactor and fix sync_test logic
* tmp: disable pathbase for TestIsPeriodBlock, TestIsTrippEffective

---------

Co-authored-by: Huy Ngo <[email protected]>

…me object when passing parents in consortium v1 (#608)

* cmd,eth: fix wrong compare logic when data dir is empty and move the error check to the correct place
* docker: pass state.scheme when initializing the genesis data
* rawdb: add missing freezer in collections
* v1/consortium: create a copy to keep parents content

  In the snapshot function, the parents list is popped gradually to read its contents, so by the time apply is called the parents list is empty. Creating a copy at the beginning fixes this. This has already been fixed in consortium v2. In a full sync scenario, however, the first blocks are still processed with consortium v1, which causes our node to panic.

---------

Co-authored-by: Francesco4203 <[email protected]>
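The parents issue can be reproduced in miniature. The sketch below is not the consortium v1 code; `walkBack` and `snapshotParents` are hypothetical names, and headers are reduced to block numbers.

```go
package main

import "fmt"

// walkBack consumes parents from the end, the way the snapshot lookup pops
// headers off the list. It returns the headers it visited and whatever is
// left of its local slice afterwards (always empty).
func walkBack(parents []uint64) (visited, remaining []uint64) {
	for len(parents) > 0 {
		visited = append(visited, parents[len(parents)-1])
		parents = parents[:len(parents)-1]
	}
	return visited, parents
}

// snapshotParents copies the list before walking it, so the full list is
// still available for a later apply step.
func snapshotParents(parents []uint64) (visited, kept []uint64) {
	kept = make([]uint64, len(parents))
	copy(kept, parents)
	visited, _ = walkBack(parents)
	return visited, kept
}

func main() {
	_, remaining := walkBack([]uint64{10, 11, 12})
	fmt.Println("left after popping:", len(remaining))

	visited, kept := snapshotParents([]uint64{10, 11, 12})
	fmt.Println("visited:", visited, "kept for apply:", kept)
}
```

Passing `remaining` on to a later step is the failure mode described above; passing `kept` is the fix.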

* eth/downloader: prevent pivot moves after state commit (#28126)
* core, eth/downloader: fix genesis state missing due to state sync (#28124)
* core: fix chain repair corner case in path-based scheme
* eth/downloader: disable trie database whenever state sync is launched

---------

Co-authored-by: Péter Szilágyi <[email protected]>
Co-authored-by: rjl493456442 <[email protected]>

commit ethereum/go-ethereum@65ed1a6.

This change speeds up trie hashing and all other activities that require RLP encoding of trie nodes by approximately 20%. The speedup is achieved by avoiding reflection overhead during node encoding.

The interface type trie.node now contains a method 'encode' that works with rlp.EncoderBuffer. Management of EncoderBuffers is left to calling code. trie.hasher, which is pooled to avoid allocations, now maintains an EncoderBuffer. This means memory resources related to trie node encoding are tied to the hasher pool.

This also refactors some functions in rlp package.

goos: linux
goarch: amd64
cpu: 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz

                          │ old.txt       │ new.txt
                          │ sec/op        │ sec/op          vs base
DeriveSha200/std_trie-8     725.1µ ± 31%    613.8µ ± 37%    ~       (p=0.481 n=10)
DeriveSha200/stack_trie-8   572.3µ ± 10%    493.1µ ± 13%    -13.85% (p=0.005 n=10)
geomean                     644.2µ          550.1µ          -14.61%

                          │ old.txt       │ new.txt
                          │ B/op          │ B/op            vs base
DeriveSha200/std_trie-8     287.4Ki ± 0%    283.0Ki ± 0%    -1.53%  (p=0.000 n=10)
DeriveSha200/stack_trie-8   56.34Ki ± 0%    42.43Ki ± 0%    -24.69% (p=0.000 n=10)
geomean                     127.2Ki         109.6Ki         -13.88%

                          │ old.txt       │ new.txt
                          │ allocs/op     │ allocs/op       vs base
DeriveSha200/std_trie-8     2.931k ± 0%     2.917k ± 0%     -0.46%  (p=0.000 n=10)
DeriveSha200/stack_trie-8   1.462k ± 0%     1.246k ± 0%     -14.77% (p=0.000 n=10)
geomean                     2.070k          1.907k          -7.90%

                          │ old.txt       │ new.txt
                          │ sec/op        │ sec/op          vs base
Prove-8                     664.0µ ± 21%    450.2µ ± 27%    -32.20% (p=0.000 n=10)
VerifyProof-8               8.643µ ± 18%    9.009µ ± 33%    ~       (p=0.684 n=10)
VerifyRangeProof10-8        99.18µ ± 25%    67.60µ ± 67%    ~       (p=0.089 n=10)
VerifyRangeProof100-8       496.3µ ± 20%    487.0µ ± 33%    ~       (p=0.739 n=10)
VerifyRangeProof1000-8      5.149m ± 32%    4.095m ± 49%    ~       (p=0.971 n=10)
VerifyRangeProof5000-8      19.79m ± 60%    19.16m ± 28%    ~       (p=0.631 n=10)
VerifyRangeNoProof10-8      499.0µ ± 15%    422.8µ ± 29%    -15.25% (p=0.035 n=10)
VerifyRangeNoProof500-8     1.747m ± 30%    1.417m ± 24%    -18.91% (p=0.023 n=10)
VerifyRangeNoProof1000-8    3.025m ± 29%    2.239m ± 33%    -25.98% (p=0.009 n=10)
geomean                     750.9µ          622.6µ          -17.09%

                          │ old.txt       │ new.txt
                          │ sec/op        │ sec/op          vs base
HashFixedSize/10-8          60.30µ ± 19%    44.84µ ± 17%    -25.64% (p=0.000 n=10)
HashFixedSize/100-8         205.9µ ± 32%    145.2µ ± 19%    -29.48% (p=0.000 n=10)
HashFixedSize/1K-8          1326.5µ ± 23%   939.2µ ± 25%    -29.20% (p=0.002 n=10)
HashFixedSize/10K-8         14.77m ± 25%    12.74m ± 19%    ~       (p=0.075 n=10)
HashFixedSize/100K-8        135.2m ± 19%    104.1m ± 18%    -23.03% (p=0.003 n=10)
geomean                     2.011m          1.520m          -24.43%

                          │ old.txt       │ new.txt
                          │ B/op          │ B/op            vs base
HashFixedSize/10-8          11.729Ki ± 0%   9.752Ki ± 0%    -16.85% (p=0.000 n=10)
HashFixedSize/100-8         58.56Ki ± 0%    49.23Ki ± 0%    -15.93% (p=0.000 n=10)
HashFixedSize/1K-8          578.1Ki ± 0%    481.5Ki ± 0%    -16.72% (p=0.000 n=10)
HashFixedSize/10K-8         6.019Mi ± 0%    4.985Mi ± 0%    -17.18% (p=0.000 n=10)
HashFixedSize/100K-8        59.53Mi ± 0%    49.29Mi ± 0%    -17.20% (p=0.000 n=10)
geomean                     683.5Ki         568.8Ki         -16.78%

                          │ old.txt       │ new.txt
                          │ allocs/op     │ allocs/op       vs base
HashFixedSize/10-8          149.0 ± 0%      142.0 ± 0%      -4.70%  (p=0.000 n=10)
HashFixedSize/100-8         772.0 ± 0%      739.0 ± 0%      -4.27%  (p=0.000 n=10)
HashFixedSize/1K-8          7.443k ± 0%     7.099k ± 0%     -4.62%  (p=0.000 n=10)
HashFixedSize/10K-8         77.09k ± 0%     73.32k ± 0%     -4.89%  (p=0.000 n=10)
HashFixedSize/100K-8        767.8k ± 0%     730.5k ± 0%     -4.86%  (p=0.000 n=10)
geomean                     8.729k          8.321k          -4.67%

Co-authored-by: Qian Bin <[email protected]>
Co-authored-by: Felix Lange <[email protected]>
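The shape of the optimization described in this commit can be sketched as follows: each node type encodes itself through an interface method, and a pooled hasher owns the buffer. This is an illustration only. It uses `bytes.Buffer` in place of `rlp.EncoderBuffer`, and the length-prefixed byte format is a placeholder, not RLP.

```go
package main

import (
	"bytes"
	"fmt"
	"sync"
)

// node encodes itself into a caller-supplied buffer, so no reflection is
// needed to discover its fields. Hypothetical stand-in for trie.node.
type node interface {
	encode(buf *bytes.Buffer)
}

type valueNode []byte

// encode writes a one-byte length prefix and the payload.
// Placeholder format (payloads under 256 bytes only), NOT RLP.
func (v valueNode) encode(buf *bytes.Buffer) {
	buf.WriteByte(byte(len(v)))
	buf.Write(v)
}

type shortNode struct {
	key []byte
	val node
}

// encode writes the key with a length prefix, then lets the child encode itself.
func (s *shortNode) encode(buf *bytes.Buffer) {
	buf.WriteByte(byte(len(s.key)))
	buf.Write(s.key)
	s.val.encode(buf)
}

// hasher is pooled and owns its encoding buffer, so buffer memory is reused
// together with the hasher.
type hasher struct {
	buf bytes.Buffer
}

var hasherPool = sync.Pool{New: func() any { return new(hasher) }}

// encodeNode borrows a hasher, encodes the node into its buffer, and returns
// a copy of the bytes before the hasher goes back to the pool.
func encodeNode(n node) []byte {
	h := hasherPool.Get().(*hasher)
	defer hasherPool.Put(h)
	h.buf.Reset()
	n.encode(&h.buf)
	return append([]byte(nil), h.buf.Bytes()...)
}

func main() {
	n := &shortNode{key: []byte{0x01}, val: valueNode("ab")}
	fmt.Printf("%x\n", encodeNode(n))
}
```

The copy in `encodeNode` matters: the buffer is reused by the next caller, so returning `h.buf.Bytes()` directly would alias pooled memory.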

* core, accounts, eth, trie: handle genesis state missing (#28171)
* core, accounts, eth, trie: handle genesis state missing
* core, eth, trie: polish
* core: manage txpool subscription in mainpool
* eth/backend: fix test
* cmd, eth: fix test
* core/rawdb, trie/triedb/pathdb: address comments
* eth, trie: address comments
* eth: inline the function
* eth: use synced flag
* core/txpool: revert changes in txpool
* core, eth, trie: rename functions
* trie: remove internal nodes between shortNode and child in path mode (#28163)
* trie: remove internal nodes between shortNode and child in path mode
* trie: address comments
* core/rawdb, trie: address comments
* core/rawdb: delete unused func
* trie: change comments
* trie: add missing tests
* trie: fix lint

---------

Co-authored-by: rjl493456442 <[email protected]>

…s in path scheme state management) (#619)

* trie/triedb/pathdb, core/rawdb: enhance error message in freezer (#28198)

  This PR adds more error messages for debugging purposes.
* trie/triedb/pathdb: improve dirty node flushing trigger (#28426)
* trie/triedb/pathdb: improve dirty node flushing trigger
* trie/triedb/pathdb: add tests
* trie/triedb/pathdb: address comment
* core/rawdb: fsync the index file after each freezer write (#28483)
* core/rawdb: fsync the index and data file after each freezer write
* core/rawdb: fsync the data file in freezer after write

---------

Co-authored-by: rjl493456442 <[email protected]>
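As a rough illustration of "fsync after each freezer write", the sketch below appends to a data file and an index file and calls `Sync` on each before returning. The file layout (a one-byte length per index entry) and the function names are assumptions for the example, not the freezer's format.

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// appendAndSync appends blob to the file at path and flushes it to stable
// storage before returning.
func appendAndSync(path string, blob []byte) error {
	f, err := os.OpenFile(path, os.O_APPEND|os.O_CREATE|os.O_WRONLY, 0o644)
	if err != nil {
		return err
	}
	defer f.Close()
	if _, err := f.Write(blob); err != nil {
		return err
	}
	return f.Sync()
}

// writeItem stores one item: the payload goes to the data file and a
// one-byte length entry goes to the index file (hypothetical layout).
// Both files are synced on every write.
func writeItem(dataPath, indexPath string, blob []byte) error {
	if err := appendAndSync(dataPath, blob); err != nil {
		return err
	}
	return appendAndSync(indexPath, []byte{byte(len(blob))})
}

// demo writes two items into a temporary directory and reads them back.
func demo() (data string, indexEntries int, err error) {
	dir, err := os.MkdirTemp("", "freezer-sketch")
	if err != nil {
		return "", 0, err
	}
	defer os.RemoveAll(dir)

	dataPath := filepath.Join(dir, "data")
	indexPath := filepath.Join(dir, "index")
	for _, blob := range [][]byte{[]byte("a"), []byte("b")} {
		if err := writeItem(dataPath, indexPath, blob); err != nil {
			return "", 0, err
		}
	}
	d, err := os.ReadFile(dataPath)
	if err != nil {
		return "", 0, err
	}
	idx, err := os.ReadFile(indexPath)
	if err != nil {
		return "", 0, err
	}
	return string(d), len(idx), nil
}

func main() {
	data, entries, err := demo()
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("data:", data, "index entries:", entries)
}
```

Syncing on every write trades throughput for durability: after a crash, the index and data files are less likely to disagree about what was written.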

…tium. (#624) In snap sync, when the path scheme is enabled, access to the triedb is disabled (it is marked stale) to protect the persistent state. The validator data is only needed for checks on the first few blocks, so we return a hard-coded list from the genesis data, following the snap sync flow from the go-ethereum team.

…pect failed in chain freezer (#627)

* trie: refactor stacktrie (#28233)

  This change refactors stacktrie to separate the stacktrie itself from the internal representation of nodes: a stacktrie is not a recursive structure of stacktries, rather, a framework for representing and operating upon a set of nodes.

  ---------

  Co-authored-by: Gary Rong <[email protected]>
* trie: remove owner and binary marshaling from stacktrie (#28291)

  This change
  - Removes the owner-notion from a stacktrie; the owner is only ever needed for committing to the database, but the commit-function, the `writeFn` is provided by the caller, so the caller can just set the owner into the `writeFn` instead of having it passed through the stacktrie.
  - Removes the `encoding.BinaryMarshaler`/`encoding.BinaryUnmarshaler` interface from stacktrie. We're not using it, and it is doubtful whether anyone downstream is either.
* core, trie, eth: refactor stacktrie constructor

  This change enhances the stacktrie constructor by introducing an option struct. It also simplifies the `Hash` and `Commit` operations, getting rid of the special handling round root node.
* core, eth, trie: filter out boundary nodes and remove dangling nodes in stacktrie (#28327)
* core, eth, trie: filter out boundary nodes in stacktrie
* eth/protocol/snap: add comments
* Update trie/stacktrie.go

  Co-authored-by: Martin Holst Swende <[email protected]>
* eth, trie: remove onBoundary callback
* eth/protocols/snap: keep complete boundary nodes
* eth/protocols/snap: skip healing if the storage trie is already complete
* eth, trie: add more metrics
* eth, trie: address comment

---------

Co-authored-by: Martin Holst Swende <[email protected]>

---------

Co-authored-by: Martin Holst Swende <[email protected]>
Co-authored-by: Gary Rong <[email protected]>
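The owner-in-`writeFn` idea and the option struct can be sketched together. Everything below (`options`, `newStackTrie`, `commit`, the callback signature) is a hypothetical miniature and should not be read as the stacktrie API.

```go
package main

import "fmt"

// writeFn receives each committed node. Hypothetical signature.
type writeFn func(path []byte, blob []byte)

// options is a hypothetical option struct passed to the constructor.
type options struct {
	writer writeFn // optional; nil means committed nodes are dropped
}

// stackTrie here only demonstrates how the writer is wired in; it does not
// build a trie.
type stackTrie struct {
	opts options
}

func newStackTrie(opts options) *stackTrie {
	return &stackTrie{opts: opts}
}

// commit hands a finished node to the writer, if one is configured.
func (s *stackTrie) commit(path, blob []byte) {
	if s.opts.writer != nil {
		s.opts.writer(path, blob)
	}
}

func main() {
	owner := "account-1"
	db := map[string][]byte{}

	// The caller closes over the owner, so the stack trie never needs to know it.
	st := newStackTrie(options{writer: func(path, blob []byte) {
		db[owner+"/"+string(path)] = blob
	}})
	st.commit([]byte("0a"), []byte{0x01})

	fmt.Println("nodes written:", len(db))
}
```

Because the owner lives in the closure, the same trie code serves the account trie and any storage trie without an extra field.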

…626)

* core/rawdb: improve state scheme checking (#28724)

  This pull request improves the condition to check if path state scheme is in use. Originally, root node presence was used as the indicator if path scheme is used or not. However due to fact that root node will be deleted during the initial snap sync, this condition is no longer useful. If PersistentStateID is present, it shows that we've already configured for path scheme.
* core, triedb/pathdb: calculate the size for batch pre-allocation (#29106)
* core, triedb/pathdb: calculate the size for batch pre-allocation
* triedb/pathdb: address comment
* triedb/pathdb: fix panic in recoverable (#29107)
* triedb/pathdb: fix panic in recoverable
* triedb/pathdb: add todo
* triedb/pathdb: rename
* triedb/pathdb: rename

---------

Co-authored-by: rjl493456442 <[email protected]>
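The scheme check described in #28724 can be pictured like this. The database is modeled as a plain map, and both key names are made up for the example; only the decision rule (state ID present means path scheme) comes from the commit message.

```go
package main

import "fmt"

// Hypothetical key names; the real database keys differ.
const (
	rootNodeKey          = "path-root-node"
	persistentStateIDKey = "persistent-state-id"
)

// usesPathSchemeByRoot is the old indicator: look for the root node.
// It misfires during the initial snap sync, when the root node is deleted.
func usesPathSchemeByRoot(db map[string][]byte) bool {
	_, ok := db[rootNodeKey]
	return ok
}

// usesPathScheme is the improved indicator: the persistent state ID is
// written once the path scheme is configured and survives snap sync.
func usesPathScheme(db map[string][]byte) bool {
	_, ok := db[persistentStateIDKey]
	return ok
}

func main() {
	// A path-scheme database in the middle of its initial snap sync:
	// the state ID exists, the root node has been removed.
	db := map[string][]byte{persistentStateIDKey: {0x01}}

	fmt.Println("root-node check:", usesPathSchemeByRoot(db))
	fmt.Println("state-ID check:", usesPathScheme(db))
}
```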

Francesco4203 force-pushed the path-base-implementing branch from 1c73b9a to 0f27ff6 Compare September 16, 2024 08:56

huyngopt1994 force-pushed the path-base-implementing branch 2 times, most recently from 3cb37d6 to ca1f046 Compare September 17, 2024 05:01

Francesco4203 force-pushed the path-base-implementing branch from 88c7ca4 to b78c647 Compare October 17, 2024 08:06

huyngopt1994 changed the title ~~[WIP] Path base Implementing~~ Path base Implementing Oct 18, 2024

huyngopt1994 and others added 25 commits October 25, 2024 14:28

core,trie,eth,cmd: rework preimage store (#533)

b65ec76

* core,trie,eth,cmd: rework preimage store * ci: trigger unittest path-base-implementing

trie, les, tests, core: implement trie tracer: Trie tracer is an auxi…

3e85934

…liary tool to capture all deleted nodes which can't be captured by trie.Committer. The deleted nodes (#552) can be removed from the disk later. Implement traverse and rework init Trie

core, eth: port snap sync changes (#564)

a02ce82

core, eth, les, trie: rework snap sync Co-authored-by: rjl493456442 <[email protected]>

core/rawdb: the ancient store implementing is now exported in package…

bc73cc6

… ethdb, can be used independently of the chain database, reference by commit 1941c5e (#571)

Change ancient chain segments from root ancient to sub folders (#572)

c225149

* cmd, core, ethdb, node: rework ancient store folder reference by ethereum/go-ethereum@e44d655

all: move genesis init to blockchain (#570)

a05a191

* all: move genesis initialization to blockchain * all: fix test

Make db inspector for extending multiple ancient stores (#574)

fb924c3

* core: add blockchain test for failing create/destroy-case * core,state: some refactors * core/rawdb: refactor db inspector for extending multiple ancient store

rawdb,ethdb,eth: implement freezer tail deletion and use atomic refer…

0219e42

…ence commit 538a868 (#577) fix up

cmd, core, eth, trie: track deleted nodes (#576)

5524d63

* trie: track deleted nodes * core: track deleted nodes

all: prep for path-based trie storage (#582)

dc978d0

* all: prep for path-based trie storage * all: use rawdb.HasLegacyNode() to check for node existence instead of checking the length

trie: refactor tracer (#581)

a64001b

* trie: refactor tracer * fix: add description

eth/protocols/snap: fix batch writer when resuming an aborted sync (#…

0fa53c3

…27842) (#587) Co-authored-by: Péter Szilágyi <[email protected]>

trie: rework trie database (#585)

fb397a7

trie: add trie db wrapper; refactor trienode (#588)

80f5db8

* trie: add wrapper for database * trie: refactor trie node * all: fix test * rawdb, trie: fix comment trie: change name WithPrev => NodeWithPrev rawdb: add schema_test

all: remove trie cache journal (#595)

6d2fa7a

core, trie: Expose block number to statedb (#593)

bafb899

* core/state: clean up: db already exist in stateObject * core, trie: statedb also commit the block number

trie: remove nodes method and add diskdb method for consistency with …

95cacba

…pathdb

all: reworkNodeResolver for working with multiple state schemes with …

351ac1e

…calling ReadTrieNode underlying (#603)

huyngopt1994 and others added 3 commits October 25, 2024 14:29

trie: fix issue insert wrong path in stack trie and remove the offset…

8de85b0

… when inserting in stack trie reference by 86fe359 (#604)

huyngopt1994 force-pushed the path-base-implementing branch from 69b0c7f to e42b5b8 Compare October 25, 2024 09:40

Francesco4203 and others added 10 commits October 25, 2024 17:49

cmd/ronin/chaincmd: open ancient freezer when init genesis (#620)

eaa459d

[docker] remove duplicate param in entrypoint.sh

b92dac2

cmd,rawdb: avoid extend Tail method in chainfreezer which make db ins…

a44ecb8

…pect failed in chain freezer (#627)
