feat: add adr-001 for node key refactoring #608

cool-develope · 2022-11-01T11:08:35Z

ref: #592, #593, #597

docs/architecture/adr-001-node-key-refactoring.md

tac0turtle · 2022-11-01T12:59:39Z

a good question by celestia, how does this effect the commitment structure? Are proofs still the same?

cool-develope · 2022-11-01T13:02:13Z

a good question by celestia, how does this effect the commitment structure? Are proofs still the same?

it doesn't affect, as you can see in #597 it doesn't update any proof part and still passing tests

tzdybal · 2022-11-01T13:17:47Z

a good question by celestia, how does this effect the commitment structure? Are proofs still the same?

it doesn't affect, as you can see in #597 it doesn't update any proof part and still passing tests

#597 still contains version field, which is still hashed (see:

iavl/node.go

Line 364 in 36a150e

err = encoding.EncodeVarint(w, node.version)

). Removing version will affect all the hashes, and therefore migration will invalidate all existing proofs.

cool-develope · 2022-11-01T13:20:33Z

Removing version will affect all the hashes, and therefore migration will invalidate all existing proofs.

@tzdybal , good catch. @tac0turtle how about your thougt? Will we keep the version or refactor the ics23 part?

tac0turtle · 2022-11-01T14:22:53Z

@tzdybal , good catch. @tac0turtle how about your thougt? Will we keep the version or refactor the ics23 part?

what do you mean by refactor ics23?

So there are two paths, introduce this in a breaking release of the sdk which a coordinated upgraded is needed or leave it alone. for testing purposes and data collection we probably want to leave it but in the future we could break it. BUT we should check with IBC about this.

cool-develope · 2022-11-01T14:32:04Z

what do you mean by refactor ics23?

I mean update the validation part in ics23.

So there are two paths, introduce this in a breaking release of the sdk which a coordinated upgraded is needed or leave it alone. for testing purposes and data collection we probably want to leave it but in the future we could break it. BUT we should check with IBC about this.

Sounds good, then let's keep the version at this time. I will create the issue and we can revisit in the future

docs/architecture/adr-001-node-key-refactoring.md

alexanderbez · 2022-11-01T15:22:56Z

docs/architecture/adr-001-node-key-refactoring.md

+	leftNodeKey   int64     // new field, need to store in the storage
+	rightNodeKey  int64     // new field, need to store in the storage


Why do we need these fields, if we already have leftNode and rightNode?

leftNode and rightNode is only meaningful on memory side, we need these fields to get children from the storage side.

Co-authored-by: Aleksandr Bezobchuk <[email protected]>

aaronc

I'm unclear as to how an integer improves data locality. Can you explain this a bit more? It would seem that it puts nodes that are created at similar times near each other in the B-tree, but this doesn't actually improve things like range scans other than by reducing the size of the node key which is significant.

cool-develope · 2022-11-01T16:47:58Z

I'm unclear as to how an integer improves data locality. Can you explain this a bit more? It would seem that it puts nodes that are created at similar times near each other in the B-tree, but this doesn't actually improve things like range scans other than by reducing the size of the node key which is significant.

Reducing key size itself improves data locality. I am not sure if leveldb uses compressed trie or general bTree, but we can reduce the node count and improve the tree density.

yihuang · 2022-12-12T06:33:35Z

A different nonce assignment strategy can be useful in for example migration, we can do a sequential iteration on existing db, should be much faster than doing random access to find the nodes version by version

The migration would be complicated, it requires a more detail design. I don't think we can keep the current import/export structure in the new design.

https://github.com/cosmos/iavl/blob/master/export.go#L21
The current export node is already nonce agnostic, it just rely on a post-order traversal of the tree, I think we can keep that, so the nonce assignment strategy can be a node local decision.

docs/architecture/adr-001-node-key-refactoring.md

tac0turtle · 2023-01-02T10:47:02Z

docs/architecture/adr-001-node-key-refactoring.md

+The `Update` operation will require extra DB access because we need to take children to calculate the hash of updated nodes.
+It doesn't require more access in other cases including `Set`, `Remove`, and `Proof`.
+
+It is impossible to remove the individual version. The new design requires more restrict pruning strategies.


can you elaborate on this?

I think it is enough, it will remove orphans and it will be impossible to remove the intermediate version from storage.
And pruning part explains in more detail the new methods.

Even if removing orphans, it's certainly possible to delete the individual versions in between.

lets edit to include this.

yihuang · 2023-01-04T03:55:09Z

Another round of review:

Keep order of insertion sorted

when there are multiple stores in the same db, the insertion is not sorted by keys anymore.

Migration

You also need to record the mapping of hash -> new node key to update the references in node body.

Pruning

We'll need both in-order and pre-order somehow, in-order to keep the key order, pre-order to skip subtree early on.

Remove the individual version

It's still possible to remove individual version with new pruning alrogithm, we just need to be aware of predecessor version and avoid deleting nodes older than that.

Export Snapshot

Can we clarify that the new node key format don't affect the snapshot export format? so the nonce assignment strategy is not part of consensus, but a node local decision.

docs/architecture/adr-001-node-key-refactoring.md

aaronc · 2023-01-18T18:37:49Z

docs/architecture/adr-001-node-key-refactoring.md

+
+### Negative
+
+The `Update` operation will require extra DB access because we need to take children to calculate the hash of updated nodes.


Do you mean to say that it requires extra DB reads? Why is this not needed in Set and Remove?

because update only needs the re-calc of hash, but Set and Remove requires calcHeightAndSize (re-calculation of height and size) and this needs to splay the children nodes

so we are getting left and right in Set and Remove but not splay in Update since we have leftHash and rightHash in the original version

the trade off here is storage saving on per node basis, right? maybe we mention this as a tradeoff

docs/architecture/adr-001-node-key-refactoring.md

Co-authored-by: Aaron Craelius <[email protected]>

docs/architecture/adr-001-node-key-refactoring.md

add adr

4f9a004

cool-develope requested a review from a team as a code owner November 1, 2022 11:08

small fix

8646a9a

tac0turtle requested review from p0mvn and ValarDragon November 1, 2022 11:44

tac0turtle reviewed Nov 1, 2022

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Show resolved Hide resolved

tac0turtle reviewed Nov 1, 2022

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Nov 1, 2022

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

cool-develope added 4 commits November 1, 2022 07:52

remove child hashes

6e5b081

small fix

4ee7804

add migration

9468ee2

add pruning

0c2d610

cool-develope commented Nov 1, 2022

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

cool-develope requested a review from tac0turtle November 1, 2022 12:52

tac0turtle reviewed Nov 1, 2022

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Nov 1, 2022

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

alexanderbez reviewed Nov 1, 2022

View reviewed changes

cool-develope and others added 3 commits November 1, 2022 11:44

Update docs/architecture/adr-001-node-key-refactoring.md

1459a18

Co-authored-by: Aleksandr Bezobchuk <[email protected]>

Update docs/architecture/adr-001-node-key-refactoring.md

d8d0bf9

Co-authored-by: Aleksandr Bezobchuk <[email protected]>

suggestions

88885f1

cool-develope requested review from alexanderbez and tac0turtle November 1, 2022 16:01

aaronc reviewed Nov 1, 2022

View reviewed changes

cool-develope mentioned this pull request Dec 20, 2022

feat: refactor the node key as version + path #650

Closed

tac0turtle reviewed Jan 2, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Show resolved Hide resolved

tac0turtle reviewed Jan 2, 2023

View reviewed changes

cool-develope mentioned this pull request Jan 17, 2023

feat: refactor the export traversal order as pre-order #662

Closed

aaronc reviewed Jan 18, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

aaronc reviewed Jan 18, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

cool-develope and others added 5 commits January 18, 2023 14:11

Update docs/architecture/adr-001-node-key-refactoring.md

204d881

Co-authored-by: Aaron Craelius <[email protected]>

Merge branch 'master' into 592/adr

f743311

resolve conflicts

bf85d92

Update adr-001-node-key-refactoring.md

b064ede

Merge branch 'master' into 592/adr

6095bc4

tac0turtle reviewed Feb 20, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Feb 20, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Feb 20, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Feb 20, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Feb 20, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Feb 20, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Feb 20, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

tac0turtle reviewed Feb 20, 2023

View reviewed changes

docs/architecture/adr-001-node-key-refactoring.md Outdated Show resolved Hide resolved

cool-develope and others added 2 commits February 21, 2023 05:07

Merge branch 'master' into 592/adr

7873267

comments

fb0f182

tac0turtle requested a review from yihuang February 21, 2023 14:09

tac0turtle approved these changes Feb 21, 2023

View reviewed changes

yihuang approved these changes Feb 21, 2023

View reviewed changes

tac0turtle merged commit c61d1de into master Feb 21, 2023

tac0turtle deleted the 592/adr branch February 21, 2023 22:28

yihuang mentioned this pull request May 17, 2023

An optimal backend for the IAVL #140

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add adr-001 for node key refactoring #608

feat: add adr-001 for node key refactoring #608

cool-develope commented Nov 1, 2022

tac0turtle commented Nov 1, 2022

cool-develope commented Nov 1, 2022

tzdybal commented Nov 1, 2022

cool-develope commented Nov 1, 2022 •

edited

Loading

tac0turtle commented Nov 1, 2022

cool-develope commented Nov 1, 2022

alexanderbez Nov 1, 2022

cool-develope Nov 1, 2022

aaronc left a comment

cool-develope commented Nov 1, 2022 •

edited

Loading

yihuang commented Dec 12, 2022 •

edited

Loading

tac0turtle Jan 2, 2023

cool-develope Jan 3, 2023

yihuang Jan 4, 2023 •

edited

Loading

tac0turtle Feb 20, 2023

yihuang commented Jan 4, 2023 •

edited

Loading

aaronc Jan 18, 2023

cool-develope Jan 18, 2023

cool-develope Jan 18, 2023

tac0turtle Feb 20, 2023

		leftNodeKey int64 // new field, need to store in the storage
		rightNodeKey int64 // new field, need to store in the storage


		### Negative

		The `Update` operation will require extra DB access because we need to take children to calculate the hash of updated nodes.

feat: add adr-001 for node key refactoring #608

feat: add adr-001 for node key refactoring #608

Conversation

cool-develope commented Nov 1, 2022

tac0turtle commented Nov 1, 2022

cool-develope commented Nov 1, 2022

tzdybal commented Nov 1, 2022

cool-develope commented Nov 1, 2022 • edited Loading

tac0turtle commented Nov 1, 2022

cool-develope commented Nov 1, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aaronc left a comment

Choose a reason for hiding this comment

cool-develope commented Nov 1, 2022 • edited Loading

yihuang commented Dec 12, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yihuang Jan 4, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yihuang commented Jan 4, 2023 • edited Loading

Keep order of insertion sorted

Migration

Pruning

Remove the individual version

Export Snapshot

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cool-develope commented Nov 1, 2022 •

edited

Loading

cool-develope commented Nov 1, 2022 •

edited

Loading

yihuang commented Dec 12, 2022 •

edited

Loading

yihuang Jan 4, 2023 •

edited

Loading

yihuang commented Jan 4, 2023 •

edited

Loading