-
Notifications
You must be signed in to change notification settings - Fork 796
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vision: Host functions and database support for non-Merklised-persistent data structures #245
Comments
Small question, I wonder if for the blob api we could address a given position, something like: |
This issue has been mentioned on Polkadot Forum. There might be relevant details there: |
As an aside, blake2 has a tree-hashing mode. It seemingly tuns a Amusingly, I'd expect this gives smaller Merkle proofs than using our trie directly, given our radix 16 trie bloats PoVs by 4x in dense trees, but even more size efficient options exist for typical Merkle proofs. |
I remember seeing different hash function using tree structure, but did not do my homework yet, remember also blake3 using it by default. Also think those kind of hash function are good candidates for |
This PR updates litep2p to the latest release. - `KademliaEvent::PutRecordSucess` is renamed to fix word typo - `KademliaEvent::GetProvidersSuccess` and `KademliaEvent::IncomingProvider` are needed for bootnodes on DHT work and will be utilized later ### Added - kad: Providers part 8: unit, e2e, and `libp2p` conformance tests ([#258](paritytech/litep2p#258)) - kad: Providers part 7: better types and public API, public addresses & known providers ([#246](paritytech/litep2p#246)) - kad: Providers part 6: stop providing ([#245](paritytech/litep2p#245)) - kad: Providers part 5: `GET_PROVIDERS` query ([#236](paritytech/litep2p#236)) - kad: Providers part 4: refresh local providers ([#235](paritytech/litep2p#235)) - kad: Providers part 3: publish provider records (start providing) ([#234](paritytech/litep2p#234)) ### Changed - transport_service: Improve connection stability by downgrading connections on substream inactivity ([#260](paritytech/litep2p#260)) - transport: Abort canceled dial attempts for TCP, WebSocket and Quic ([#255](paritytech/litep2p#255)) - kad/executor: Add timeout for writting frames ([#277](paritytech/litep2p#277)) - kad: Avoid cloning the `KademliaMessage` and use reference for `RoutingTable::closest` ([#233](paritytech/litep2p#233)) - peer_state: Robust state machine transitions ([#251](paritytech/litep2p#251)) - address_store: Improve address tracking and add eviction algorithm ([#250](paritytech/litep2p#250)) - kad: Remove unused serde cfg ([#262](paritytech/litep2p#262)) - req-resp: Refactor to move functionality to dedicated methods ([#244](paritytech/litep2p#244)) - transport_service: Improve logs and move code from tokio::select macro ([#254](paritytech/litep2p#254)) ### Fixed - tcp/websocket/quic: Fix cancel memory leak ([#272](paritytech/litep2p#272)) - transport: Fix pending dials memory leak ([#271](paritytech/litep2p#271)) - ping: Fix memory leak of unremoved `pending_opens` ([#274](paritytech/litep2p#274)) - identify: Fix memory leak of unused `pending_opens` ([#273](paritytech/litep2p#273)) - kad: Fix not retrieving local records ([#221](paritytech/litep2p#221)) See release changelog for more details: https://github.com/paritytech/litep2p/releases/tag/v0.8.0 cc @paritytech/networking --------- Signed-off-by: Alexandru Vasile <alexandru.vasile@parity.io> Co-authored-by: Dmitry Markin <dmitry@markin.tech>
This PR updates litep2p to the latest release. - `KademliaEvent::PutRecordSucess` is renamed to fix word typo - `KademliaEvent::GetProvidersSuccess` and `KademliaEvent::IncomingProvider` are needed for bootnodes on DHT work and will be utilized later - kad: Providers part 8: unit, e2e, and `libp2p` conformance tests ([#258](paritytech/litep2p#258)) - kad: Providers part 7: better types and public API, public addresses & known providers ([#246](paritytech/litep2p#246)) - kad: Providers part 6: stop providing ([#245](paritytech/litep2p#245)) - kad: Providers part 5: `GET_PROVIDERS` query ([#236](paritytech/litep2p#236)) - kad: Providers part 4: refresh local providers ([#235](paritytech/litep2p#235)) - kad: Providers part 3: publish provider records (start providing) ([#234](paritytech/litep2p#234)) - transport_service: Improve connection stability by downgrading connections on substream inactivity ([#260](paritytech/litep2p#260)) - transport: Abort canceled dial attempts for TCP, WebSocket and Quic ([#255](paritytech/litep2p#255)) - kad/executor: Add timeout for writting frames ([#277](paritytech/litep2p#277)) - kad: Avoid cloning the `KademliaMessage` and use reference for `RoutingTable::closest` ([#233](paritytech/litep2p#233)) - peer_state: Robust state machine transitions ([#251](paritytech/litep2p#251)) - address_store: Improve address tracking and add eviction algorithm ([#250](paritytech/litep2p#250)) - kad: Remove unused serde cfg ([#262](paritytech/litep2p#262)) - req-resp: Refactor to move functionality to dedicated methods ([#244](paritytech/litep2p#244)) - transport_service: Improve logs and move code from tokio::select macro ([#254](paritytech/litep2p#254)) - tcp/websocket/quic: Fix cancel memory leak ([#272](paritytech/litep2p#272)) - transport: Fix pending dials memory leak ([#271](paritytech/litep2p#271)) - ping: Fix memory leak of unremoved `pending_opens` ([#274](paritytech/litep2p#274)) - identify: Fix memory leak of unused `pending_opens` ([#273](paritytech/litep2p#273)) - kad: Fix not retrieving local records ([#221](paritytech/litep2p#221)) See release changelog for more details: https://github.com/paritytech/litep2p/releases/tag/v0.8.0 cc @paritytech/networking --------- Signed-off-by: Alexandru Vasile <alexandru.vasile@parity.io> Co-authored-by: Dmitry Markin <dmitry@markin.tech> Signed-off-by: Alexandru Vasile <alexandru.vasile@parity.io>
Related: #278
Related: #359
NOTE: This is intended to evolve into a full RFC, but is at present more of a bare collection of notes.
New overlay items:
Vec<u8>
, keyed byname: Name
.BTreeMap<Vec<u8>, Vec<u8>>
, keyed byname: Name
.These obey transactional principles. They may, at the direction of the runtime through a host function, be archived in a regular, non-Merklised database for later off-chain querying. An RPC may be provided in order to query the contents (if stored).
There are two sets of new host functions for creating, manipulating and querying these items; one set for the
Vec<_>
s and one for theBTreeMap<_>
s.The set for the
Vec<_>
s are prefixedblob_
and are:blob_new(name: Name, mode: Mode)
blob_set(name: Name, value: &[u8])
blob_clone(name: Name, target_name: Name)
blob_rename(name: Name, target_name: Name)
blob_edit(name: Name, data: &[u8], offset: u32) -> u32
blob_append(name: Name, suffix: &[u8])
blob_exists(name: Name) -> bool
blob_get(name: Name) -> Option<Vec<u8>>
blob_len(name: Name) -> Option<u32>
blob_hash32(name: Name, algorithm: Hash32Algorithm) -> Option<[u8; 32]>
blob_delete(name: Name)
The set for the
BTreeMap<_>
s are prefixedmap_
and are:map_new(name: Name, mode: Mode)
: Creates a new mapname
with no items. It is cleared if it already exists.map_clone(name: Name, target_name: Name)
: Creates a clone of the mapname
with nametarget_name
.map_rename(name: Name, target_name: Name)
: Alters the name of mapname
totarget_name
.map_insert(name: Name, key: &[u8], value: &[u8])
: Inserts a single (key
,value
) pair into mapname
, creating the map if it did not previously exist and overwriting the item if it did.map_remove_item(name: Name, key: &[u8])
: Removes the pair with the givenkey
from the mapname
, if the map exists and contains the item. Does nothing otherwise.map_exists(name: Name, key: &[u8]) -> bool
: Returnstrue
iff the mapname
is present in execution state.map_contains(name: Name, key: &[u8]) -> Option<bool>
: ReturnsSome
iff the mapname
exists,None
otherwise. The inner value ofSome
istrue
iff the mapname
containskey
.map_item_get(name: Name, key: &[u8]) -> Option<Vec<u8>>
: ReturnsSome
iff the mapname
exists and containskey
. If so, the inner value is that associated withkey
.map_item_len(name: Name, key: &[u8]) -> Option<u32>
: ReturnsSome
iff the mapname
exists and containskey
. If so, the inner value is the length of the value associated withkey
.map_item_hash32(name: Name, key: &[u8], algorithm: Hash32Algorithm) -> Option<[u8; 32]>
: ReturnsSome
iff the mapname
exists and containskey
. If so, the inner value is the value associated withkey
when hashed withalgorithm
.map_count(name: Name) -> Option<u32>
: ReturnsSome
iff the mapname
exists,None
otherwise. IfSome
, then the inner value is the number of items in the mapname
.map_root32(name: Name, structure: Root32Structure) -> Option<[u8; 32]>
: Calculates and returns the root of the data structurestructure
containing the items held in the mapname
. ReturnsNone
if mapname
does not exist.map_dump(name: Name) -> Option<Vec<(Vec<u8>, Vec<u8>)>>
: ReturnsSome
of aVec
of all items in the mapname
, sorted; orNone
if the mapname
does not exist.map_dump_hashed(name: Name, algorithm: Hash32Algorithm) -> Option<Vec<([u8; 32], [u8; 32])>>
: ReturnsSome
of aVec
of all pairs of keys and values in the mapname
hashed withalgorithm
and in order of the (unhashed) key; orNone
if the mapname
does not exist.map_next_key(name: Name, key: &[u8], count: u32) -> Option<Vec<Vec<u8>>>
: Returns up to the nextcount
keys in mapname
immediately followingkey
. If fewer items exist afterkey
thancount
, then only the remaining items are returned. If the mapname
does not exist thenNone
is returned.map_delete(name: Name)
: Delete the mapname
, clearing all data.There exists
Hash32Algorithm
andRoot32Structure
which should ultimately be defined in separate RFCs.Mode
is defined as:Drop
: The data may be discarded at the end of block execution without possibility of later querying, on-chain or off-chain. If creation of an item occurs without an explicit mode being given, then the default modeThrowAway
is assumed.Archive
: The data should be retained in an database associated with the block which is executing. It will not be accessible on-chain in later blocks (unless e.g. oraclised in some way).It is intended that future RFCs introduce additional items to this as needed.
Runtime interfaces may exist allowing the runtime to generate proofs for light-clients.
These items can be used to ween ourselves from (mis-)using the Main State Trie, especially for large items which should never be in the PoV.
This functionality can be used to remove system events and code (
:CODE
) from the main state trie, avoiding possible PoV issues and the problems with storing events as aVec<Event>
(effectively concatenating encoded items). It allows for custom indexing and proof systems, improving efficiency and efficacy.The host function
map_dump_hashed
allows for arbitrary digest ("Merkle hash") calculation within the runtime, allowing e.g. indexing by event topic and for proofs to use a base-2 Merkle structure. Multiple roots could be calculated to optimise for multiple indexing systems.The host function
blob_hash32
allows for comparison and retrieval of large blobs, e.g. of runtime code. The location of the "actual" code could even be moved outside of the main state into one of these items.Storage values and maps may be marked as non-persistent and these host functions may be used, resulting in better performance as well as guaranteed non-persistence. This is useful for block-specific data (like block height), contextual information (e.g. transaction index) as well safe "thread-local" storage.
Future Work
The
Mode
may be extended to include the possibility of persisting the regular (non-Merklised) key/value database between blocks. Additional configuration may allow a map's keys to be of fixed length.The text was updated successfully, but these errors were encountered: