Replace RPC request union with bytes #1119

wemeetagain · 2019-05-24T15:07:52Z

Using a union creates an unnecessary duplication with the method id (with
the union index of the serialized union as a stand-in for the request
method_id) and ultimately must be rechecked against the method_id to
ensure that the proper type has been/will be deserialized.

Instead, the union has been changed to a byte list. This allows for
simple programmatic switching on method_id to determine the deserialized
type as the sole determination, rather than having multiple coupled
switches, eg the method_id AND the union prefix. This also aligns the
Request type with the Response type, as both now have a "body" that is
a byte list and can be decoded with the method_id (and response_code).

Because request method_ids must be checked against body deserialized forms to ensure the proper deserialized type, it makes sense to perform the body deserialization only after we know which type is needed. Otherwise we require extra validation after-the-fact to determine that the decoded type is indeed what we want. Using a union creates an unnecessary duplication of the method id (with the union index of the serialized union as a stand-in for the request method_id) and ultimately must be checked against the method_id to ensure that the proper type has been deserialized. Instead, the union has been changed to a byte list. This allows for simple programmatic switching on method_id to determine the deserialized type as the sole determination, rather than having multiple coupled switches, eg the method_id AND the union prefix. This also aligns the Request type with the Response type, as both now have a "body" that can be decoded with the method_id (and response_code).

dankrad · 2019-05-25T08:03:43Z

The duplication does not seem a very strong argument for changing the structured union type to just unstructured "bytes" with a potentially new serialisation format (we're talking about 4 extra bytes here). Alternatively, can the request type be uniquely inferred from the message body type?

wemeetagain · 2019-05-25T16:25:53Z

Its less a matter of duplication, and more a matter of the ultimate primacy of the method id in the current scheme, and where the mapping of request types to request type bodies should be stored. Do we want it stored programmatically, where method ids are mapped to types? or do we want it stored in the union type?
I tend to think method ids are more friendly to experimentation, as they aren't necessarily tied to the order of an array that must have consensus. Eg: a client may add a nonstandard method id and it won't break if a new consensus method id is added. If the methods are implicit to the union type, any new addition of a consensus method id will break nonstandard methods (by changing the index of the nonstandard response body).

I don't think I'm proposing any new serialization format, we would still technically be using SSZ everywhere.

Interesting alternative, its similar in spirit. If the method id is removed, the request type can still be known by the decoded request body type. Or in another interpretation, the union prefix bytes become the new method id. See above why I think just using method ids is a better idea.
But if we go that direction, we might look at replacing the response result with a union as well, in the vein of unstructured bytes == bad, single-pass ssz == good.

protolambda · 2019-05-31T22:38:52Z

Not decoding the body immediately with standard SSZ can be a nice side-effect. E.g. you may not support the data types used in a specific method (i.e. you don't know how to decode), or you may want to proxy the method to some service that handles the specific method id.

wemeetagain · 2019-06-13T21:34:17Z

E.g. you may not support the data types used in a specific method (i.e. you don't know how to decode)

Exactly. If we want to let the rpc stream be extensible by client implementations, to allow for client-specific methods with client-specific request bodies, we should not also be relying on the union.
Doing so makes it difficult for other clients to differentiate incompatible methods from invalid request bodies, because unknown request bodies will trigger ssz parse errors before the parsed method id can be checked against known methods.

hwwhww · 2019-08-09T07:00:05Z

Hey @wemeetagain, it seems outdated after #1328, can we close it?

djrtwo · 2019-08-12T17:22:27Z

Closing this. Please bring up in an issue for discussion if still relevant

JustinDrake added the scope:networking label Jun 15, 2019

djrtwo closed this Aug 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace RPC request union with bytes #1119

Replace RPC request union with bytes #1119

wemeetagain commented May 24, 2019

dankrad commented May 25, 2019

wemeetagain commented May 25, 2019

protolambda commented May 31, 2019

wemeetagain commented Jun 13, 2019

hwwhww commented Aug 9, 2019

djrtwo commented Aug 12, 2019

Replace RPC request union with bytes #1119

Replace RPC request union with bytes #1119

Conversation

wemeetagain commented May 24, 2019

dankrad commented May 25, 2019

wemeetagain commented May 25, 2019

protolambda commented May 31, 2019

wemeetagain commented Jun 13, 2019

hwwhww commented Aug 9, 2019

djrtwo commented Aug 12, 2019