Consider a right decimals approach for encoding decimal numbers #149

robert-zaremba · 2020-11-05T14:15:12Z

Summary

For blockchain use-case we need to do arithmetic operation which is both deterministic and support big numbers. Moreover, since numbers are used very often we should have an efficient (storage and cpu wise) serialization.

Context

In Cosmos-SDK we are using sdk.Int which is a wrapper around Go big.Int. Big integers are pretty efficient for arithmetic operations. Moreover sdk.Int supports efficient serialization using Int.Bytes()

The problem with sdk.Int is that it's not very friendly to encode Decimal numbers. In the real world we like to think with divisible units, eg: 1.2 USD rather than 120 US Cents

Related Work

We started with an experiment in x/ecocredits to use apd. Currently it's used only in that module.

Considerations

With respect to cosmos/cosmos-sdk#7113, let's consider and be consistent how we want to represent decimal values across Regen Ledger and SDK.

Add Bytes serialization for apd
Don't use decimal library and use sdk.Int and establish a standard in the SDK community about Decimal numbers representation.

Goal: we should keep in mind, that we don't want to end up with 15 standards of coin / credit / token amount representation.

References:

Decide about future of sdk.Dec cosmos/cosmos-sdk#7773 : sdk.Dec doesn't work (has bugs).
Custom decimal places for specific calculations cosmos/cosmos-sdk#4399 Custom decimal places for specific calculations

The text was updated successfully, but these errors were encountered:

aaronc · 2020-11-05T14:44:18Z

I would just note that I'm not sure binary representation is a problem. The SDK has decided to use strings instead. We should discuss there. If we do use binary, I suggest decimal128

aaronc · 2020-11-05T14:47:56Z

However decimal128 is always 16 bytes. Not so space efficient. An efficient representation would be varints for mantissa and exponent. But again I'm not really sold on needing binary over strings.

robert-zaremba · 2020-11-05T15:27:54Z

In ecocredtis we are serializing numbers using a decimal string. We can keep adp and use big.Int.Bytes() for serialization in kvstore.

In x/bank we are using a binary serialization:

bz := k.cdc.MustMarshalBinaryBare(&balance)
accountStore.Set([]byte(balance.Denom), bz)

aaronc · 2020-11-05T15:38:08Z

Which is a proto wrapper around a string not binary. Pretty unnecessary imho to wrap it.

There are reasons we use strings in the SDK. Most notably because it's a standardized representation.

aaronc · 2020-11-05T15:39:17Z

How and why would we use bigint serialization for decimals? Do you mean mantissa and exponent. Wouldn't a varint be better than big median anyway?

robert-zaremba · 2020-11-05T16:11:01Z

Mantissa only, because the exponent is int32. We can use varint as well, but I think its an overhead. The serialization is super simple:

<mentisa><exponent>

mentisa is a low endian int32
exponent big.Int.Bytes

Deserializing is simple:

read first 4 bytes and set it to int32
pass the rest (data[4:]) to Int.SetBytes

The Advantage of this is that we can push this feature to the upstream (apd).

aaronc · 2020-11-05T16:24:30Z

Seems like varint will save a few bytes right?

robert-zaremba · 2020-11-05T16:34:24Z

I don't think so. Because we need to encode the mentisa anyway and varint has the bytes length prefix encoded in a sequence of bits (using unary encoding)

robert-zaremba · 2020-11-05T16:38:09Z

If we will like to optimize a storage, we just encode big.Int and read decimals from other place (eg: BatchInfo).
Simpler optimization is to use only one byte for int32 (exponent) - I can't think about any use case where we will need more than 32 decimal places.

robert-zaremba · 2020-11-05T16:40:00Z

and apd is limiting exponent as well: apd.MaxExponent = 100000

aaronc · 2020-11-05T16:42:41Z

Okay well these are details we can optimize later. I first would want to see if we can 1) get buy-in on the SDK side to adopt GDA (cosmos/cosmos-sdk#7773) and 2) whether they're open to a binary format. I personally don't believe we should just invent our own decimal standard which is what was done with sdk.Dec. Do you agree GDA (of some precision) is where we should be starting? Wonder what your thoughts are on Dev's comments in that thread.

aaronc · 2020-11-05T16:43:37Z

IMHO even int16 should be plenty for exponent no?

robert-zaremba · 2020-11-06T21:21:09Z

To be honest, I believe that 16 decimal places (int8) is more than enough. This is only for storing. For intermediate operations we could have more precision.

robert-zaremba · 2020-11-06T21:21:37Z

unless we want to support negative numbers, then 8 decimals might be not enough.

ruhatch · 2021-09-01T12:40:37Z

@clevinson can we close this as I think it's meaningfully tracked elsewhere?

robert-zaremba mentioned this issue Nov 5, 2020

Eco-Credit Module Proof of Concept #119

Merged

4 tasks

ryanchristo closed this as completed Mar 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider a right decimals approach for encoding decimal numbers #149

Consider a right decimals approach for encoding decimal numbers #149

robert-zaremba commented Nov 5, 2020 •

edited

Loading

aaronc commented Nov 5, 2020

aaronc commented Nov 5, 2020

robert-zaremba commented Nov 5, 2020 •

edited

Loading

aaronc commented Nov 5, 2020

aaronc commented Nov 5, 2020

robert-zaremba commented Nov 5, 2020 •

edited

Loading

aaronc commented Nov 5, 2020 •

edited

Loading

robert-zaremba commented Nov 5, 2020 •

edited

Loading

robert-zaremba commented Nov 5, 2020

robert-zaremba commented Nov 5, 2020

aaronc commented Nov 5, 2020

aaronc commented Nov 5, 2020

robert-zaremba commented Nov 6, 2020

robert-zaremba commented Nov 6, 2020

ruhatch commented Sep 1, 2021

Consider a right decimals approach for encoding decimal numbers #149

Consider a right decimals approach for encoding decimal numbers #149

Comments

robert-zaremba commented Nov 5, 2020 • edited Loading

Summary

Context

Related Work

Considerations

References:

aaronc commented Nov 5, 2020

aaronc commented Nov 5, 2020

robert-zaremba commented Nov 5, 2020 • edited Loading

aaronc commented Nov 5, 2020

aaronc commented Nov 5, 2020

robert-zaremba commented Nov 5, 2020 • edited Loading

aaronc commented Nov 5, 2020 • edited Loading

robert-zaremba commented Nov 5, 2020 • edited Loading

robert-zaremba commented Nov 5, 2020

robert-zaremba commented Nov 5, 2020

aaronc commented Nov 5, 2020

aaronc commented Nov 5, 2020

robert-zaremba commented Nov 6, 2020

robert-zaremba commented Nov 6, 2020

ruhatch commented Sep 1, 2021

robert-zaremba commented Nov 5, 2020 •

edited

Loading

robert-zaremba commented Nov 5, 2020 •

edited

Loading

robert-zaremba commented Nov 5, 2020 •

edited

Loading

aaronc commented Nov 5, 2020 •

edited

Loading

robert-zaremba commented Nov 5, 2020 •

edited

Loading