Issue72customdtoa #97

miloyip · 2014-08-09T14:09:14Z

Custom dtoa()

After two weeks of research and implementation, a custom dtoa() was implemented to replace sprintf() in Writer::Double(). A performance benchmark can be checked in another repository dtoa-benchmark.

Simply put, the custom dtoa() is an optimized header-only C++ implementation of Grisu2 algorithm (~400 lines of code). It always generate correct-rounding output. And it can generate shortest representation for >99.9% of input. sprintf() cannot output shortest representation.

Also, it fixes the local issue in #72, always generate valid JSON number text.

Removal of precision settings

Originally, Writer::SetPrecision() et al. APIs are added because sprintf("%g) has default precision of 6 decimal digits, that may lose precision in source values (thank to @pah 's effort in #19). The new dtoa() implementation makes this not necessary as it always generate output that can be convertible back to the source values, and it will try to make the output as short as possible.

So I think those APIs can be removed. A drawback is that user cannot reduce the precision in output as before.

Better number parsing

During the implementation, it is found that Reader::ParseNumber() cannot parse double correctly. After some research, the current "naive" implementation can be slightly modified to implement the fast path conversion. It may increase a little bit overhead by using division in half of time, but at the same time it cut down the lookup table in internal::Pow10() by half. By this modification, normal ranges of numbers can be converted roundtrip exactly.

Performance

During the benchmark, it is found that gcc (glib)'s sprintf(..., "%.17g") is very very slow, compared to VS2013 (Check this and this). I have not investigated the reasons behind at the moment.

Anyway, new dtoa() implementation is much faster.

Before:

VC2013 x64 
[       OK ] RapidJson.ReaderParseIterativeInsitu_DummyHandler_SSE42 (663 ms)
[       OK ] RapidJson.Writer_NullStream (325 ms)
[       OK ] RapidJson.Writer_StringBuffer (1020 ms)
[       OK ] RapidJson.PrettyWriter_StringBuffer (1420 ms)

Cygwin GCC x64
[       OK ] RapidJson.ReaderParseIterativeInsitu_DummyHandler_SSE42 (875 ms)
[       OK ] RapidJson.Writer_NullStream (5201 ms)
[       OK ] RapidJson.Writer_StringBuffer (5673 ms)
[       OK ] RapidJson.PrettyWriter_StringBuffer (6169 ms)

After:

VC2013 x64 After
[       OK ] RapidJson.ReaderParseIterativeInsitu_DummyHandler_SSE42 (661 ms)
[       OK ] RapidJson.Writer_NullStream (257 ms)
[       OK ] RapidJson.Writer_StringBuffer (697 ms)
[       OK ] RapidJson.PrettyWriter_StringBuffer (1037 ms)

Cygwin GCC x64 After
[       OK ] RapidJson.ReaderParseIterativeInsitu_DummyHandler_SSE42 (656 ms)
[       OK ] RapidJson.Writer_NullStream (250 ms)
[       OK ] RapidJson.Writer_StringBuffer (686 ms)
[       OK ] RapidJson.PrettyWriter_StringBuffer (1144 ms)

The JSON in these tests contains some JSON numbers, not extreme cases that most values are floating-point numbers. So these results only show part of improvement. Even this is not an extreme case, in NullStream tests VC2013 shows ~1.25x speedup, while gcc shows ~20x speedup.

Accurate rounding in normal numerical ranges, also reduce lookup table size.

Modified from Milo's Grisu2 implementation. 99.9% cases return shortest decimal format.

pah · 2014-08-10T15:46:39Z

Nice work! And I agree on the removal of the precision APIs. 👍

pah · 2014-08-10T15:50:15Z

include/rapidjson/internal/dtoa.h

+	uint32_t t = (i + 1) * 1233 >> 12;
+#elif __GNUC__
+	uint32_t t = (32 - __builtin_clz(n | 1)) * 1233 >> 12;
+#endif


Shouldn't we have an #else case here as well? At least with an #error?

Good catch.
I will add a standard C++ implementation.

It is simple and pure C++. And it is found in performance test that it is even faster than the original version, due to distribution of n. But the performance gain is not obvious in RapidJSON.

Fix #72

This drops #3 and #4, as their functionality has been superseded upstream, see Tencent/rapidjson#97 and Tencent/rapidjson#101. Conflicts: include/rapidjson/prettywriter.h include/rapidjson/reader.h include/rapidjson/writer.h

pah · 2014-09-23T07:54:49Z

See in the description above:

The new dtoa() implementation makes this not necessary as it always generate output that can be convertible back to the source values, and it will try to make the output as short as possible.

Do you really need explicitly lossy output?
Maybe you can round the values accordingly before writing them?

gidantribal · 2014-09-25T13:37:17Z

Yes, indeed I understood the reason behind the functionality on your pull request has been obsoleted. Actually the precision of our outputted doubles depends on the currency, thus should be possible to control it. I will push to remove this formatting limitation from the REST APIs, otherwise I'll use a custom Writer or a pre-rounding of the values for presenting them "nicely", as you suggested. Thanks a lot!

pah · 2014-09-25T17:45:05Z

It might be fairly simple to limit the number of "fraction digits" printed in Prettify (defined in include/rapidjson/internal/dtoa.h:352).

Forcing trailing zeroes up to the requested number of fractional digits could be done as well.

Limiting the total number of printed significant digits would be less useful, I guess.
Thoughts?

miloyip added 4 commits August 9, 2014 21:11

Change double parsing with fast-path conversion

6978778

Accurate rounding in normal numerical ranges, also reduce lookup table size.

Custom dtoa() impleemntation

a7762a3

Modified from Milo's Grisu2 implementation. 99.9% cases return shortest decimal format.

Fixed gcc effc++ warning in dtoa.h

0d91564

Remove double precision settings API in Writer

1900b7b

pah mentioned this pull request Aug 10, 2014

Please add license headers to source files #98

Closed

pah reviewed Aug 10, 2014
View reviewed changes

Change CountDecimalDigit32() to simple implementation

c549152

It is simple and pure C++. And it is found in performance test that it is even faster than the original version, due to distribution of n. But the performance gain is not obvious in RapidJSON.

miloyip added a commit that referenced this pull request Aug 11, 2014

Merge pull request #97 from miloyip/issue72customdtoa

adb3974

Fix #72

miloyip merged commit adb3974 into master Aug 11, 2014

pah mentioned this pull request Oct 7, 2014

User-defined double output precision #19

Merged

miloyip deleted the issue72customdtoa branch December 8, 2014 02:35

guzzard mentioned this pull request Jun 16, 2016

[LINUX] Editor crash when creating a new 3D project, or when trying to open some example projects AtomicGameEngine/AtomicGameEngine#710

Closed

sergiohs84 mentioned this pull request Jun 1, 2020

#14744. Improve webrtc stats meganz/MEGAchat#880

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue72customdtoa #97

Issue72customdtoa #97

miloyip commented Aug 9, 2014

pah commented Aug 10, 2014

pah Aug 10, 2014

miloyip Aug 10, 2014

pah commented Sep 23, 2014

gidantribal commented Sep 25, 2014

pah commented Sep 25, 2014

Issue72customdtoa #97

Issue72customdtoa #97

Conversation

miloyip commented Aug 9, 2014

Custom dtoa()

Removal of precision settings

Better number parsing

Performance

pah commented Aug 10, 2014

pah Aug 10, 2014

Choose a reason for hiding this comment

miloyip Aug 10, 2014

Choose a reason for hiding this comment

pah commented Sep 23, 2014

gidantribal commented Sep 25, 2014

pah commented Sep 25, 2014