Add support for decimal batch reader #22636

wypb · 2024-04-30T04:02:52Z

Description

Add support for decimal batch reader

Benchmark (lower is better)

Benchmark                                                          (byteArrayLength)  (decimalPrimitiveTypeName)  (enableOptimizedReader)  (nullable)  (writerVersion)  Mode  Cnt    Score    Error  Units
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A                      BINARY                     true        true      PARQUET_1_0  avgt   60   35.978 ±  0.213  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A                      BINARY                     true        true      PARQUET_2_0  avgt   60   56.711 ±  0.247  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A                      BINARY                     true       false      PARQUET_1_0  avgt   60   63.970 ±  0.239  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A                      BINARY                     true       false      PARQUET_2_0  avgt   60  104.495 ±  1.744  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A                      BINARY                    false        true      PARQUET_1_0  avgt   60   72.606 ±  0.766  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A                      BINARY                    false        true      PARQUET_2_0  avgt   60   64.700 ±  1.676  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A                      BINARY                    false       false      PARQUET_1_0  avgt   60  119.463 ±  2.066  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A                      BINARY                    false       false      PARQUET_2_0  avgt   60   97.458 ±  5.932  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A    FIXED_LEN_BYTE_ARRAY(16)                     true        true      PARQUET_1_0  avgt   60   19.863 ±  0.125  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A    FIXED_LEN_BYTE_ARRAY(16)                     true        true      PARQUET_2_0  avgt   60   57.809 ±  0.486  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A    FIXED_LEN_BYTE_ARRAY(16)                     true       false      PARQUET_1_0  avgt   60   29.744 ±  0.314  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A    FIXED_LEN_BYTE_ARRAY(16)                     true       false      PARQUET_2_0  avgt   60  100.839 ±  0.838  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A    FIXED_LEN_BYTE_ARRAY(16)                    false        true      PARQUET_1_0  avgt   60   69.015 ±  0.418  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A    FIXED_LEN_BYTE_ARRAY(16)                    false        true      PARQUET_2_0  avgt   60   67.082 ±  2.169  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A    FIXED_LEN_BYTE_ARRAY(16)                    false       false      PARQUET_1_0  avgt   60  113.877 ±  3.237  ns/op
BenchmarkDecimalColumnBatchReader.readLongDecimal                                N/A    FIXED_LEN_BYTE_ARRAY(16)                    false       false      PARQUET_2_0  avgt   60  101.167 ±  4.742  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT32                     true        true      PARQUET_1_0  avgt   60    5.892 ±  0.081  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT32                     true        true      PARQUET_2_0  avgt   60    7.909 ±  0.077  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT32                     true       false      PARQUET_1_0  avgt   60    2.198 ±  0.016  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT32                     true       false      PARQUET_2_0  avgt   60    6.230 ±  0.190  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT32                    false        true      PARQUET_1_0  avgt   60   25.590 ±  2.622  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT32                    false        true      PARQUET_2_0  avgt   60   20.364 ±  0.143  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT32                    false       false      PARQUET_1_0  avgt   60   16.077 ±  0.222  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT32                    false       false      PARQUET_2_0  avgt   60   20.276 ±  3.783  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT64                     true        true      PARQUET_1_0  avgt   60   13.379 ±  4.040  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT64                     true        true      PARQUET_2_0  avgt   60    9.310 ±  0.036  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT64                     true       false      PARQUET_1_0  avgt   60    3.766 ±  0.035  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT64                     true       false      PARQUET_2_0  avgt   60    8.813 ±  0.083  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT64                    false        true      PARQUET_1_0  avgt   60   25.213 ±  0.222  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT64                    false        true      PARQUET_2_0  avgt   60   21.266 ±  0.068  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT64                    false       false      PARQUET_1_0  avgt   60   22.471 ±  0.617  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                       INT64                    false       false      PARQUET_2_0  avgt   60   18.181 ±  0.937  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                      BINARY                     true        true      PARQUET_1_0  avgt   60   13.513 ±  1.030  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                      BINARY                     true        true      PARQUET_2_0  avgt   60   31.413 ±  7.525  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                      BINARY                     true       false      PARQUET_1_0  avgt   60   16.384 ±  2.283  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                      BINARY                     true       false      PARQUET_2_0  avgt   60   37.136 ±  0.573  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                      BINARY                    false        true      PARQUET_1_0  avgt   60   38.115 ±  2.961  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                      BINARY                    false        true      PARQUET_2_0  avgt   60   33.686 ±  0.890  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                      BINARY                    false       false      PARQUET_1_0  avgt   60   38.408 ±  0.306  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A                      BINARY                    false       false      PARQUET_2_0  avgt   60   39.839 ±  0.300  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A     FIXED_LEN_BYTE_ARRAY(8)                     true        true      PARQUET_1_0  avgt   60    6.254 ±  0.055  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A     FIXED_LEN_BYTE_ARRAY(8)                     true        true      PARQUET_2_0  avgt   60   18.820 ±  0.159  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A     FIXED_LEN_BYTE_ARRAY(8)                     true       false      PARQUET_1_0  avgt   60    3.435 ±  0.110  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A     FIXED_LEN_BYTE_ARRAY(8)                     true       false      PARQUET_2_0  avgt   60   29.901 ±  0.606  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A     FIXED_LEN_BYTE_ARRAY(8)                    false        true      PARQUET_1_0  avgt   60   31.011 ±  0.310  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A     FIXED_LEN_BYTE_ARRAY(8)                    false        true      PARQUET_2_0  avgt   60   35.836 ±  3.396  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A     FIXED_LEN_BYTE_ARRAY(8)                    false       false      PARQUET_1_0  avgt   60   36.312 ±  0.709  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimal                               N/A     FIXED_LEN_BYTE_ARRAY(8)                    false       false      PARQUET_2_0  avgt   60   61.468 ± 15.297  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  1                         N/A                     true        true      PARQUET_1_0  avgt   60    0.784 ±  0.046  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  1                         N/A                     true        true      PARQUET_2_0  avgt   60    6.970 ±  0.188  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  1                         N/A                     true       false      PARQUET_1_0  avgt   60    0.842 ±  0.048  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  1                         N/A                     true       false      PARQUET_2_0  avgt   60   12.153 ±  5.229  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  1                         N/A                    false        true      PARQUET_1_0  avgt   60   39.380 ±  4.667  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  1                         N/A                    false        true      PARQUET_2_0  avgt   60   21.779 ±  1.257  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  1                         N/A                    false       false      PARQUET_1_0  avgt   60   33.686 ±  0.484  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  1                         N/A                    false       false      PARQUET_2_0  avgt   60   18.937 ±  0.199  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  2                         N/A                     true        true      PARQUET_1_0  avgt   60    1.316 ±  0.030  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  2                         N/A                     true        true      PARQUET_2_0  avgt   60   27.600 ±  0.792  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  2                         N/A                     true       false      PARQUET_1_0  avgt   60    1.307 ±  0.032  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  2                         N/A                     true       false      PARQUET_2_0  avgt   60   27.122 ±  0.425  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  2                         N/A                    false        true      PARQUET_1_0  avgt   60   33.514 ±  0.238  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  2                         N/A                    false        true      PARQUET_2_0  avgt   60   43.301 ±  1.226  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  2                         N/A                    false       false      PARQUET_1_0  avgt   60   33.281 ±  0.459  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  2                         N/A                    false       false      PARQUET_2_0  avgt   60   39.255 ±  0.485  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  3                         N/A                     true        true      PARQUET_1_0  avgt   60    1.710 ±  0.022  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  3                         N/A                     true        true      PARQUET_2_0  avgt   60   28.296 ±  0.558  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  3                         N/A                     true       false      PARQUET_1_0  avgt   60    1.813 ±  0.094  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  3                         N/A                     true       false      PARQUET_2_0  avgt   60   27.826 ±  0.314  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  3                         N/A                    false        true      PARQUET_1_0  avgt   60   36.132 ±  1.231  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  3                         N/A                    false        true      PARQUET_2_0  avgt   60   41.832 ±  1.558  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  3                         N/A                    false       false      PARQUET_1_0  avgt   60   34.004 ±  0.396  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  3                         N/A                    false       false      PARQUET_2_0  avgt   60   38.750 ±  0.302  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  4                         N/A                     true        true      PARQUET_1_0  avgt   60    2.203 ±  0.043  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  4                         N/A                     true        true      PARQUET_2_0  avgt   60   28.282 ±  0.533  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  4                         N/A                     true       false      PARQUET_1_0  avgt   60    2.149 ±  0.029  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  4                         N/A                     true       false      PARQUET_2_0  avgt   60   28.413 ±  0.598  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  4                         N/A                    false        true      PARQUET_1_0  avgt   60   34.270 ±  0.404  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  4                         N/A                    false        true      PARQUET_2_0  avgt   60   39.215 ±  0.549  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  4                         N/A                    false       false      PARQUET_1_0  avgt   60   36.183 ±  0.952  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  4                         N/A                    false       false      PARQUET_2_0  avgt   60   40.536 ±  0.995  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  5                         N/A                     true        true      PARQUET_1_0  avgt   60    2.498 ±  0.101  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  5                         N/A                     true        true      PARQUET_2_0  avgt   60   28.847 ±  0.456  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  5                         N/A                     true       false      PARQUET_1_0  avgt   60    2.499 ±  0.083  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  5                         N/A                     true       false      PARQUET_2_0  avgt   60   29.880 ±  0.273  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  5                         N/A                    false        true      PARQUET_1_0  avgt   60   34.867 ±  0.772  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  5                         N/A                    false        true      PARQUET_2_0  avgt   60   41.475 ±  1.487  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  5                         N/A                    false       false      PARQUET_1_0  avgt   60   38.280 ±  4.063  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  5                         N/A                    false       false      PARQUET_2_0  avgt   60   39.719 ±  0.523  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  6                         N/A                     true        true      PARQUET_1_0  avgt   60    2.642 ±  0.061  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  6                         N/A                     true        true      PARQUET_2_0  avgt   60   29.860 ±  0.367  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  6                         N/A                     true       false      PARQUET_1_0  avgt   60    2.663 ±  0.042  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  6                         N/A                     true       false      PARQUET_2_0  avgt   60   29.849 ±  0.626  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  6                         N/A                    false        true      PARQUET_1_0  avgt   60   35.169 ±  0.158  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  6                         N/A                    false        true      PARQUET_2_0  avgt   60   40.821 ±  0.510  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  6                         N/A                    false       false      PARQUET_1_0  avgt   60   38.378 ±  1.814  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  6                         N/A                    false       false      PARQUET_2_0  avgt   60   44.101 ±  4.470  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  7                         N/A                     true        true      PARQUET_1_0  avgt   60    2.788 ±  0.036  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  7                         N/A                     true        true      PARQUET_2_0  avgt   60   29.854 ±  0.341  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  7                         N/A                     true       false      PARQUET_1_0  avgt   60    2.868 ±  0.052  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  7                         N/A                     true       false      PARQUET_2_0  avgt   60   30.821 ±  0.544  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  7                         N/A                    false        true      PARQUET_1_0  avgt   60   38.564 ±  1.608  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  7                         N/A                    false        true      PARQUET_2_0  avgt   60   41.938 ±  0.737  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  7                         N/A                    false       false      PARQUET_1_0  avgt   60   37.818 ±  2.376  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  7                         N/A                    false       false      PARQUET_2_0  avgt   60   86.133 ± 29.822  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  8                         N/A                     true        true      PARQUET_1_0  avgt   60    4.066 ±  0.423  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  8                         N/A                     true        true      PARQUET_2_0  avgt   60   34.223 ±  3.815  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  8                         N/A                     true       false      PARQUET_1_0  avgt   60    4.593 ±  0.343  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  8                         N/A                     true       false      PARQUET_2_0  avgt   60   30.767 ±  1.049  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  8                         N/A                    false        true      PARQUET_1_0  avgt   60   50.825 ±  7.990  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  8                         N/A                    false        true      PARQUET_2_0  avgt   60   55.074 ±  6.835  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  8                         N/A                    false       false      PARQUET_1_0  avgt   60   43.135 ±  9.506  ns/op
BenchmarkDecimalColumnBatchReader.readShortDecimalByteArrayLength                  8                         N/A                    false       false      PARQUET_2_0  avgt   60   48.081 ±  5.818  ns/op

Impact

When the Parquet batch reader is enabled (parquet_batch_read_optimization_enabled=true), decimal columns are read in batch mode.

Test Plan

Release Notes

== RELEASE NOTES ==

Hive Connector Changes
* Add support for decimal batch reader :pr:`22636`

CC: @zhenxiao

zhenxiao

Nice work, @wypb.
Looks good. A few comments about code comments and code structure.

zhenxiao · 2024-04-30T19:58:51Z

presto-parquet/src/main/java/com/facebook/presto/parquet/ParquetTypeUtils.java

+        switch (length) {
+            case 8:
+                value |= bytes[startOffset + 7] & 0xFFL;
+                // fall through


This comment does not seem useful.
The same applies to the following lines.

zhenxiao · 2024-04-30T20:00:53Z

presto-parquet/src/main/java/com/facebook/presto/parquet/ParquetTypeUtils.java

+        }
+
+        DecimalLogicalTypeAnnotation decimalLogicalTypeAnnotation = (DecimalLogicalTypeAnnotation) logicalTypeAnnotation;
+        return decimalLogicalTypeAnnotation.getPrecision() <= Decimals.MAX_SHORT_PRECISION;


Use a static import for Decimals.MAX_SHORT_PRECISION.

zhenxiao · 2024-04-30T20:11:34Z

...acebook/presto/parquet/batchreader/decoders/ShortDecimalFixedWidthByteArrayBatchDecoder.java

+public class ShortDecimalFixedWidthByteArrayBatchDecoder
+{
+    private static final ShortDecimalDecoder[] VALUE_DECODERS = new ShortDecimalDecoder[] {
+            new BigEndianReader1(),


Why do we need 7 readers? Could you add a comment?

This further optimizes the reading speed of short decimals. The implementation of ShortDecimalFixedWidthByteArrayBatchDecoder is based on Trino's implementation: trinodb/trino@f71a815
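
For readers of this thread, here is a minimal sketch of the general idea of keeping one reader per byte width and choosing it once per batch. It is not the PR's code or Trino's: the class name, the `Reader` interface, and the `decode`/`decodeBatch` methods are hypothetical, and only widths 1 and 2 are written out loop-free, with the remaining widths using a generic loop to keep the sketch short.

```java
// Hypothetical sketch: class and method names are illustrative, not from the PR.
final class FixedWidthShortDecimalSketch
{
    /** Reads one big-endian two's-complement value starting at offset. */
    interface Reader
    {
        long read(byte[] bytes, int offset);
    }

    // One reader per byte width 1..8; index = width - 1.
    // Widths 1 and 2 are written out by hand to show the loop-free form;
    // the others use a generic loop only to keep this sketch short.
    private static final Reader[] READERS = new Reader[] {
            (b, o) -> b[o],
            (b, o) -> (long) b[o] << 8 | (b[o + 1] & 0xFFL),
            generic(3), generic(4), generic(5), generic(6), generic(7), generic(8),
    };

    private static Reader generic(int width)
    {
        return (b, o) -> {
            long value = b[o]; // the first byte carries the sign
            for (int i = 1; i < width; i++) {
                value = value << 8 | (b[o + i] & 0xFFL);
            }
            return value;
        };
    }

    static long decode(byte[] bytes, int offset, int width)
    {
        if (width < 1 || width > Long.BYTES) {
            throw new IllegalArgumentException("width must be in [1, 8]: " + width);
        }
        return READERS[width - 1].read(bytes, offset);
    }

    // The reader is chosen once per batch, not once per value.
    static void decodeBatch(byte[] input, int width, long[] output)
    {
        Reader reader = READERS[width - 1];
        for (int i = 0; i < output.length; i++) {
            output[i] = reader.read(input, i * width);
        }
    }
}
```

The point of the per-width table is that the width check happens outside the inner loop, so each value is decoded by straight-line code for its width.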

zhenxiao · 2024-04-30T20:16:11Z

...facebook/presto/parquet/batchreader/decoders/delta/BinaryShortDecimalDeltaValuesDecoder.java

+import static com.facebook.presto.parquet.ParquetTypeUtils.getShortDecimalValue;
+import static com.google.common.base.Preconditions.checkArgument;
+
+public class BinaryShortDecimalDeltaValuesDecoder


Shall we introduce an abstract class with BinaryLongDecimalDeltaValuesDecoder and BinaryShortDecimalDeltaValuesDecoder as its subclasses?
The two classes share a lot of code.

I'll see how to refactor these two classes because they implement different interfaces.

zhenxiao · 2024-04-30T20:17:20Z

.../facebook/presto/parquet/batchreader/decoders/delta/Int64ShortDecimalDeltaValuesDecoder.java

+import static com.google.common.base.Preconditions.checkArgument;
+import static java.util.Objects.requireNonNull;
+
+public class Int64ShortDecimalDeltaValuesDecoder


Shall we introduce an abstract class with Int32ShortDecimalDeltaValuesDecoder and Int64ShortDecimalDeltaValuesDecoder as its subclasses? The classes share a lot of code.

The common parts of Int32ShortDecimalDeltaValuesDecoder and Int64ShortDecimalDeltaValuesDecoder have been extracted into AbstractInt64AndInt32ShortDecimalDeltaValuesDecoder.

zhenxiao · 2024-04-30T20:18:12Z

...facebook/presto/parquet/batchreader/decoders/delta/LongDecimalApacheParquetValueDecoder.java

+import static io.airlift.slice.SizeOf.SIZE_OF_LONG;
+import static java.util.Objects.requireNonNull;
+
+public class LongDecimalApacheParquetValueDecoder


Bad naming: why do Apache and Parquet appear in the name LongDecimalApacheParquetValueDecoder?

I have renamed LongDecimalApacheParquetValueDecoder and ShortDecimalApacheParquetValueDecoder to FixedLenByteArrayLongDecimalDeltaValueDecoder and FixedLenByteArrayShortDecimalDeltaValueDecoder, respectively.

zhenxiao · 2024-04-30T20:18:20Z

...acebook/presto/parquet/batchreader/decoders/delta/ShortDecimalApacheParquetValueDecoder.java

+import static com.google.common.base.Preconditions.checkArgument;
+import static java.util.Objects.requireNonNull;
+
+public class ShortDecimalApacheParquetValueDecoder


yingsu00

@wypb Thank you so much for this great work. I wonder if you could add tests?

presto-common/src/main/java/com/facebook/presto/common/type/UnscaledDecimal128Arithmetic.java

yingsu00 · 2024-05-05T00:47:55Z

presto-parquet/src/main/java/com/facebook/presto/parquet/ParquetTypeUtils.java

+        return decimalLogicalTypeAnnotation.getPrecision() <= Decimals.MAX_SHORT_PRECISION;
+    }
+
+    public static boolean isLongDecimalType(ColumnDescriptor descriptor)


This function is not used

Already removed.

presto-parquet/src/test/java/com/facebook/presto/parquet/BenchmarkParquetReader.java

presto-parquet/src/main/java/com/facebook/presto/parquet/batchreader/decoders/Decoders.java

presto-parquet/src/main/java/com/facebook/presto/parquet/ColumnReaderFactory.java

yingsu00 · 2024-05-05T04:52:49Z

...a/com/facebook/presto/parquet/batchreader/decoders/plain/ShortDecimalPlainValuesDecoder.java

+        int inputBytesOffset = input.getByteArrayOffset();
+        for (int i = offset; i < offset + length; i++) {
+            checkBytesFitInShortDecimal(inputBytes, inputBytesOffset, extraBytesLength, columnDescriptor);
+            values[i] = getShortDecimalValue(inputBytes, inputBytesOffset + extraBytesLength, Long.BYTES);


What if extraBytesLength < 0?

extraBytesLength cannot be negative here. It is always greater than 0, because the typeLength <= Long.BYTES case is handled before extraBytesLength is computed:

if (typeLength <= Long.BYTES) { .... } int extraBytesLength = typeLength - Long.BYTES;

yingsu00 · 2024-05-05T04:59:58Z

...a/com/facebook/presto/parquet/batchreader/decoders/plain/ShortDecimalPlainValuesDecoder.java

+        // Equivalent to expectedValue = bytes[endOffset] < 0 ? -1 : 0
+        byte expectedValue = (byte) (bytes[endOffset] >> 7);
+        for (int i = offset; i < endOffset; i++) {
+            if (bytes[i] != expectedValue) {


Could you explain how this works? Suppose inputBytesOffset = 0 and typeLength = 9 here. Then extraBytesLength = 1, and checkBytesFitInShortDecimal(inputBytes, 0, 1, descriptor) is called. Your expectedValue is computed as inputBytes[1] < 0 ? -1 : 0. Since the values are encoded in big-endian byte order, I assume you wanted to check the most significant byte, which should be inputBytes[0], but you're checking the second most significant byte, inputBytes[1]. Does that work?

Also, the largest precision for a short decimal is 18, and the largest value is 999,999,999,999,999,999, which fits in the 60-bit value 0xDE0B6B3A763FFFF. If you really need to verify that there is no overflow, bits 61-64 also need to be checked. I don't see that done here. Could you explain your idea in a bit more detail?

bytes[endOffset] in the code above is the most significant byte of the trailing 8 bytes that hold the value. To illustrate this better, I built a test locally. The original data is 123456789012.12345678, with typeLength=14. The contents of bytes are [2, 0, 0, 0, 3, 1, 0, 0, 0, 0, 0, 0, -85, 84, -87, -116, -23, -53, -11, 78]

NOTE: -85, 84, -87, -116, -23, -53, -11, 78 are the 8 bytes (64 bits) of the big-endian binary representation of 12345678901212345678.

-85, 84, -87, -116, -23, -53, -11, 78 binary representation:
1010101101010100101010011000110011101001110010111111010101001110
12345678901212345678 binary representation:
1010101101010100101010011000110011101001110010111111010101001110

In this scenario, inputBytesOffset = 6 and extraBytesLength = 6, so endOffset = 12 and bytes[endOffset] = -85, which is the most significant byte of those 8 bytes. Then we check whether each of bytes[6..11] equals -1. Attached is the file I tested.
86216465-ad5d-4acc-bc8b-0d1972149d8c.tgz
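
The check discussed above can be sketched as follows. This is a hedged illustration, not the PR's code: the class name and the `fitsInLong`/`fitsInLongReference` methods are hypothetical, and only the loop mirrors the snippet under review. It treats the leading `extraBytesLength` bytes as padding that must equal the sign extension of the trailing 8 bytes, and compares that against a `BigInteger` reference.

```java
import java.math.BigInteger;
import java.util.Arrays;

// Hypothetical sketch: names are illustrative; only the loop mirrors the snippet under review.
final class ShortDecimalFitSketch
{
    // True when the big-endian two's-complement value stored in
    // bytes[offset, offset + typeLength) can be held in the trailing 8 bytes.
    // Assumes typeLength > Long.BYTES.
    static boolean fitsInLong(byte[] bytes, int offset, int typeLength)
    {
        int extraBytesLength = typeLength - Long.BYTES;
        int endOffset = offset + extraBytesLength;
        // -1 if the top byte of the trailing 8 bytes is negative, otherwise 0
        byte expectedValue = (byte) (bytes[endOffset] >> 7);
        for (int i = offset; i < endOffset; i++) {
            if (bytes[i] != expectedValue) {
                return false;
            }
        }
        return true;
    }

    // Reference check via BigInteger, for comparison only.
    static boolean fitsInLongReference(byte[] bytes, int offset, int typeLength)
    {
        BigInteger value = new BigInteger(Arrays.copyOfRange(bytes, offset, offset + typeLength));
        // bitLength() excludes the sign bit, so a long holds at most 63 bits
        return value.bitLength() < Long.SIZE;
    }
}
```

With the byte array quoted in the reply (offset 6, typeLength 14), both methods return false: the trailing 8 bytes start with -85, so the expected padding is -1, while the six padding bytes are 0. That matches 12345678901212345678 being larger than the maximum signed 64-bit value. Note that this sketch only tests whether the value fits in 64 bits; whether the precision-18 bound raised in the review is checked elsewhere is not shown in this thread.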

elharo

This PR seems short on new tests. I would have expected quite a number given all the new code.

presto-parquet/src/main/java/com/facebook/presto/parquet/ParquetTypeUtils.java

elharo · 2024-05-09T11:04:51Z

presto-parquet/src/main/java/com/facebook/presto/parquet/ParquetTypeUtils.java


-        for (int i = 0; i < bytes.length; i++) {
-            value |= ((long) bytes[bytes.length - i - 1] & 0xFFL) << (8 * i);
+    public static long getShortDecimalValue(byte[] bytes, int startOffset, int length)


Can this be private, or at least non-public?

No, this method is used in many places.

elharo · 2024-05-09T11:06:39Z

presto-parquet/src/test/java/com/facebook/presto/parquet/reader/TestData.java

+        return data;
+    }
+
+    private static int propagateSignBit(int value, int bitsToPad)


Why is this both here and in BytesUtils?

I looked at the code and found that this function can actually be deleted.

elharo · 2024-05-09T11:07:34Z

presto-parquet/src/main/java/com/facebook/presto/parquet/batchreader/BytesUtils.java

@@ -61,4 +61,9 @@ public static void unpack8Values(byte inByte, byte[] out, int outPos)
        out[6 + outPos] = (byte) (inByte >> 6 & 1);
        out[7 + outPos] = (byte) (inByte >> 7 & 1);
    }
+
+    public static long propagateSignBit(long value, int bitsToPad)


This method could use a unit test that addresses it directly
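A direct unit test could look roughly like the sketch below. The method body is not shown in this thread, so the shift-left-then-arithmetic-shift-right implementation here is an assumption, and the class name and check helper are hypothetical.

```java
final class PropagateSignBitSketch {
    // Assumed body: move the payload to the top of the long, then shift back
    // arithmetically so the payload's top bit fills the padded bits.
    static long propagateSignBit(long value, int bitsToPad) {
        return (value << bitsToPad) >> bitsToPad;
    }

    static void check(long actual, long expected) {
        if (actual != expected) {
            throw new AssertionError("expected " + expected + " but got " + actual);
        }
    }

    public static void main(String[] args) {
        check(propagateSignBit(0x7FL, 56), 127L);       // 1-byte payload, sign bit clear
        check(propagateSignBit(0x80L, 56), -128L);      // 1-byte payload, sign bit set
        check(propagateSignBit(0xFFFFL, 48), -1L);      // 2-byte payload, all ones
        check(propagateSignBit(0x1234L, 48), 0x1234L);  // 2-byte payload, positive
        check(propagateSignBit(-5L, 0), -5L);           // no padding leaves the value unchanged
    }
}
```

The cases cover both sign states at two payload widths plus the zero-padding edge case.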

elharo · 2024-05-09T11:08:20Z

...to-parquet/src/main/java/com/facebook/presto/parquet/batchreader/SimpleSliceInputStream.java

+    public SimpleSliceInputStream(Slice slice, int offset)
+    {
+        this.slice = requireNonNull(slice, "slice is null");
+        checkArgument(slice.length() == 0 || slice.hasByteArray(), "SimpleSliceInputStream supports only slices backed by byte array");


only supports

wypb · 2024-05-09T11:36:44Z

@zhenxiao @yingsu00 @elharo sorry for the late reply. I have made modifications according to the previous review comments.

I wonder if you could add tests?

I generated Parquet files with different encodings locally. I will see how to add them to the test set.
In addition, BenchmarkShortDecimalColumnReader and BenchmarkLongDecimalColumnReader include verification logic for data in different encodings, in both batch mode and non-batch mode.

steveburnett · 2024-05-09T14:30:10Z

Nit, suggest release note entry change:

== RELEASE NOTES ==

Hive Connector Changes
* Add support decimal batch reader :pr:`22636`

wypb · 2024-05-10T03:10:08Z

@zhenxiao @yingsu00 I added three new tests in AbstractTestParquetReader.java: testLongDecimalBackedByBinary, testShortDecimalBackedByBinary and testShortDecimalBackedByFixedLenByteArray. Together with the existing testDecimalBackedByFixedLenByteArray (now renamed to testLongDecimalBackedByFixedLenByteArray), testDecimalBackedByINT64 and DecimalBackedByINT32 tests, they cover all the decoders I added this time.

wypb · 2024-05-22T12:02:34Z

@zhenxiao @yingsu00 Any comments on this?

zhenxiao

looks good to me

elharo

There's a lot of new public API here that feels like it needs unit tests, e.g. SimpleSliceInputStream

tdcmeehan · 2024-06-05T15:04:45Z

@wypb could you add more tests to address @elharo's feedback?

wypb · 2024-06-06T02:41:25Z

Hi @tdcmeehan @elharo, thank you for your review. I'll add some test cases soon.

yingsu00

@wypb Will you please add the tests @elharo requested? For example, you can reference TestBooleanStream for writing tests for SimpleSliceInputStream. Thank you!

yingsu00 · 2024-07-08T16:04:26Z

presto-parquet/src/main/java/com/facebook/presto/parquet/batchreader/decoders/Decoders.java

@@ -173,11 +234,46 @@ private static final ValuesDecoder createValuesDecoder(ColumnDescriptor columnDe

        if ((encoding == DELTA_BYTE_ARRAY || encoding == DELTA_LENGTH_BYTE_ARRAY) && type == PrimitiveTypeName.BINARY) {
            ByteBufferInputStream inputStream = ByteBufferInputStream.wrap(ByteBuffer.wrap(buffer, offset, length));
+
+            Optional<Type> prestoType = createDecimalType(columnDescriptor);


I know this is from existing code, but it is confusing to detect whether the logical type is decimal by calling createDecimalType() and checking its return value. It would be clearer to make the type detection consistent with the other encodings, e.g. what you did on lines 195 and 196.

Already refactored, thanks.

yingsu00 · 2024-07-08T16:31:02Z

...esto/parquet/batchreader/decoders/plain/FixedLenByteArrayShortDecimalPlainValuesDecoder.java

+
+    public FixedLenByteArrayShortDecimalPlainValuesDecoder(ColumnDescriptor columnDescriptor, byte[] byteBuffer, int bufferOffset, int length)
+    {
+        this.columnDescriptor = columnDescriptor;


I know the existing decoders code has the same pattern, but can you please add requireNonNull() on columnDescriptor and byteBuffer? Same for all other constructors for the classes you add. Thanks!
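As a sketch of the requested pattern: ExampleValuesDecoder is a hypothetical stand-in, and columnDescriptor is typed as Object only so the snippet compiles without the Parquet library. The message style mirrors the "slice is null" check quoted earlier in this review.

```java
import static java.util.Objects.requireNonNull;

final class ExampleValuesDecoder {
    private final Object columnDescriptor; // stands in for Parquet's ColumnDescriptor
    private final byte[] byteBuffer;
    private final int bufferOffset;
    private final int length;

    ExampleValuesDecoder(Object columnDescriptor, byte[] byteBuffer, int bufferOffset, int length) {
        // Fail fast at construction time instead of with a late NullPointerException in readNext().
        this.columnDescriptor = requireNonNull(columnDescriptor, "columnDescriptor is null");
        this.byteBuffer = requireNonNull(byteBuffer, "byteBuffer is null");
        this.bufferOffset = bufferOffset;
        this.length = length;
    }

    int length() {
        return length;
    }
}
```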

wypb · 2024-07-09T08:42:07Z

Hi @elharo @tdcmeehan @yingsu00, thanks for your input and sorry for the delay.

I modified some code according to the previous review and added a new test case com.facebook.presto.parquet.batchreader.TestSimpleSliceInputStream. I have rebased the code and the diff file can be found in d527a91.

elharo · 2024-07-09T11:30:37Z

presto-hive/src/test/java/com/facebook/presto/hive/parquet/AbstractTestParquetReader.java

+            throws Exception
+    {
+        for (int precision = 1; precision <= MAX_SHORT_PRECISION; precision++) {
+            int scale = ThreadLocalRandom.current().nextInt(precision);


I get very nervous seeing random in tests. If something fails, it's not reproducible. Use fixed constant test data instead, even if the fixed values are initially randomly chosen.
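One way to follow this suggestion is to replace the ThreadLocalRandom call with a fixed table of scales, one per precision. The values below are arbitrary constants picked for illustration, the class name is hypothetical, and MAX_SHORT_PRECISION is assumed to be 18 as discussed earlier in this thread.

```java
final class FixedDecimalTestData {
    static final int MAX_SHORT_PRECISION = 18; // assumed value

    // One fixed scale per precision 1..18; each entry is in the range [0, precision).
    static final int[] SCALES = {0, 1, 0, 2, 3, 1, 5, 4, 7, 2, 9, 6, 0, 11, 8, 3, 13, 10};

    static int scaleFor(int precision) {
        return SCALES[precision - 1];
    }

    public static void main(String[] args) {
        for (int precision = 1; precision <= MAX_SHORT_PRECISION; precision++) {
            // Every run exercises the same (precision, scale) pairs, so a failure can be replayed.
            System.out.println("decimal(" + precision + ", " + scaleFor(precision) + ")");
        }
    }
}
```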

elharo · 2024-07-09T11:31:29Z

presto-parquet/src/main/java/com/facebook/presto/parquet/ColumnReaderFactory.java

+                    if (!isNested && isShortDecimalType(descriptor)) {
+                        int precision = ((DecimalLogicalTypeAnnotation) descriptor.getPrimitiveType().getLogicalTypeAnnotation()).getPrecision();
+                        if (precision < 10) {
+                            log.warn("PrimitiveTypeName is INT64 but precision is less then 10.");


delete the warning; it's not actionable

Or, if I'm wrong and this is a real problem, then a warning isn't enough. This should fail outright.

The spec says we need to produce a warning when precision < 10 for INT64. See https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#decimal

elharo · 2024-07-09T11:35:20Z

...resto/parquet/batchreader/decoders/delta/FixedLenByteArrayShortDecimalDeltaValueDecoder.java

+ * is not a common one, just use the existing one provided by Parquet library and add a wrapper around it that satisfies the
+ * {@link ShortDecimalValuesDecoder} interface.
+ */
+public class FixedLenByteArrayShortDecimalDeltaValueDecoder


avoid abbreviations; thus FixedLen --> FixedLength

but aren't all arrays in java fixed length? So maybe just ByteArrayShortDecimalDeltaValueDecoder

The naming convention of the Decoder class name here is ParquetPrimitiveType + [Short|Long]Decimal + encoding + ValuesDecoder. The PrimitiveTypeName corresponding to this class is FIXED_LEN_BYTE_ARRAY, so this name is used.

elharo · 2024-07-09T11:38:18Z

.../facebook/presto/parquet/batchreader/decoders/plain/BinaryLongDecimalPlainValuesDecoder.java

+            int positionOffset = offsets[i];
+            int positionLength = offsets[i + 1] - positionOffset;
+            byte[] temp = new byte[positionLength];
+            System.arraycopy(byteBuffer, positionOffset, temp, 0, positionLength);


Personally I find Arrays.copyOf a little easier to read; up to you

I changed it to byte[] temp = Arrays.copyOfRange(byteBuffer, positionOffset, positionOffset + positionLength);
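For comparison, a small sketch of the two forms side by side. The class and method names are illustrative only.

```java
import java.util.Arrays;

final class CopyRangeComparison {
    // Original form: allocate the target, then copy into it.
    static byte[] withArraycopy(byte[] source, int positionOffset, int positionLength) {
        byte[] temp = new byte[positionLength];
        System.arraycopy(source, positionOffset, temp, 0, positionLength);
        return temp;
    }

    // Replacement form: allocation and copy in one call; the end index is exclusive.
    static byte[] withCopyOfRange(byte[] source, int positionOffset, int positionLength) {
        return Arrays.copyOfRange(source, positionOffset, positionOffset + positionLength);
    }

    public static void main(String[] args) {
        byte[] source = {10, 20, 30, 40, 50};
        System.out.println(Arrays.equals(withArraycopy(source, 1, 3), withCopyOfRange(source, 1, 3)));
    }
}
```

One behavioral difference to keep in mind: if the requested range runs past the end of the source, Arrays.copyOfRange pads the result with zeros, while System.arraycopy throws an IndexOutOfBoundsException.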

elharo · 2024-07-09T11:40:12Z

...resto/parquet/batchreader/decoders/plain/FixedLenByteArrayLongDecimalPlainValuesDecoder.java

+    @Override
+    public void readNext(long[] values, int offset, int length)
+    {
+        final byte[] localByteBuffer = byteBuffer;


Danger! Since this local variable is just a reference, byteBuffer is modified anyway when localByteBuffer is. Code below might be correct, I'm not sure, but this variable should be removed.

Nice catch.
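The aliasing concern can be seen in isolation with a sketch like the following. The class is hypothetical and is not the decoder from this PR.

```java
final class ArrayAliasSketch {
    private final byte[] byteBuffer = {1, 2, 3};

    void overwriteFirstThroughLocal(byte newValue) {
        final byte[] localByteBuffer = byteBuffer; // copies the reference, not the array
        localByteBuffer[0] = newValue;             // the write is visible through byteBuffer too
    }

    byte first() {
        return byteBuffer[0];
    }

    public static void main(String[] args) {
        ArrayAliasSketch sketch = new ArrayAliasSketch();
        sketch.overwriteFirstThroughLocal((byte) 9);
        System.out.println(sketch.first()); // prints 9
    }
}
```

Declaring the local as final only prevents reassigning the variable; it does not protect the array contents.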

elharo · 2024-07-09T11:40:47Z

...esto/parquet/batchreader/decoders/plain/FixedLenByteArrayShortDecimalPlainValuesDecoder.java

+        byte expectedValue = (byte) (bytes[endOffset] >> 7);
+        for (int i = offset; i < endOffset; i++) {
+            if (bytes[i] != expectedValue) {
+                throw new PrestoException(NOT_SUPPORTED, format(


string concatenation is simpler here

elharo · 2024-07-09T11:41:52Z

...k/presto/parquet/batchreader/decoders/plain/ShortDecimalFixedWidthByteArrayBatchDecoder.java

+
+                // We first shift the byte as left as possible. Then, when shifting back right,
+                // the sign bit will get propagated
+                values[offset] = value << 56 >> 56;


Order of operations is foggy. Please use parentheses to make this explicit.
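To make the grouping concrete: Java's shift operators share one precedence level and associate left to right, so the unparenthesized expression already means (value << 56) >> 56. The sketch below, with illustrative names, compares it against the explicit form and against the other possible grouping.

```java
final class ShiftGrouping {
    // What the quoted line does today.
    static long implicit(long value) {
        return value << 56 >> 56;
    }

    // Same result, with the intent spelled out.
    static long explicit(long value) {
        return (value << 56) >> 56;
    }

    // The other grouping: 56 >> 56 is an int shift by 56 & 31 = 24 bits, which yields 0,
    // so this returns value unchanged.
    static long rightGrouped(long value) {
        return value << (56 >> 56);
    }

    public static void main(String[] args) {
        System.out.println(implicit(0xABL));     // prints -85
        System.out.println(explicit(0xABL));     // prints -85
        System.out.println(rightGrouped(0xABL)); // prints 171
    }
}
```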

elharo · 2024-07-09T11:43:19Z

presto-parquet/src/test/java/com/facebook/presto/parquet/reader/TestData.java

+            long[] result = new long[size];
+            for (int i = 0; i < size; i++) {
+                result[i] = Math.max(
+                        Math.min(randomLong(random, bitWidth), max),


avoid random numbers in tests. Test results need to be reproducible.

wypb · 2024-07-10T02:15:22Z

Thank you @elharo for the review! I have updated with a new (squashed) commit, and I have addressed all your comments. Let me know if I missed anything.

yingsu00 · 2024-07-10T15:31:23Z

@wypb There is a test failure: Failures:
[ERROR] TestPrestoNativeIcebergTpcdsQueriesParquetUsingThrift.doDeletesAndQuery:399->verifyDeletes:390->AbstractTestQueryFramework.computeScalar:149->AbstractTestQueryFramework.computeActual:139 ? Runtime size <= capacity_ (18446744073709551176 vs. 1440) Split [Hive: file:/tmp/iceberg_data/HIVE/tpcds/store_sales/data/183baea9-babf-4daa-875b-2148de383c5a.parquet 0 - 2071914] Task 20240710_023723_00114_2m4dj.1.0.2.0 Operator: TableScan[0] 0

Can you please investigate? Thanks!

tdcmeehan · 2024-07-10T15:38:15Z

@yingsu00 this is actually likely due to facebookincubator/velox#10261 and is being backed out in facebookincubator/velox#10431. It was added in #23138 (which appeared to have been merged in spite of the failure it introduced). See: #23156

wypb · 2024-07-11T02:57:32Z

Hi @tdcmeehan thank you for sharing the information. @yingsu00 I synchronized the latest code, and now CI is all green.

yingsu00 · 2024-07-12T16:15:38Z

@yingsu00 this is actually likely due to facebookincubator/velox#10261 and is being backed out in facebookincubator/velox#10431. It was added in #23138 (which appeared to have been merged in spite of the failure it introduced). See: #23156

Thank you @tdcmeehan for letting me know!

wypb requested review from shangxinli and a team as code owners April 30, 2024 04:02

wypb requested a review from presto-oss April 30, 2024 04:02

wypb force-pushed the decimal_vector branch 6 times, most recently from b0972c0 to 96f2271 Compare April 30, 2024 13:26

tdcmeehan self-assigned this Apr 30, 2024

tdcmeehan requested a review from zhenxiao April 30, 2024 14:12

zhenxiao reviewed Apr 30, 2024

yingsu00 reviewed May 5, 2024

wypb force-pushed the decimal_vector branch 2 times, most recently from 6a1eda5 to 49c8f19 Compare May 9, 2024 11:00

elharo reviewed May 9, 2024

wypb force-pushed the decimal_vector branch from e3a76b6 to 7e30a23 Compare May 9, 2024 11:22

wypb force-pushed the decimal_vector branch from 7b61218 to 67b26e5 Compare May 9, 2024 11:42

wypb force-pushed the decimal_vector branch from 1d7eba8 to b5b30f3 Compare May 10, 2024 02:06

wypb changed the title ~~Add support decimal batch reader~~ Add support for decimal batch reader May 10, 2024

wypb force-pushed the decimal_vector branch 2 times, most recently from 153aa02 to c182a88 Compare May 10, 2024 03:04

wypb force-pushed the decimal_vector branch 2 times, most recently from 7fa98f4 to fd3a640 Compare May 10, 2024 03:42

zhenxiao previously approved these changes May 22, 2024

elharo reviewed May 23, 2024

yingsu00 reviewed Jul 8, 2024

wypb force-pushed the decimal_vector branch from 9a29c2e to da06b8e Compare July 9, 2024 06:13

wypb dismissed zhenxiao’s stale review via d527a91 July 9, 2024 08:35

wypb force-pushed the decimal_vector branch from d527a91 to c8bfc9c Compare July 9, 2024 08:36

elharo requested changes Jul 9, 2024

wypb force-pushed the decimal_vector branch from 361e069 to f49648b Compare July 10, 2024 02:11

wypb force-pushed the decimal_vector branch from 9ac7e19 to a3ab580 Compare July 10, 2024 02:17

elharo approved these changes Jul 10, 2024

Add support for decimal batch reader.

cabc10c

wypb force-pushed the decimal_vector branch from a2ec2e1 to cabc10c Compare July 11, 2024 01:31

yingsu00 approved these changes Jul 12, 2024

yingsu00 merged commit d76c2d3 into prestodb:master Jul 12, 2024
56 checks passed

wypb deleted the decimal_vector branch July 12, 2024 22:55

imjalpreet mentioned this pull request Jul 22, 2024

Iceberg read failing for Decimal type #23274

Closed

wypb mentioned this pull request Jul 26, 2024

Fix Iceberg read failing for Decimal type #23305

Merged

6 tasks

tdcmeehan mentioned this pull request Aug 23, 2024

Add release notes for 0.289 #23513

Merged

34 tasks
