perf(array): optimize performance of literal expression #6877

wangrunji0408 · 2022-12-13T04:00:37Z

I hereby agree to the terms of the Singularity Data, Inc. Contributor License Agreement.

What's changed and what's your intention?

introduce Array::raw_iter to efficiently iterate over raw values regardless of null.
introduce ArrayBuilder::append_n to efficiently append the same value multiple times.
remove the head from Bitmap

With these optimizations, the evaluation time of literal expressions dropped from 2.2us to 150ns.

This PR also adds an optimized version of i32 + i32 using raw iterator in the bench, reducing time from 3.3us to 350ns.

Checklist

I have written necessary rustdoc comments
I have added necessary unit tests and integration tests
All checks passed in ./risedev check (or alias, ./risedev c)

Refer to a related PR or issue link (optional)

#6868

Signed-off-by: Runji Wang <wangrunji0408@163.com>

BugenZhao

🚀

src/common/src/array/iterator.rs

src/common/src/array/bytes_array.rs

src/common/src/array/primitive_array.rs

src/common/src/array/bytes_array.rs

BowenXiao1999

Great Job, LGTM!

How should use the append_n in executor (e.g. build a chunk) or is it only for bench? Usually we do not how many values are the same before.
I saw perf(expr): add initial microbenchmark for expressions #6856 has mentioned some potential improvements. Are all kinds of improvements has been includeded in this pr or there is more?

Signed-off-by: Runji Wang <wangrunji0408@163.com>

st1page · 2022-12-13T05:23:06Z

3.3us to 350us

350ns?

wangrunji0408 · 2022-12-13T08:13:09Z

3.3us to 350us

350ns?

Sure. Thanks for pointing it out. 🥵

wangrunji0408 · 2022-12-13T09:27:25Z

How should use the append_n in executor (e.g. build a chunk) or is it only for bench? Usually we do not how many values are the same before.

It is a method in ArrayBuilder trait. There's also a new method append_datum_n in ArrayBuilderImpl. So they should be generally available. But I think the most use case would still be evaluating literal expressions.

I saw #6856 has mentioned some potential improvements. Are all kinds of improvements has been includeded in this pr or there is more?

Yes, that's all.

Signed-off-by: Runji Wang <wangrunji0408@163.com>

src/expr/benches/expr.rs

jon-chuang · 2022-12-13T10:44:14Z

Although we demonstrated that we can improve the i32 add performance possibly by vectorizing each of the steps (null check, arithmetic and check overflow), can I ask whether any benchmark demonstrates improvement due to the use of append_n?

I guess the intention is simply to provide an additional method that we can benchmark the usage of in the future?

Signed-off-by: Runji Wang <wangrunji0408@163.com>

wangrunji0408 · 2022-12-13T11:06:59Z

Although we demonstrated that we can improve the i32 add performance possibly by vectorizing each of the steps (null check, arithmetic and check overflow), can I ask whether any benchmark demonstrates improvement due to the use of append_n?

Sure. The bench "expr/constant" (literal expression). Previously it could only append the same value one by one. Now it can append them all at once.

codecov · 2022-12-13T11:22:33Z

Codecov Report

Merging #6877 (6d8c3a2) into main (c188e5f) will decrease coverage by 0.01%.
The diff coverage is 71.08%.

@@            Coverage Diff             @@
##             main    #6877      +/-   ##
==========================================
- Coverage   73.16%   73.15%   -0.02%     
==========================================
  Files        1033     1033              
  Lines      164925   165019      +94     
==========================================
+ Hits       120665   120714      +49     
- Misses      44260    44305      +45

Flag	Coverage Δ
rust	`73.15% <71.08%> (-0.02%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
src/common/src/array/iterator.rs	`79.45% <0.00%> (-18.86%)`	⬇️
src/expr/src/expr/expr_binary_nonnull.rs	`62.60% <ø> (ø)`
src/common/src/array/utf8_array.rs	`90.76% <22.22%> (-2.52%)`	⬇️
src/common/src/array/struct_array.rs	`87.38% <53.84%> (-0.99%)`	⬇️
src/common/src/array/bool_array.rs	`90.40% <62.50%> (-2.69%)`	⬇️
src/common/src/array/primitive_array.rs	`87.00% <64.70%> (-2.48%)`	⬇️
src/common/src/array/list_array.rs	`91.43% <70.00%> (-0.71%)`	⬇️
src/common/src/array/bytes_array.rs	`84.37% <86.95%> (-0.71%)`	⬇️
src/common/src/array/mod.rs	`73.52% <100.00%> (+0.50%)`	⬆️
src/common/src/buffer/bitmap.rs	`95.94% <100.00%> (+0.12%)`	⬆️
... and 6 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

BugenZhao

LGTM!!!

src/common/src/array/primitive_array.rs

src/expr/benches/expr.rs

Signed-off-by: Runji Wang <wangrunji0408@163.com>

wangrunji0408 added 4 commits December 13, 2022 11:39

array: get raw value and add raw iter

5dbba1e

Signed-off-by: Runji Wang <wangrunji0408@163.com>

add an optimal implementation for add expr

995e3c0

Signed-off-by: Runji Wang <wangrunji0408@163.com>

optimize literal expr evaluation by append_n

3fa2d95

Signed-off-by: Runji Wang <wangrunji0408@163.com>

impl append_n for all arrays

0edc9ac

Signed-off-by: Runji Wang <wangrunji0408@163.com>

wangrunji0408 requested review from TennyZhuang, BugenZhao and BowenXiao1999 December 13, 2022 04:00

github-actions bot added the type/perf label Dec 13, 2022

wangrunji0408 mentioned this pull request Dec 13, 2022

Tracking: Optimize the performance of expressions #6868

Closed

BugenZhao reviewed Dec 13, 2022

View reviewed changes

src/common/src/array/iterator.rs Outdated Show resolved Hide resolved

src/common/src/array/bytes_array.rs Outdated Show resolved Hide resolved

src/common/src/array/primitive_array.rs Show resolved Hide resolved

src/common/src/array/bytes_array.rs Show resolved Hide resolved

BowenXiao1999 approved these changes Dec 13, 2022

View reviewed changes

fix Bitmap::append_n

610c095

Signed-off-by: Runji Wang <wangrunji0408@163.com>

introduce raw_value_at_unchecked

0b438bf

Signed-off-by: Runji Wang <wangrunji0408@163.com>

jon-chuang reviewed Dec 13, 2022

View reviewed changes

src/expr/benches/expr.rs Show resolved Hide resolved

fix bitmap. remove tailing ones

71a37ef

Signed-off-by: Runji Wang <wangrunji0408@163.com>

wangrunji0408 requested a review from BugenZhao December 13, 2022 11:02

BugenZhao approved these changes Dec 13, 2022

View reviewed changes

src/common/src/array/primitive_array.rs Outdated Show resolved Hide resolved

src/expr/benches/expr.rs Show resolved Hide resolved

fix unchecked

2fbacc0

Signed-off-by: Runji Wang <wangrunji0408@163.com>

wangrunji0408 added the mergify/can-merge label Dec 13, 2022

Merge branch 'main' into wrj/expr-bench

6d8c3a2

mergify bot merged commit 00741be into main Dec 13, 2022

mergify bot deleted the wrj/expr-bench branch December 13, 2022 13:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(array): optimize performance of literal expression #6877

perf(array): optimize performance of literal expression #6877

wangrunji0408 commented Dec 13, 2022 •

edited

Loading

BugenZhao left a comment

BowenXiao1999 left a comment •

edited

Loading

st1page commented Dec 13, 2022

wangrunji0408 commented Dec 13, 2022

wangrunji0408 commented Dec 13, 2022

jon-chuang commented Dec 13, 2022

wangrunji0408 commented Dec 13, 2022

codecov bot commented Dec 13, 2022 •

edited

Loading

BugenZhao left a comment

perf(array): optimize performance of literal expression #6877

perf(array): optimize performance of literal expression #6877

Conversation

wangrunji0408 commented Dec 13, 2022 • edited Loading

What's changed and what's your intention?

Checklist

Refer to a related PR or issue link (optional)

BugenZhao left a comment

Choose a reason for hiding this comment

BowenXiao1999 left a comment • edited Loading

Choose a reason for hiding this comment

st1page commented Dec 13, 2022

wangrunji0408 commented Dec 13, 2022

wangrunji0408 commented Dec 13, 2022

jon-chuang commented Dec 13, 2022

wangrunji0408 commented Dec 13, 2022

codecov bot commented Dec 13, 2022 • edited Loading

Codecov Report

BugenZhao left a comment

Choose a reason for hiding this comment

wangrunji0408 commented Dec 13, 2022 •

edited

Loading

BowenXiao1999 left a comment •

edited

Loading

codecov bot commented Dec 13, 2022 •

edited

Loading