Fix Aggregation Type Promotion: Ensure Unsigned Input Types Result in Unsigned Output for Sum and Multiply #14679

SurajAralihalli · 2023-12-28T23:14:31Z

Description

During aggregation, output types are modified to prevent overflow. Presently, summing INT32 yields INT64, but summing UINT32 still results in INT64 instead of UINT64. This pull request resolves #10149 to ensure the correct output type is used when summing or multiplying integers.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

Signed-off-by: Suraj Aralihalli <suraj.ara16@gmail.com>

copy-pr-bot · 2023-12-28T23:14:36Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

karthikeyann · 2024-01-02T13:37:25Z

@SurajAralihalli The groupby tests already tests this. Since the tests use cudf::detail::target_type_t, this was not caught.
Suggestion: Add static assert with std::is_unsigned_v<Source> in

cudf/cpp/tests/groupby/sum_tests.cpp

Line 38 in 580ee40

TYPED_TEST(groupby_sum_test, basic)

Perhaps, adding a separate test for type checking to cover all cases, could be considered too.

karthikeyann · 2024-01-02T13:38:57Z

/ok to test

Signed-off-by: Suraj Aralihalli <suraj.ara16@gmail.com>

SurajAralihalli · 2024-01-03T23:42:41Z

@SurajAralihalli The groupby tests already tests this. Since the tests use cudf::detail::target_type_t, this was not caught. Suggestion: Add static assert with std::is_unsigned_v<Source> in

cudf/cpp/tests/groupby/sum_tests.cpp

Line 38 in 580ee40

TYPED_TEST(groupby_sum_test, basic)

Perhaps, adding a separate test for type checking to cover all cases, could be considered too.

Thanks @karthikeyann, the groupby_sum_test doesn't include any unsigned types. I have added uint16_t and uint64_t in the supported_types to test.

karthikeyann · 2024-01-04T08:36:22Z

/ok to test

karthikeyann · 2024-01-10T01:11:13Z

/ok to test

karthikeyann · 2024-01-10T01:13:30Z

https://docs.github.com/en/issues/tracking-your-work-with-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword
Recommend using keyword like closes or fixes or resolves in description

SurajAralihalli · 2024-01-11T01:40:32Z

https://docs.github.com/en/issues/tracking-your-work-with-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword
Recommend using keyword like closes or fixes or resolves in description

Thanks @karthikeyann, I'll keep that in mind!

ttnghia · 2024-01-17T03:37:25Z

/ok to test

nvdbaranec · 2024-01-19T22:14:29Z

I don't see any specific handling here for int32/uint32. Is that handled in another way?

SurajAralihalli · 2024-01-19T22:44:19Z

I don't see any specific handling here for int32/uint32. Is that handled in another way?

For computing target_type_t? The target type is always upscaled to int64/uint64 for integers of any type.

karthikeyann · 2024-01-23T08:14:27Z

/merge

…esult in Unsigned Output for Sum and Multiply (rapidsai#14679)" This reverts commit a39897c.

This pull request reverses the modifications made to the sum/product aggregation target type, ensuring it always produces int64. The changes implemented by PR [14679](#14679) which led to degraded performance when the aggregation column had an unsigned type, are reverted. Additional details can be found in the issue [14886](#14886). Authors: - Suraj Aralihalli (https://github.com/SurajAralihalli) Approvers: - David Wendt (https://github.com/davidwendt) - Nghia Truong (https://github.com/ttnghia) - Karthikeyan (https://github.com/karthikeyann)

…esult in Unsigned Output for Sum and Multiply (rapidsai#14679)" This reverts commit a39897c.

SurajAralihalli added 2 commits December 28, 2023 13:15

fix type

dcfabed

Signed-off-by: Suraj Aralihalli <suraj.ara16@gmail.com>

fix type

c8b4e13

Signed-off-by: Suraj Aralihalli <suraj.ara16@gmail.com>

github-actions bot added the libcudf Affects libcudf (C++/CUDA) code. label Dec 28, 2023

karthikeyann added bug Something isn't working non-breaking Non-breaking change labels Jan 2, 2024

update tests

2537d10

Signed-off-by: Suraj Aralihalli <suraj.ara16@gmail.com>

SurajAralihalli marked this pull request as ready for review January 4, 2024 16:22

SurajAralihalli requested a review from a team as a code owner January 4, 2024 16:22

SurajAralihalli requested review from shrshi and nvdbaranec January 4, 2024 16:22

Merge branch 'branch-24.02' into fix_agg_type

1e47850

ttnghia approved these changes Jan 17, 2024

View reviewed changes

Merge branch 'branch-24.02' into fix_agg_type

c4fda37

shrshi approved these changes Jan 19, 2024

View reviewed changes

karthikeyann approved these changes Jan 23, 2024

View reviewed changes

rapids-bot bot merged commit a39897c into rapidsai:branch-24.02 Jan 23, 2024
73 checks passed

This was referenced Jan 24, 2024

[BUG] hash aggregate test failures due to type conversion errors NVIDIA/spark-rapids#10264

Closed

Update to libcudf unsigned sum aggregation types change NVIDIA/spark-rapids#10267

Merged

abellina mentioned this pull request Jan 25, 2024

[BUG] Performance regression in cuDF after #14679 #14886

Closed

abellina added a commit to abellina/cudf that referenced this pull request Jan 26, 2024

Revert "Fix Aggregation Type Promotion: Ensure Unsigned Input Types R…

663814a

…esult in Unsigned Output for Sum and Multiply (rapidsai#14679)" This reverts commit a39897c.

SurajAralihalli mentioned this pull request Jan 26, 2024

Revert sum/product aggregation to always produce int64_t type #14907

Merged

3 tasks

abellina added a commit to abellina/cudf that referenced this pull request Jan 31, 2024

Revert "Fix Aggregation Type Promotion: Ensure Unsigned Input Types R…

90819c8

…esult in Unsigned Output for Sum and Multiply (rapidsai#14679)" This reverts commit a39897c.

GregoryKimball mentioned this pull request Feb 1, 2024

[BUG] Sum and multiply aggregations promote unsigned input types to a signed output #10149

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Aggregation Type Promotion: Ensure Unsigned Input Types Result in Unsigned Output for Sum and Multiply #14679

Fix Aggregation Type Promotion: Ensure Unsigned Input Types Result in Unsigned Output for Sum and Multiply #14679

SurajAralihalli commented Dec 28, 2023 •

edited

Loading

copy-pr-bot bot commented Dec 28, 2023

karthikeyann commented Jan 2, 2024

karthikeyann commented Jan 2, 2024

SurajAralihalli commented Jan 3, 2024

karthikeyann commented Jan 4, 2024

karthikeyann commented Jan 10, 2024

karthikeyann commented Jan 10, 2024 •

edited

Loading

SurajAralihalli commented Jan 11, 2024

ttnghia commented Jan 17, 2024

nvdbaranec commented Jan 19, 2024

SurajAralihalli commented Jan 19, 2024 •

edited

Loading

karthikeyann commented Jan 23, 2024

Fix Aggregation Type Promotion: Ensure Unsigned Input Types Result in Unsigned Output for Sum and Multiply #14679

Fix Aggregation Type Promotion: Ensure Unsigned Input Types Result in Unsigned Output for Sum and Multiply #14679

Conversation

SurajAralihalli commented Dec 28, 2023 • edited Loading

Description

Checklist

copy-pr-bot bot commented Dec 28, 2023

karthikeyann commented Jan 2, 2024

karthikeyann commented Jan 2, 2024

SurajAralihalli commented Jan 3, 2024

karthikeyann commented Jan 4, 2024

karthikeyann commented Jan 10, 2024

karthikeyann commented Jan 10, 2024 • edited Loading

SurajAralihalli commented Jan 11, 2024

ttnghia commented Jan 17, 2024

nvdbaranec commented Jan 19, 2024

SurajAralihalli commented Jan 19, 2024 • edited Loading

karthikeyann commented Jan 23, 2024

SurajAralihalli commented Dec 28, 2023 •

edited

Loading

karthikeyann commented Jan 10, 2024 •

edited

Loading

SurajAralihalli commented Jan 19, 2024 •

edited

Loading