Remove legacy Arrow interop APIs #16590

vyasr · 2024-08-17T01:36:38Z

Description

Contributes to #15193.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

KyleFromNVIDIA

Approved CMake changes

revans2

I'm not 100% sure what is happening, but we are getting failures trying to use this in Spark when we write data to python processes for python UDF processing.

E                   Caused by: ai.rapids.cudf.CudfException: writer failed to write table with the following error: Invalid: Tried to write record batch with different schema
E                       at ai.rapids.cudf.Table.writeArrowIPCArrowChunk(Native Method)

java/src/main/native/src/ColumnVectorJni.cpp

revans2 · 2024-08-20T16:14:59Z

I did some more debugging and the issue appears to happen when after writing batches with non-null values out, we then try to write a batch with only null values in it.

DEBUG ARROW WRITE Table{columns=[ColumnVector{rows=4, type=INT8, nullCount=Optional.empty, offHeap=(ID: 283 7ebea0322ac0)}], cudfTable=139357196914992, rows=4}
GPU COLUMN 0 - NC: 0 DATA: DeviceMemoryBufferView{address=0x40a006ec0, length=4, id=-1} VAL: DeviceMemoryBufferView{address=0x40a006e80, length=64, id=-1}
COLUMN 0 - INT8
0 48
1 -68
2 -107
3 6
...
DEBUG ARROW WRITE Table{columns=[ColumnVector{rows=1, type=INT8, nullCount=Optional.empty, offHeap=(ID: 464 7ebea038cab0)}], cudfTable=139357196742224, rows=1}
GPU COLUMN 0 - NC: 0 DATA: DeviceMemoryBufferView{address=0x40a00d2c0, length=1, id=-1} VAL: DeviceMemoryBufferView{address=0x40a00d280, length=64, id=-1}
COLUMN 0 - INT8
0 88
DEBUG ARROW WRITE Table{columns=[ColumnVector{rows=1, type=INT8, nullCount=Optional.empty, offHeap=(ID: 474 7ebea038cbf0)}], cudfTable=139357196750432, rows=1}
GPU COLUMN 0 - NC: 1 DATA: DeviceMemoryBufferView{address=0x40a00d4c0, length=1, id=-1} VAL: DeviceMemoryBufferView{address=0x40a00d480, length=64, id=-1}
COLUMN 0 - INT8
0 NULL

That is when the error happens. We end up doing this somewhat regularly when we do groupby aggregations using python UDFS. Spark has a kind of crappy way of doing UDFS in python where each grouping key gets a separate arrow batch written out for it. This happens when we generate a group by where all of the values we are computing the agg for ended up being null (in this case it is a single row).

I think the problem is that the schema translation either expects to never see nulls if it didn't see it in the first batch, or that it has some kind of issue with the batch being all nulls. Not really sure. All I know is the schema discovery is not happy with this situation.

vyasr · 2024-08-20T16:37:41Z

OK got it, thanks for tracking this case down. If you have time to write a minimal C++ repro that would be amazing, but no worries if not; I will try and create one myself by EOD today.

vyasr · 2024-08-20T23:25:28Z

OK, I pushed a fix. We've had a few issues crop up around the new API around the nullability of columns when converting from cudf if the cudf column has no nulls. We may want to rethink defaults, but when I tried fiddling with them here I kept finding edge cases in other scenarios where things broke, so I suspect that the defaults in other libraries like arrow are not entirely consistent. For now, I've just patched the implementation for the JNI since we presumably want to rewrite this to avoid libarrow altogether. I'll open a new issue to discuss the default behavior, but I think this solution ought to be OK for now.

vyasr · 2024-08-20T23:37:04Z

Opened #16621 for further discussion.

revans2 · 2024-08-21T20:53:24Z

I just tested and I am still getting the same error. I'll do some more debugging and see what I can come up with for this.

java/src/main/native/src/TableJni.cpp

revans2

That did it

vyasr · 2024-08-21T21:39:15Z

Thanks for the quick turnaround on testing!

…e_legacy_interop

cpp/tests/interop/arrow_utils.hpp

PointKernel

A small question otherwise looks good to me.

cpp/tests/interop/arrow_utils.hpp

vyasr · 2024-08-22T03:42:46Z

/merge

I noticed from #16590 (comment) that there was one other file where `#pragma once` was not at the top. This PR fixes that. Authors: - Bradley Dice (https://github.com/bdice) Approvers: - Vyas Ramasubramani (https://github.com/vyasr) URL: #16636

This reverts commit 6c4905d.

vyasr added 4 commits August 17, 2024 01:22

Remove legacy functions

cc9c689

Remove arrow_allocator

8d5ffd1

Remove includes

9c6ddc2

Remove a couple more includes

3cb81e2

vyasr added improvement Improvement / enhancement to an existing function breaking Breaking change labels Aug 17, 2024

vyasr self-assigned this Aug 17, 2024

github-actions bot added libcudf Affects libcudf (C++/CUDA) code. CMake CMake build issue labels Aug 17, 2024

Temp patch for JNI

7dce595

github-actions bot added the Java Affects Java cuDF API. label Aug 19, 2024

Minimal changes to get JNI building

5bf9801

vyasr mentioned this pull request Aug 19, 2024

Remove arrow_io_source #16607

Merged

3 tasks

vyasr marked this pull request as ready for review August 19, 2024 21:48

vyasr requested review from a team as code owners August 19, 2024 21:48

vyasr requested review from shrshi and srinivasyadav18 August 19, 2024 21:48

KyleFromNVIDIA approved these changes Aug 19, 2024

View reviewed changes

revans2 requested changes Aug 20, 2024

View reviewed changes

java/src/main/native/src/ColumnVectorJni.cpp Show resolved Hide resolved

Add a relevant test and fix the issue for the JNI

4676684

vyasr mentioned this pull request Aug 20, 2024

[FEA] Consider default nullability setting when converting cudf data to arrow C Data interface #16621

Open

vyasr requested a review from revans2 August 20, 2024 23:37

vyasr added 2 commits August 21, 2024 00:19

Simplify test to avoid dependence on temporary file generation

f6ad811

Move the test

d0e4762

revans2 reviewed Aug 21, 2024

View reviewed changes

java/src/main/native/src/TableJni.cpp Outdated Show resolved Hide resolved

vyasr added 2 commits August 21, 2024 21:12

Set nullability recursively

5762b38

Doc and simplify

8970bfd

revans2 approved these changes Aug 21, 2024

View reviewed changes

vyasr added 2 commits August 21, 2024 21:40

Remove incomplete test

f52ee0e

Merge remote-tracking branch 'upstream/branch-24.10' into chore/remov…

ccc00fb

…e_legacy_interop

vyasr mentioned this pull request Aug 21, 2024

Rebuild for & Support NumPy 2 #16300

Merged

3 tasks

davidwendt reviewed Aug 21, 2024

View reviewed changes

cpp/tests/interop/arrow_utils.hpp Outdated Show resolved Hide resolved

bdice approved these changes Aug 21, 2024

View reviewed changes

Move pragma

083c46f

davidwendt approved these changes Aug 21, 2024

View reviewed changes

PointKernel approved these changes Aug 21, 2024

View reviewed changes

cpp/tests/interop/arrow_utils.hpp Outdated Show resolved Hide resolved

Remove duplicate definition of max_precision

c478d0c

bdice mentioned this pull request Aug 21, 2024

Move pragma once in rolling/jit/operation.hpp. #16636

Merged

3 tasks

raydouglass removed the request for review from srinivasyadav18 August 21, 2024 22:46

rapids-bot bot merged commit 6c4905d into rapidsai:branch-24.10 Aug 22, 2024
79 checks passed

vyasr deleted the chore/remove_legacy_interop branch August 22, 2024 04:13

mythrocks mentioned this pull request Sep 11, 2024

[BUG] [Java] CudfException on conversion of data between Arrow and Cudf #16794

Closed

vyasr added a commit to vyasr/cudf that referenced this pull request Oct 14, 2024

Revert "Remove legacy Arrow interop APIs (rapidsai#16590)"

c87fecf

This reverts commit 6c4905d.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove legacy Arrow interop APIs #16590

Remove legacy Arrow interop APIs #16590

vyasr commented Aug 17, 2024

KyleFromNVIDIA left a comment

revans2 left a comment

revans2 commented Aug 20, 2024

vyasr commented Aug 20, 2024

vyasr commented Aug 20, 2024

vyasr commented Aug 20, 2024

revans2 commented Aug 21, 2024

revans2 left a comment

vyasr commented Aug 21, 2024

PointKernel left a comment

vyasr commented Aug 22, 2024

Remove legacy Arrow interop APIs #16590

Remove legacy Arrow interop APIs #16590

Conversation

vyasr commented Aug 17, 2024

Description

Checklist

KyleFromNVIDIA left a comment

Choose a reason for hiding this comment

revans2 left a comment

Choose a reason for hiding this comment

revans2 commented Aug 20, 2024

vyasr commented Aug 20, 2024

vyasr commented Aug 20, 2024

vyasr commented Aug 20, 2024

revans2 commented Aug 21, 2024

revans2 left a comment

Choose a reason for hiding this comment

vyasr commented Aug 21, 2024

PointKernel left a comment

Choose a reason for hiding this comment

vyasr commented Aug 22, 2024