[FEA] Faster dataframe to cupy conversion when dataframe is a single allocation #12928
Labels
0 - Backlog: In queue waiting for assignment
feature request: New feature or request
Python: Affects Python cuDF API.
When we convert a dataframe to a cupy array, we iterate over each column (as they're independent allocations) and assign each one to a column in an empty matrix. This can be slow when there are thousands or millions of small columns.
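A minimal sketch of the current per-column pattern, using NumPy on the host as a stand-in for cupy (the column names and sizes here are illustrative, not from cuDF internals):

```python
import numpy as np

# Hypothetical stand-in: each "column" is an independent 1-D allocation,
# mirroring how cuDF columns are typically separate device buffers.
nrows, ncols = 3, 4
columns = [np.full(nrows, i, dtype="float64") for i in range(ncols)]

# Conversion as described above: allocate an empty matrix, then copy
# column by column -- one assignment (and one kernel launch, on GPU)
# per column.
out = np.empty((nrows, ncols), dtype="float64")
for i, col in enumerate(columns):
    out[:, i] = col
```

With millions of small columns, the Python-level loop and the per-column copies dominate the total conversion time.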
In a select set of circumstances, all of the columns in a DataFrame may be part of a single, contiguous allocation of memory. One scenario in which this can occur is after a call to transpose. It would be nice if, in this scenario, we didn't need to iterate over every column when converting to a cupy array.
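When the columns do share one contiguous buffer, the conversion could in principle be a single zero-copy reinterpretation rather than a loop. A hedged NumPy sketch of that fast path (buffer layout assumed column-major, i.e. columns stored back to back, which is what a row-to-column transpose would produce; this is an illustration, not the actual cuDF code):

```python
import numpy as np

# Hypothetical single allocation holding all columns back to back.
nrows, ncols = 3, 4
buf = np.arange(nrows * ncols, dtype="float64")

# Fast path: reinterpret the one buffer as a 2-D matrix with a single
# reshape plus transpose -- no per-column loop, no data copies.
mat = buf.reshape(ncols, nrows).T

# The result is a view over the original allocation, not a copy.
assert mat.base is buf
```

The same idea applies on device: cupy's `reshape` on a contiguous array also returns a view, so the whole conversion would cost O(1) instead of O(ncols) kernel launches.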
A real-world example of when this matters is running a dot product after calling transpose. Because of this bottleneck, we're considerably slower than pandas.
If we were to add any special casing here, we'd want to carefully evaluate the performance impact on the general case, since the dataframe-to-cupy codepath is used across the board.