-
Notifications
You must be signed in to change notification settings - Fork 318
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE REQUEST]: Support GroupedMapUdf in Spark-3.0.0 #654
Comments
@eerhardt Would you be able to help with ARROW-6972? I'm not sure if anyone from the Arrow community is already working on it. Looks like we cannot support GroupedMap for Spark 3.0 in .NET without that functionality. |
I should be able to look into it later this week. |
Great, thank you so much in advance! |
Just adding a note that I'll be working on this starting tomorrow or Monday :) I'll update this post as I make progress! |
I'm closing this issue considering it is done. Thank you all! |
Spark has changed the expected contents of an
Arrow RecordBatch
from 2.4 and 3.0. Spark 3.0 expects the results of the GroupedMap to be a column of StructType instead of separating each column from the DataFrame.SPARK-27163
Spark2.4
Spark3.0
To support these changes we need to use Arrow's
StructArray
andStructType
. Unfortunately it is currently unsupported ARROW-6972The text was updated successfully, but these errors were encountered: