[SPARK-5775] [SQL] BugFix: GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table #4792
Test build #28012 has started for PR 4792 at commit
@marmbrus As discussed offline, I tried to use
Can't we do the same check that we do inside of Parquet to see whether the row is all primitives once, and switch on that, instead of doing per-tuple extraction and runtime type checking?
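A minimal sketch of the "check once, then switch" idea suggested above, for readers unfamiliar with the code path. All names here (`PartitionFillSketch`, `MutableRow`, `GenericRow`, `partitionFiller`, the tiny `DataType` hierarchy) are hypothetical stand-ins, not Spark's real classes or the code in this PR. It assumes that an all-primitive schema yields mutable rows with spare slots for partition columns, and that any nested type yields immutable generic rows.

```scala
object PartitionFillSketch {
  // Hypothetical, simplified type hierarchy (not Catalyst's DataType).
  sealed trait DataType
  case object IntType extends DataType
  case object StringType extends DataType
  final case class ArrayType(elementType: DataType) extends DataType

  // Hypothetical row kinds: one mutable and reusable, one immutable.
  sealed trait Row
  final class MutableRow(val values: Array[Any]) extends Row
  final case class GenericRow(values: Vector[Any]) extends Row

  def isPrimitive(dt: DataType): Boolean = dt match {
    case IntType | StringType => true
    case ArrayType(_)         => false
  }

  /** Inspects the schema once and returns the function applied to every row. */
  def partitionFiller(schema: Seq[DataType], partitionValues: Seq[Any]): Row => Row = {
    val dataWidth = schema.length
    if (schema.forall(isPrimitive)) {
      // Assumption: the reader only produces MutableRow for this schema,
      // so the cast is decided by the schema check, not per row.
      row => {
        val mutable = row.asInstanceOf[MutableRow]
        partitionValues.zipWithIndex.foreach { case (v, i) =>
          mutable.values(dataWidth + i) = v
        }
        mutable
      }
    } else {
      // Assumption: nested schemas only produce GenericRow, which cannot be
      // updated in place, so a new row is built with the partition values appended.
      row => {
        val generic = row.asInstanceOf[GenericRow]
        GenericRow(generic.values.take(dataWidth) ++ partitionValues)
      }
    }
  }
}
```

The branch is taken once per scan rather than once per tuple, which is the trade-off the comment asks about; the casts inside each branch are only safe under the stated assumptions about which row kind the reader returns.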
Test build #28012 has finished for PR 4792 at commit
Test PASSed.
Remove runtime pattern matching.
Test build #28039 has started for PR 4792 at commit
Test build #28039 has finished for PR 4792 at commit
Test PASSed.
@@ -126,6 +126,9 @@ private[sql] case class ParquetTableScan(
conf)

if (requestedPartitionOrdinals.nonEmpty) {
// This check if based on CatalystConverter.createRootConverter.
I made a typo here: "if" should be "is".
Test build #28117 has started for PR 4792 at commit
@yhuai Thanks for the review! I've addressed the comments. Will merge this to master and branch-1.3 after Jenkins approves.
Test build #28117 has finished for PR 4792 at commit
Test PASSed.
…leRow when nested data and partitioned table

This PR adapts anselmevignon's #4697 to master and branch-1.3. Please refer to PR description of #4697 for details.

Author: Cheng Lian <lian@databricks.com>
Author: Cheng Lian <liancheng@users.noreply.github.com>
Author: Yin Huai <yhuai@databricks.com>

Closes #4792 from liancheng/spark-5775 and squashes the following commits:

538f506 [Cheng Lian] Addresses comments
cee55cf [Cheng Lian] Merge pull request #4 from yhuai/spark-5775-yin
b0b74fb [Yin Huai] Remove runtime pattern matching.
ca6e038 [Cheng Lian] Fixes SPARK-5775

(cherry picked from commit e6003f0)
Signed-off-by: Cheng Lian <lian@databricks.com>
This PR adapts @anselmevignon's #4697 to master and branch-1.3. Please refer to PR description of #4697 for details.