[SPARK-13255][SQL] Update vectorized reader to directly return ColumnarBatch instead of InternalRows. #11435
Closed
Commits on Mar 4, 2016
[SPARK-13255][SQL] Update vectorized reader to directly return ColumnarBatch instead of InternalRows.

Currently, the parquet reader returns rows one by one which is bad for performance. This patch updates the reader to directly return ColumnarBatches. This is only enabled with whole stage codegen, which is the only operator currently that is able to consume ColumnarBatches (instead of rows). The current implementation is a bit of a hack to get this to work and we should do more refactoring of these low level interfaces to make this work better.

Results:
TPCDS:                             Best/Avg Time(ms)    Rate(M/s)   Per Row(ns)
---------------------------------------------------------------------------------
q55 (before)                             8897 / 9265         12.9          77.2
q55                                      5486 / 5753         21.0          47.6
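
The difference between a reader that returns rows one by one and one that returns whole batches of column data can be illustrated with a toy sketch. This is not Spark code and does not use the ColumnarBatch or InternalRow APIs; the data, function names, and batch size below are all hypothetical and chosen only to show the two consumption patterns side by side.

```python
from typing import Iterator, List, Tuple

# Hypothetical toy data stored column-wise, the way a columnar file format lays it out.
ids: List[int] = list(range(10))
prices: List[float] = [i * 1.5 for i in range(10)]


def read_rows() -> Iterator[Tuple[int, float]]:
    """Row-at-a-time reader: assembles one (id, price) tuple per row."""
    for i in range(len(ids)):
        yield ids[i], prices[i]


def read_batches(batch_size: int) -> Iterator[Tuple[List[int], List[float]]]:
    """Batch reader: returns column slices covering up to batch_size rows at once."""
    for start in range(0, len(ids), batch_size):
        end = start + batch_size
        yield ids[start:end], prices[start:end]


def sum_prices_by_row() -> float:
    """Consumer that pays the per-row handoff cost for every value."""
    total = 0.0
    for _id, price in read_rows():
        total += price
    return total


def sum_prices_by_batch(batch_size: int = 4) -> float:
    """Consumer that works on a whole column slice per handoff."""
    total = 0.0
    for _id_col, price_col in read_batches(batch_size):
        total += sum(price_col)
    return total
```

Both consumers compute the same result; the batch version simply crosses the reader/consumer boundary once per batch instead of once per row, which is the kind of overhead the commit message describes as bad for performance.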
Commits in this pull request (short SHAs):
- 59dec91
- 058556c
- 2330576
- 42875ac
- cab64e5
- f35394c
- 3450313
- f5f1e2b
- ed79eee
- 48102e3