-
Notifications
You must be signed in to change notification settings - Fork 28.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-33094][SQL][3.0] Make ORC format propagate Hadoop config from DS options to underlying HDFS file system #29985
Conversation
…tions to underlying HDFS file system Propagate ORC options to Hadoop configs in Hive `OrcFileFormat` and in the regular ORC datasource. There is a bug that when running: ```scala spark.read.format("orc").options(conf).load(path) ``` The underlying file system will not receive the conf options. Yes Added UT to `OrcSourceSuite`. Closes apache#29976 from MaxGekk/orc-option-propagation. Authored-by: Max Gekk <max.gekk@gmail.com> Signed-off-by: Dongjoon Hyun <dhyun@apple.com> (cherry picked from commit c5f6af9) Signed-off-by: Max Gekk <max.gekk@gmail.com>
The changes conflict with |
Kubernetes integration test starting |
Kubernetes integration test status failure |
…DS options to underlying HDFS file system ### What changes were proposed in this pull request? Propagate ORC options to Hadoop configs in Hive `OrcFileFormat` and in the regular ORC datasource. ### Why are the changes needed? There is a bug that when running: ```scala spark.read.format("orc").options(conf).load(path) ``` The underlying file system will not receive the conf options. ### Does this PR introduce _any_ user-facing change? Yes ### How was this patch tested? Added UT to `OrcSourceSuite`. Authored-by: Max Gekk <max.gekkgmail.com> Signed-off-by: Dongjoon Hyun <dhyunapple.com> (cherry picked from commit c5f6af9) Signed-off-by: Max Gekk <max.gekkgmail.com> Closes #29985 from MaxGekk/orc-option-propagation-3.0. Authored-by: Max Gekk <max.gekk@gmail.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>
Merged to branch-3.0. Sure, let's open a PR for branch-2.4 as well. |
…DS options to underlying HDFS file system Propagate ORC options to Hadoop configs in Hive `OrcFileFormat` and in the regular ORC datasource. There is a bug that when running: ```scala spark.read.format("orc").options(conf).load(path) ``` The underlying file system will not receive the conf options. Yes Added UT to `OrcSourceSuite`. Authored-by: Max Gekk <max.gekkgmail.com> Signed-off-by: Dongjoon Hyun <dhyunapple.com> (cherry picked from commit c5f6af9) Signed-off-by: Max Gekk <max.gekkgmail.com> Closes apache#29985 from MaxGekk/orc-option-propagation-3.0. Authored-by: Max Gekk <max.gekk@gmail.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org> (cherry picked from commit 9892b3e) Signed-off-by: Max Gekk <max.gekk@gmail.com>
Here is the backport to 2.4: #29987 |
Test build #129582 has finished for PR 29985 at commit
|
+1, late LGTM. |
…DS options to underlying HDFS file system ### What changes were proposed in this pull request? Propagate ORC options to Hadoop configs in Hive `OrcFileFormat` and in the regular ORC datasource. ### Why are the changes needed? There is a bug that when running: ```scala spark.read.format("orc").options(conf).load(path) ``` The underlying file system will not receive the conf options. ### Does this PR introduce _any_ user-facing change? Yes ### How was this patch tested? Added UT to `OrcSourceSuite`. Authored-by: Max Gekk <max.gekkgmail.com> Signed-off-by: Dongjoon Hyun <dhyunapple.com> (cherry picked from commit c5f6af9) Signed-off-by: Max Gekk <max.gekkgmail.com> Closes apache#29985 from MaxGekk/orc-option-propagation-3.0. Authored-by: Max Gekk <max.gekk@gmail.com> Signed-off-by: HyukjinKwon <gurwls223@apache.org>
What changes were proposed in this pull request?
Propagate ORC options to Hadoop configs in Hive
OrcFileFormat
and in the regular ORC datasource.Why are the changes needed?
There is a bug that when running:
spark.read.format("orc").options(conf).load(path)
The underlying file system will not receive the conf options.
Does this PR introduce any user-facing change?
Yes
How was this patch tested?
Added UT to
OrcSourceSuite
.Authored-by: Max Gekk max.gekk@gmail.com
Signed-off-by: Dongjoon Hyun dhyun@apple.com
(cherry picked from commit c5f6af9)
Signed-off-by: Max Gekk max.gekk@gmail.com