GH-41562: [C++][Parquet] Decoding: Fix num_value handling in ByteStreamSplitDecoder #41565
cpp/src/parquet/encoding.h
// In current implementations, `num_values` is the `num_values` field in the
// data page header, which may greater than the number of values in the data
// buffer.
Suggested change:
- // In current implementations, `num_values` is the `num_values` field in the
- // data page header, which may greater than the number of values in the data
- // buffer.
+ // `num_values` comes from the data page header, and may be greater than the number of
+ // physical values in the data buffer if there are some omitted (null) values.
+ // `len`, on the other hand, is the size in bytes of the data buffer and
+ // directly relates to the number of physical values.
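To make the relationship between `num_values` and `len` in the comment above concrete, here is a minimal C++ sketch. It is not the code from this PR: the helper name `PhysicalValueCount` and the `byte_width` parameter are hypothetical, and it assumes a fixed-width type in which every physical value occupies exactly `byte_width` bytes. The idea is to treat the header count as an upper bound and derive the real count from the buffer size.

```cpp
#include <cassert>
#include <cstdint>
#include <stdexcept>
#include <string>

// Hypothetical helper, not part of the Arrow/Parquet API.
//   num_values: count from the data page header (may include nulls)
//   len:        size in bytes of the data buffer
//   byte_width: size in bytes of one physical value (assumed fixed)
// Returns the number of physical values actually stored in the buffer.
inline int PhysicalValueCount(int num_values, int len, int byte_width) {
  assert(byte_width > 0);
  if (num_values < 0 || len < 0) {
    throw std::invalid_argument("num_values and len must be non-negative");
  }
  // The header count is an upper bound: the buffer may hold fewer values
  // (nulls are omitted), but it must never hold more.
  if (static_cast<int64_t>(num_values) * byte_width < len) {
    throw std::invalid_argument("buffer of " + std::to_string(len) +
                                " bytes holds more values than the header count " +
                                std::to_string(num_values));
  }
  // The buffer must contain a whole number of values.
  if (len % byte_width != 0) {
    throw std::invalid_argument("buffer size " + std::to_string(len) +
                                " is not a multiple of byte width " +
                                std::to_string(byte_width));
  }
  return len / byte_width;
}
```

For example, a page whose header reports 10 values but whose buffer holds 32 bytes of 4-byte floats contains 8 physical values; the other 2 are nulls that were never written to the buffer.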
@github-actions crossbow submit -g cpp
Revision: 7daee1c Submitted crossbow builds: ursacomputing/crossbow @ actions-655ed96d55
Thanks a lot for fixing this, @mapleFU.
CI failures seem unrelated.
After merging your PR, Conbench analyzed the 7 benchmarking runs that have been run so far on merge-commit 9cf0ee7. There were no benchmark performance regressions. 🎉 The full Conbench report has more details. It also includes information about 8 possible false positives for unstable benchmarks that are known to sometimes produce them.
…amSplitDecoder (#41565)
### Rationale for this change
This problem is raised from #40094 . Original bug fixed here: #34140 , but this is corrupt in #40094 .
### What changes are included in this PR?
Refine checking
### Are these changes tested?
* [x] Will add
### Are there any user-facing changes?
Bugfix
* GitHub Issue: #41562
Authored-by: mwish <maplewish117@gmail.com>
Signed-off-by: Antoine Pitrou <antoine@python.org>
Rationale for this change
This problem was introduced by #40094. The original bug was fixed in #34140, but that fix was broken again by #40094.
What changes are included in this PR?
Refine the `num_values` checking in ByteStreamSplitDecoder.
Are these changes tested?
Tests will be added.
Are there any user-facing changes?
Bugfix