Fix error in Iceberg predicate pushdown to Parquet files #15408

Merged

alexjo2144
Member

Description

The Parquet reader does not support predicate pushdown on fields of a Row type. The checks in IcebergPageSourceProvider#getParquetTupleDomain used to prevent this, but they stopped working when dereference pushdown was implemented.

If a row field had the same name as a top-level column, this would have resulted in a correctness issue.
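
To illustrate the name collision, here is a minimal, language-neutral sketch in Python. It is not the Trino code: `ColumnHandle`, `buggy_file_predicate`, `fixed_file_predicate`, and `row_group_matches` are hypothetical names, and the assumption that the file-level predicate is keyed by the leaf field name is made only for illustration. The "fixed" variant simply skips any handle that dereferences into a row, which matches the intent described above (no pushdown on row fields).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ColumnHandle:
    """Hypothetical stand-in for a connector column handle."""
    base_name: str    # name of the top-level column
    path: tuple = ()  # dereference path into a row; empty for a top-level column

    @property
    def leaf_name(self):
        # Name of the field that is actually read.
        return self.path[-1] if self.path else self.base_name


def buggy_file_predicate(domains):
    """Assumed failure mode: key the predicate by leaf name only."""
    return {handle.leaf_name: value for handle, value in domains.items()}


def fixed_file_predicate(domains):
    """Drop predicates on row fields; keep only top-level columns."""
    return {h.base_name: v for h, v in domains.items() if not h.path}


def row_group_matches(stats, predicate):
    """Return False if min/max statistics prove no row can match."""
    for column, value in predicate.items():
        if column in stats:
            low, high = stats[column]
            if not low <= value <= high:
                return False
    return True


# Table has a top-level column "id" and a row column "nested" with a field "id".
# Query predicate: nested.id = 5
nested_id = ColumnHandle("nested", ("id",))
domains = {nested_id: 5}

# Row group statistics are only available for the top-level "id" column here.
stats = {"id": (100, 200)}

print(row_group_matches(stats, buggy_file_predicate(domains)))  # False: row group wrongly skipped
print(row_group_matches(stats, fixed_file_predicate(domains)))  # True: row group is read
```

In the buggy variant, the predicate on `nested.id` is checked against the statistics of the unrelated top-level `id` column, so a row group that may contain matching rows is pruned.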

Additional context and related issues

Deref/projection pushdown was implemented in: #8129

Release notes

( ) This is not user-visible or docs only and no release notes are required.
(x) Release notes are required, please propose a release note for me.
( ) Release notes are required, with the following suggested text:

@cla-bot cla-bot bot added the cla-signed label Dec 14, 2022
@alexjo2144 alexjo2144 added correctness bug Something isn't working labels Dec 14, 2022
@alexjo2144 alexjo2144 force-pushed the iceberg/parquet-nested-predicate-pushdown branch 4 times, most recently from f25f034 to d6b26b5 Compare December 15, 2022 22:56
@alexjo2144 alexjo2144 force-pushed the iceberg/parquet-nested-predicate-pushdown branch from d6b26b5 to bd01541 Compare December 16, 2022 02:06
@alexjo2144
Member Author

alexjo2144 commented Dec 16, 2022

Thanks all, cleaned those little things up

@ebyhr ebyhr merged commit e33b3ac into trinodb:master Dec 16, 2022
@github-actions github-actions bot added this to the 404 milestone Dec 16, 2022
@alexjo2144 alexjo2144 deleted the iceberg/parquet-nested-predicate-pushdown branch December 19, 2022 15:10
Labels
bug Something isn't working cla-signed correctness