Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

'Encoding DELTA_BYTE_ARRAY is not supported' with parquet arrow readers #708

Closed
ilya-biryukov opened this issue Aug 23, 2021 · 0 comments · Fixed by #709
Closed

'Encoding DELTA_BYTE_ARRAY is not supported' with parquet arrow readers #708

ilya-biryukov opened this issue Aug 23, 2021 · 0 comments · Fixed by #709
Labels

Comments

@ilya-biryukov
Copy link
Contributor

Describe the bug
Reading parquet files with byte arrays encoded using DELTA_BYTE_ARRAY causes the error mentioned from the title.
Corresponding fields must be of UTF8 converted type.

To Reproduce
Steps to reproduce the behavior:

  1. read parquet file with strings encoded as DELTA_BYTE_ARRAY with parquet's arrow readers.

Expected behavior

  1. file reads with no error

Actual: the aforementioned error is produced.

Additional context
We fixed this in our fork, I'll send a PR for review and further discussion soon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant