Skip to content

How to efficiently process extremely large delta table #2711

Answered by WhiteWolf1776
jiaw314 asked this question in Q&A
Discussion options

You must be logged in to vote

If daily, and if possible, partition the data daily executing compact a few times through the day. Then filter on that partition when you read the delta table.

Anything more efficient would require a lot more code.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by rtyler
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants