perf: Cache path resolving of scan functions #17616
Yeah, got the same issue with hive partitioning. First one who knows what it is shares. ;) (afk atm) I think the mutex is better here as the read time is super short.
Codecov Report
Attention: Patch coverage is
Additional details and impacted files

@@            Coverage Diff             @@
##             main   #17616      +/-   ##
==========================================
- Coverage   80.69%   80.68%   -0.02%
==========================================
  Files        1484     1484
  Lines      195421   195453     +32
  Branches     2782     2782
==========================================
+ Hits       157695   157700      +5
- Misses      37214    37241     +27
  Partials      512      512
} => expand_paths(&lock.0, file_options.glob, cloud_options.as_ref())?,
#[cfg(feature = "json")]
FileScan::NDJson { .. } => expand_paths(&lock.0, file_options.glob, None)?,
FileScan::Anonymous { .. } => lock.0.clone(), // Anonymous scans are already expanded.
- FileScan::Anonymous { .. } => lock.0.clone(), // Anonymous scans are already expanded.
+ FileScan::Anonymous { .. } => unreachable!(), // Invariant: Anonymous scans are already expanded.
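To illustrate the suggestion above, here is a minimal sketch of guarding an "already expanded" invariant with `unreachable!`. It is not the code from this PR: `ScanKind`, `PathState`, `for_scan`, `expand` and `resolve` are hypothetical names, and the sketch assumes that anonymous scans are constructed with the expanded flag already set.

```rust
use std::path::PathBuf;

/// Hypothetical, simplified scan kinds; not the real `FileScan` definition.
pub enum ScanKind {
    NdJson,
    Anonymous,
}

/// Hypothetical path state: the paths plus an "already expanded" flag.
pub struct PathState {
    pub paths: Vec<PathBuf>,
    pub expanded: bool,
}

impl PathState {
    /// Assumption: anonymous scans are constructed with `expanded == true`.
    pub fn for_scan(kind: &ScanKind, paths: Vec<PathBuf>) -> Self {
        let expanded = matches!(kind, ScanKind::Anonymous);
        Self { paths, expanded }
    }
}

/// Placeholder expansion; a real implementation would resolve globs here.
fn expand(paths: &[PathBuf]) -> Vec<PathBuf> {
    paths.to_vec()
}

pub fn resolve(kind: &ScanKind, state: &mut PathState) -> Vec<PathBuf> {
    if !state.expanded {
        state.paths = match kind {
            ScanKind::NdJson => expand(&state.paths),
            // Invariant: anonymous scans start out expanded, so this arm can
            // only run if that invariant has been broken somewhere else.
            ScanKind::Anonymous => unreachable!("anonymous scans are already expanded"),
        };
        state.expanded = true;
    }
    state.paths.clone()
}
```

Under that assumption, `unreachable!` turns a silently tolerated state into a loud panic, whereas cloning the paths would hide a broken invariant.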
Closes #17584
Changes
Cache the expanded paths in the DslPlan after resolving them. This avoids re-resolving the paths upon repeated collect calls on the same LazyFrame.
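The caching idea described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the PR's implementation: `ScanPaths` and `resolve` are hypothetical names, and the `(paths, expanded)` tuple behind a mutex is only inferred from the `lock.0` accesses in the review diff.

```rust
use std::path::PathBuf;
use std::sync::{Arc, Mutex};

/// Hypothetical stand-in for the path field of a scan node. The path list and
/// a flag recording whether it has been expanded live behind one shared mutex.
#[derive(Clone)]
pub struct ScanPaths {
    inner: Arc<Mutex<(Vec<PathBuf>, bool)>>,
}

impl ScanPaths {
    pub fn new(paths: Vec<PathBuf>) -> Self {
        Self {
            inner: Arc::new(Mutex::new((paths, false))),
        }
    }

    /// Returns the expanded paths, running `expand` only on the first call.
    /// Later calls (including calls on clones) reuse the stored result.
    pub fn resolve<F>(&self, expand: F) -> Vec<PathBuf>
    where
        F: FnOnce(&[PathBuf]) -> Vec<PathBuf>,
    {
        let mut lock = self.inner.lock().unwrap();
        if !lock.1 {
            let expanded = expand(&lock.0);
            lock.0 = expanded;
            lock.1 = true;
        }
        lock.0.clone()
    }
}
```

Because clones share the same `Arc`, a plan that is cloned for each collect call would still see the paths expanded by an earlier call. The lock is held only while checking the flag and cloning the list, which is consistent with the comment that a plain mutex suffices when the read time is short.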