Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replace stdlib CSV reader with simpler detector #1

Merged
merged 1 commit into from
Jul 10, 2024

Conversation

wagoodman
Copy link

@wagoodman wagoodman commented Jul 10, 2024

There is evidence that using the stdlib csv reader can be resource intensive from a memory perspective:

We're seeing evidence of this in stereoscope:

Screenshot 2024-07-10 at 12 28 47 PM

Since we are not in need of the full CSV reader functionality, this PR drops usage of the CSV reader and adds a CSV detector in its place. This yields a drastic performance improvement memory-wise:

Screenshot 2024-07-10 at 12 29 38 PM

@wagoodman wagoodman force-pushed the improve-csv-memory-perf branch 2 times, most recently from 8efc58a to 8f71d34 Compare July 10, 2024 16:33
@wagoodman wagoodman marked this pull request as draft July 10, 2024 16:35
Signed-off-by: Alex Goodman <wagoodman@users.noreply.github.com>
@wagoodman wagoodman marked this pull request as ready for review July 10, 2024 16:55
@wagoodman wagoodman merged commit f966690 into master Jul 10, 2024
6 checks passed
@wagoodman wagoodman deleted the improve-csv-memory-perf branch July 10, 2024 16:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant