feature: optional pandas and polars support #467
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fixes #394.
I recently ran into an issue where my pipipegraph2 failed to to recalculate nodes downstream of a changed output because deepdiff assigned the same hash to different DataFrames.
Turns out, it was essentially only hashing the column names.
This PR fixes that for pandas, and while I had it open, for polars as well.
The code paths are optional on a successful pandas/polars import.
The added tests of course require pandas and polars. I tried for both with the older versions I listed in requirements-dev.txt and the current versions
I observe 3 failing & 3 error test cases here locally,
but they also failed before I touched the code, so I'll blame them on my local venv.