Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

coords: retain str dtype #4759

Merged
merged 11 commits into from
Jan 13, 2021
Merged

Conversation

mathause
Copy link
Collaborator

@mathause mathause commented Jan 4, 2021

pd.Index("a") has dtype object. Therefore string coords change their dtype on certain operations - e.g. align, __setitem__ (& assign), IndexVariable.concat. This can be avoided by using the coords instead of the index in some cases but in two instances it was unavoidable to cast a pd.Index back to a np.array. I probably did not catch all of these conversions.

What I am not sure: does this somehow contradict the index refactor?

@mathause mathause changed the title Coords retain str dtype coords: retain str dtype Jan 4, 2021
@mathause mathause mentioned this pull request Jan 4, 2021
8 tasks
@mathause
Copy link
Collaborator Author

mathause commented Jan 9, 2021

This would be ready. I think the complexity is not very high but is it worth it?

Copy link
Collaborator

@max-sixty max-sixty left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great, LGTM!

@mathause mathause merged commit fb67358 into pydata:master Jan 13, 2021
@mathause mathause deleted the coords_retain_str_dtype branch January 13, 2021 17:09
dcherian added a commit to TomNicholas/xarray that referenced this pull request Jan 18, 2021
* upstream/master: (342 commits)
  fix decode for scale/ offset list (pydata#4802)
  Expand user dir paths (~) in open_mfdataset and to_zarr. (pydata#4795)
  add a version info step to the upstream-dev CI (pydata#4815)
  fix the ci trigger action (pydata#4805)
  scatter plot by order of the first appearance of hue (pydata#4723)
  don't skip the scheduled CI (pydata#4806)
  coords: retain str dtype (pydata#4759)
  Fix interval labels with units (pydata#4794)
  Always force dask arrays to float in missing.interp_func (pydata#4771)
  Print number of variables in repr (pydata#4762)
  install conda as a library in the minimum dependency check CI (pydata#4792)
  Migrate CI from azure pipelines to GitHub Actions (pydata#4730)
  use conda.api instead of parallel calls to the conda binary (pydata#4775)
  Speed up missing._get_interpolator (pydata#4776)
  Remove special case in guess_engines (pydata#4777)
  improve typing of OrderedSet (pydata#4774)
  CI: ignore some warnings (pydata#4773)
  DOC: update hyperlink for xskillscore (pydata#4778)
  drop support for python 3.6 (pydata#4720)
  Trigger upstream CI on cron schedule (by default) (pydata#4729)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants