[RELEASE] dask-cuda v23.06 #1188

Merged
merged 26 commits into from
Jun 7, 2023

Conversation

GPUtester
Contributor

❄️ Code freeze for branch-23.06 and v23.06 release

What does this mean?

Only critical/hotfix level issues should be merged into branch-23.06 until release (merging of this PR).

What is the purpose of this PR?

  • Update documentation
  • Allow testing for the new release
  • Enable a means to merge branch-23.06 into main for the release

raydouglass and others added 23 commits March 23, 2023 14:56
Forward-merge branch-23.04 to branch-23.06
Forward-merge branch-23.04 to branch-23.06
Forward-merge branch-23.04 to branch-23.06
Forward-merge branch-23.04 to branch-23.06
Forward-merge branch-23.04 to branch-23.06
Close #1152.

Authors:
  - Jiaming Yuan (https://github.com/trivialfis)
  - Lawrence Mitchell (https://github.com/wence-)

Approvers:
  - Lawrence Mitchell (https://github.com/wence-)

URL: #1157
This PR updates the runner labels to use ARC V2 self-hosted runners for GPU jobs. This is needed to resolve the auto-scaling issues.

Authors:
  - Jordan Jacobelli (https://github.com/jjacobelli)

Approvers:
  - AJ Schmidt (https://github.com/ajschmidt8)

URL: #1159
The release of zict 3.0 changed some of the resources used in the spilling tests. This PR updates the tests accordingly.

Authors:
  - Peter Andreas Entschev (https://github.com/pentschev)

Approvers:
  - Benjamin Zaitlen (https://github.com/quasiben)
  - AJ Schmidt (https://github.com/ajschmidt8)

URL: #1160
Instead of using `rapids-get-rapids-version-from-git`, we can hardcode the version and use `update-version.sh` to update it.

Authors:
  - Jordan Jacobelli (https://github.com/jjacobelli)

Approvers:
  - AJ Schmidt (https://github.com/ajschmidt8)

URL: #1163
Authors:
  - Ashwin Srinath (https://github.com/shwina)
  - AJ Schmidt (https://github.com/ajschmidt8)

Approvers:
  - AJ Schmidt (https://github.com/ajschmidt8)
  - Charles Blackmon-Luca (https://github.com/charlesbluca)
  - Peter Andreas Entschev (https://github.com/pentschev)

URL: #1164
This PR unblocks RAPIDS CI since many places attempt to install dask-cuda from source. We can undo this change once the rest of RAPIDS has moved to Python 3.9. We will also want to discuss better strategies for handling dask-cuda in CI as part of our ongoing discussions around improving latest dask usage in CI.

Authors:
  - Vyas Ramasubramani (https://github.com/vyasr)

Approvers:
  - GALI PREM SAGAR (https://github.com/galipremsagar)
  - Charles Blackmon-Luca (https://github.com/charlesbluca)

URL: #1166
This should be safe to do now that cuDF nightlies for Python 3.9 are being published.

Authors:
  - Charles Blackmon-Luca (https://github.com/charlesbluca)

Approvers:
  - AJ Schmidt (https://github.com/ajschmidt8)

URL: #1168
Forward-merge branch-23.04 to branch-23.06
We undid the pinning in order to unblock the Python 3.8->3.9 transition in RAPIDS, which is now complete.

Authors:
  - Vyas Ramasubramani (https://github.com/vyasr)

Approvers:
  - Peter Andreas Entschev (https://github.com/pentschev)

URL: #1171
The `rapids-env-update` command now needs a `GH_TOKEN` environment variable in CI due to the changes below:

- rapidsai/gha-tools#53

Similar to: rapidsai/shared-workflows#87

Authors:
   - AJ Schmidt (https://github.com/ajschmidt8)

Approvers:
   - Ray Douglass (https://github.com/raydouglass)
As discussed in #1174, we must work around test failures until Distributed can be unpinned.

Authors:
  - Peter Andreas Entschev (https://github.com/pentschev)

Approvers:
  - Lawrence Mitchell (https://github.com/wence-)

URL: #1175
This PR configures `dask-cuda` docs builds to also run nightly (not only on PR merges).

Authors:
  - Jake Awe (https://github.com/AyodeAwe)

Approvers:
  - AJ Schmidt (https://github.com/ajschmidt8)
  - Peter Andreas Entschev (https://github.com/pentschev)

URL: #1176
Since the `build.yaml` workflow only runs on branch pushes, tag pushes, or nightly calls, it should always upload the wheel to PyPI like it does for conda packages.

This will fix the missing release uploads like this: https://github.com/rapidsai/dask-cuda/actions/runs/4678841210/jobs/8288889977

Authors:
  - Ray Douglass (https://github.com/raydouglass)

Approvers:
  - Vyas Ramasubramani (https://github.com/vyasr)
  - AJ Schmidt (https://github.com/ajschmidt8)
  - Peter Andreas Entschev (https://github.com/pentschev)

URL: #1177
Authors:
  - Peter Andreas Entschev (https://github.com/pentschev)

Approvers:
  - Lawrence Mitchell (https://github.com/wence-)

URL: #1182
Fixes #1180 

Makes the CLI runnable with `python -m ` so that the same call can be used for both CLIs in `dask-jobqueue`.

Authors:
  - Hugo MacDermott-Opeskin (https://github.com/hmacdope)
  - Peter Andreas Entschev (https://github.com/pentschev)

Approvers:
  - Lawrence Mitchell (https://github.com/wence-)
  - Peter Andreas Entschev (https://github.com/pentschev)

URL: #1181
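
To illustrate the `python -m` change from #1181 above, here is a minimal sketch of the general pattern: a package becomes runnable as a module once it ships a `__main__.py` that calls the same `main()` the console script uses. The package name `example_cli`, its arguments, and the helper `run_as_module` are hypothetical and do not reflect dask-cuda's actual layout or options.

```python
import subprocess
import sys
import tempfile
import textwrap
from pathlib import Path

# Hypothetical package name, for illustration only (not dask-cuda's layout).
PKG = "example_cli"

# cli.py: the entry point a console script would normally point at.
CLI_SOURCE = textwrap.dedent(
    '''
    import argparse

    def main(argv=None):
        parser = argparse.ArgumentParser(prog="example-cli")
        parser.add_argument("scheduler")
        parser.add_argument("--nthreads", type=int, default=1)
        args = parser.parse_args(argv)
        print(f"scheduler={args.scheduler} nthreads={args.nthreads}")
        return 0
    '''
)

# __main__.py: this file is what makes `python -m example_cli` work.
MAIN_SOURCE = textwrap.dedent(
    '''
    from .cli import main

    raise SystemExit(main())
    '''
)


def run_as_module(args):
    """Write the package to a temp dir and invoke it with `python -m`."""
    with tempfile.TemporaryDirectory() as tmp:
        pkg_dir = Path(tmp) / PKG
        pkg_dir.mkdir()
        (pkg_dir / "__init__.py").write_text("")
        (pkg_dir / "cli.py").write_text(CLI_SOURCE)
        (pkg_dir / "__main__.py").write_text(MAIN_SOURCE)
        result = subprocess.run(
            [sys.executable, "-m", PKG, *args],
            cwd=tmp,  # with -m, the working directory is put on sys.path
            capture_output=True,
            text=True,
            check=True,
        )
    return result.stdout.strip()
```

Because both the console script and `python -m` end up in the same `main()`, a launcher such as `dask-jobqueue` can build one command line and reuse it for either CLI.
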
Aligns with the rest of RAPIDS. Also needed for CUDA 12 support.

Authors:
  - https://github.com/jakirkham

Approvers:
  - Peter Andreas Entschev (https://github.com/pentschev)
  - Ray Douglass (https://github.com/raydouglass)

URL: #1185
@GPUtester GPUtester requested review from a team as code owners June 2, 2023 14:28
@github-actions github-actions bot added ci python python code needed labels Jun 2, 2023
pentschev and others added 3 commits June 6, 2023 15:57
We have been getting timeouts while waiting for workers in CI that are not reproducible locally. The likely cause is congestion that makes worker spin-up take longer in CI. This change therefore introduces a variable that controls the minimum timeout, and the minimum timeout is doubled in CI.

Authors:
   - Peter Andreas Entschev (https://github.com/pentschev)

Approvers:
   - GALI PREM SAGAR (https://github.com/galipremsagar)
   - Ray Douglass (https://github.com/raydouglass)
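
The minimum-timeout idea described in the change above can be sketched roughly as follows. This is only an illustration under stated assumptions: the environment variable name `EXAMPLE_WAIT_WORKERS_MIN_TIMEOUT`, the default of 10 seconds, and the helper `wait_workers_timeout` are all hypothetical and are not taken from the dask-cuda source.

```python
import os

# Hypothetical variable name and default, for illustration only.
MIN_TIMEOUT_ENV = "EXAMPLE_WAIT_WORKERS_MIN_TIMEOUT"
DEFAULT_MIN_TIMEOUT = 10.0  # seconds (assumed value)


def wait_workers_timeout(requested, environ=None):
    """Return the timeout to use when waiting for workers.

    The timeout a test requests is never allowed to drop below the
    configured minimum, so a slow environment can raise the floor
    without editing individual tests.
    """
    environ = os.environ if environ is None else environ
    min_timeout = float(environ.get(MIN_TIMEOUT_ENV, DEFAULT_MIN_TIMEOUT))
    return max(requested, min_timeout)
```

Under these assumptions, a CI job would export the variable at twice the default (here, `20`), while local runs leave it unset and keep the shorter floor.
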
Rather than individual tests hanging, the primary nightly problem seems to be that the `pytest` timeout is too short. This PR increases it by 10 minutes to check whether that is sufficient.

Authors:
   - Peter Andreas Entschev (https://github.com/pentschev)

Approvers:
   - Ray Douglass (https://github.com/raydouglass)
@raydouglass raydouglass merged commit ec3186d into main Jun 7, 2023
Labels
ci python python code needed