Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use ARC V2 self-hosted runners for GPU jobs #13123

Merged
merged 2 commits into from
Apr 17, 2023

Conversation

jjacobelli
Copy link
Contributor

This PR is updating the runner labels to use ARC V2 self-hosted runners for GPU jobs. This is needed to resolve the auto-scalling issues.

Signed-off-by: Jordan Jacobelli <jjacobelli@nvidia.com>
@jjacobelli jjacobelli self-assigned this Apr 17, 2023
@jjacobelli jjacobelli added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Apr 17, 2023
@jjacobelli jjacobelli marked this pull request as ready for review April 17, 2023 15:28
@jjacobelli jjacobelli requested a review from a team as a code owner April 17, 2023 15:28
Copy link
Contributor

@bdice bdice left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is there no longer a way to specify “use the latest driver version, any GPU hardware is fine”?

@ajschmidt8
Copy link
Member

Is there no longer a way to specify “use the latest driver version, any GPU hardware is fine”?

There is not, unfortunately.

arc-v2 only allows a single label per runner set.

Per our discussions with the arc team, this change is intentional.

I'm tracking the issue below for any future developments:

@bdice
Copy link
Contributor

bdice commented Apr 17, 2023

arc-v2 only allows a single label per runner set.

Then does "latest" make sense, or should we always need to specify the driver version?

@ajschmidt8
Copy link
Member

arc-v2 only allows a single label per runner set.

Then does "latest" make sense, or should we always need to specify the driver version?

We will no longer be using driver-specific labels at all for arc-v2.

There are more details in the PR below (the PR was closed before the force-push that was recently done to that repository, but will be rebased and re-opened soon)

@jjacobelli
Copy link
Contributor Author

/merge

@rapids-bot rapids-bot bot merged commit b8ab63d into rapidsai:branch-23.06 Apr 17, 2023
@jjacobelli jjacobelli deleted the arc-gpu branch April 17, 2023 19:05
shwina pushed a commit to shwina/cudf that referenced this pull request Apr 18, 2023
This PR is updating the runner labels to use ARC V2 self-hosted runners for GPU jobs. This is needed to resolve the auto-scalling issues.

Authors:
  - Jordan Jacobelli (https://github.com/jjacobelli)

Approvers:
  - AJ Schmidt (https://github.com/ajschmidt8)

URL: rapidsai#13123
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
improvement Improvement / enhancement to an existing function non-breaking Non-breaking change
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants