Specify ASCEND NPU for inference. #3635

Open: wants to merge 1 commit into base: main

Conversation

as12138 commented Nov 29, 2024

Why are these changes needed?

When deploying inference services on Ascend NPUs, there is currently no way to specify which card (device) to use. This PR makes it possible to choose the NPU. @infwinston @CodingWithTim
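
For readers unfamiliar with device selection, here is a minimal sketch of one common approach: restricting the visible cards through an environment variable before the framework initializes. It is not taken from this PR's diff, which is not shown here. The helper name `select_devices` and the mapping table are hypothetical; `ASCEND_RT_VISIBLE_DEVICES` is assumed to be the variable that the Ascend runtime honors, analogous to `CUDA_VISIBLE_DEVICES` on NVIDIA GPUs.

```python
import os
from typing import MutableMapping, Optional

# Assumed mapping from device type to the environment variable that
# limits which cards the runtime can see (illustrative, not from the PR).
VISIBLE_DEVICES_ENV = {
    "cuda": "CUDA_VISIBLE_DEVICES",
    "npu": "ASCEND_RT_VISIBLE_DEVICES",
}


def select_devices(
    device: str,
    ids: str,
    env: Optional[MutableMapping[str, str]] = None,
) -> MutableMapping[str, str]:
    """Hypothetical helper: restrict visible accelerators to `ids`.

    `ids` is a comma-separated list of card indices such as "0,1".
    Call this before the deep learning framework is initialized,
    because the runtime typically reads the variable only once.
    """
    if env is None:
        env = os.environ
    if device not in VISIBLE_DEVICES_ENV:
        raise ValueError(f"unsupported device type: {device!r}")

    # Normalize "2, 3" to ["2", "3"] and reject anything non-numeric.
    parts = [p.strip() for p in ids.split(",") if p.strip()]
    if not parts or not all(p.isdigit() for p in parts):
        raise ValueError(f"invalid device id list: {ids!r}")

    env[VISIBLE_DEVICES_ENV[device]] = ",".join(parts)
    return env
```

Under these assumptions, calling `select_devices("npu", "2,3")` at the top of a worker process would limit that process to cards 2 and 3.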

Related issue number (if applicable)

Checks

  • I've run format.sh to lint the changes in this PR.
  • I've included any doc changes needed.
  • I've made sure the relevant tests are passing (if applicable).

as12138 force-pushed the main branch 2 times, most recently from 83d36e2 to 6892888, on November 29, 2024 09:11
as12138 (Author) commented Dec 13, 2024

@infwinston
