Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Accelerator] Cambricon MLU support #6472

Merged
merged 9 commits into from
Sep 26, 2024

Conversation

Andy666G
Copy link
Contributor

@Andy666G Andy666G commented Sep 2, 2024

Description

This PR includes Cambricon MLU accelerator support.
With this PR, DeepSpeed supports MLU as backend for training and inference tasks.

@tjruwase
Copy link
Contributor

tjruwase commented Sep 3, 2024

Please see here to fix formatting issues.

Also, consider updating the following accelerator docs:

  1. Tutorial.
  2. HW list.

@loadams
Copy link
Contributor

loadams commented Sep 11, 2024

@Andy666G - curious if you would be able to run the precommit formatter to resolve the formatting errors?

@Andy666G
Copy link
Contributor Author

@Andy666G - curious if you would be able to run the precommit formatter to resolve the formatting errors?

Hi, the formatting errors have been resolved.

@tjruwase
Copy link
Contributor

@Andy666G, do you plan to address my feedback regarding documentation in a separate PR?
#6472 (comment)

@tjruwase tjruwase added this pull request to the merge queue Sep 26, 2024
Merged via the queue into microsoft:master with commit 0fbe96a Sep 26, 2024
12 checks passed
loadams added a commit that referenced this pull request Sep 27, 2024
…ch workflow triggers (#6584)

Changes from #6472 caused the no-torch workflow that is an example of
how we build the DeepSpeed release package to fail (so we caught this
before a release, see more in #6402). These changes also copy the style
used to include torch in other accelerator op_builder implementations,
such as npu
[here](https://github.com/microsoft/DeepSpeed/blob/master/op_builder/npu/fused_adam.py#L8)
and hpu
[here](https://github.com/microsoft/DeepSpeed/blob/828ddfbbda2482412fffc89f5fcd3b0d0eba9a62/op_builder/hpu/fused_adam.py#L15).

This also updates the no-torch workflow to run on all changes to the
op_builder directory. The test runs quickly and shouldn't add any
additional testing burden there.

Resolves: #6576
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants