IntelMPI support #1807

Open
eero-t opened this issue May 17, 2023 · 4 comments

Comments

@eero-t

eero-t commented May 17, 2023

The mpi-operator MPIJob gained IntelMPI support back in summer 2021. Although training-operator added MPIJob (shortly) after that, it is still missing IntelMPI support.

Related mpi-operator PRs can be seen from this list: https://github.com/kubeflow/mpi-operator/pulls?q=is%3Apr+intel+mpi+is%3Aclosed

PR #1804 adds IntelMPI env var support, but other changes are needed as well.

IMHO, the most important ones from mpi-operator are:

And these few other PRs could also be relevant:

@eero-t
Author

eero-t commented May 17, 2023

Having (eventually) the same API and the same MPI implementation support for MPIJob as in mpi-operator would make it easier to switch between the two operators and compare them. As I understand it, they differ somewhat in how they do things, so this would help gather data on whether those differences matter in practice.

@johnugeorge
Member

Related: #1804

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@johnugeorge
Member

/lifecycle frozen
