Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dist: Change default library to libnccl-net-ofi.so #808

Merged
merged 1 commit into from
Mar 20, 2025

Conversation

bwbarrett
Copy link
Contributor

The NGC containers have added some logic to try and support multiple plugins by renaming the OFI plugin to libnccl-net-aws.so. Ignoring the naming for a second, it's clearly useful for us to conform to Sylvain's original expectation that everyone named plugins as libnccl-net-.so and providing a symlink to the generic name where appropriate. This patch causes us to generate the plugin as libnccl-net-ofi.so and create a symlink from libnccl-net-ofi.so to libnccl-net.so for backwards compatibility.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

rauteric
rauteric previously approved these changes Mar 14, 2025
@sielicki
Copy link

mind adding a 'fixes: #508' to the commit message?

rajachan
rajachan previously approved these changes Mar 14, 2025
The NGC containers have added some logic to try and support multiple
plugins by renaming the OFI plugin to libnccl-net-aws.so.  Ignoring
the naming for a second, it's clearly useful for us to conform to
Sylvain's original expectation that everyone named plugins as
libnccl-net-<network>.so and providing a symlink to the generic
name where appropriate.  This patch causes us to generate the plugin
as libnccl-net-ofi.so and create a symlink from libnccl-net-ofi.so to
libnccl-net.so for backwards compatibility.

Signed-off-by: Brian Barrett <bbarrett@amazon.com>
@bwbarrett bwbarrett dismissed stale reviews from rajachan and rauteric via 262ca04 March 18, 2025 16:54
@bwbarrett bwbarrett force-pushed the help-ngc-containers branch from bb536c3 to 262ca04 Compare March 18, 2025 16:54
@bwbarrett bwbarrett marked this pull request as ready for review March 18, 2025 18:34
@bwbarrett bwbarrett requested a review from a team as a code owner March 18, 2025 18:34
@rajachan rajachan merged commit f71aea2 into aws:master Mar 20, 2025
23 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants