-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unify ProcessGroupNCCL
APIs underlying implementation
#48163
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
ProcessGroupNCCL
APIs underlying implementation
phi::DenseTensor* tensor, | ||
int rank, | ||
std::shared_ptr<ProcessGroup::Task> ProcessGroupNCCL::NCCLEnv( | ||
const phi::DenseTensor& tensor, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
如果想要这个语义的话,不建议传入一个tensor,还不如place和holder
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
当时是考虑到万一后面还有别的参数,tensor 里包含的信息比较多
002cb2a
to
73bda53
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
PR types
Others
PR changes
APIs
Describe
由于集合通信
ProcessGroupNCCL
底层接口的实现高度重合,在此统一接口实现,并规范接口参数,为动静统一时静态图传入的 const input 提供支持。