-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Reshard] Implement replicated to split with same placement #55552
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
d2ad29f
to
ae6fc4b
Compare
bc01ab6
to
fd19cbb
Compare
ReshardFunction() = default; | ||
virtual ~ReshardFunction() = default; | ||
|
||
virtual bool Check(const DistTensor& in, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
细节问题,这个接口语义可以再具体一些吗?尽量让代码可以自解释一些,比如这里是check什么?Check是否Valid吗
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
改成了IsSuitable,检查当前状态转换函数是否适合,
paddle/phi/core/distributed/auto_parallel/reshard_split_functor.h
Outdated
Show resolved
Hide resolved
paddle/phi/core/distributed/auto_parallel/reshard_split_functor.cc
Outdated
Show resolved
Hide resolved
55bb8b6
to
1a4debb
Compare
ae347a6
to
6c53507
Compare
virtual bool Check(const DistTensor& in, | ||
const std::shared_ptr<TensorDistAttr>& out_dist_attr) = 0; | ||
|
||
virtual std::shared_ptr<DistTensor> Eval( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What does "Eval" mean?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
…ddle#55552) * Implement replicated to split reshard function * fix link error in clang * refine split functor * simplify reshard code
…ddle#55552) * Implement replicated to split reshard function * fix link error in clang * refine split functor * simplify reshard code
PR types
New features
PR changes
Others
Description
Pcard-73145
支持Replicated到Shard的状态转换,要求:
以4卡为例,输入输出都是一维process_mesh,[0, 1, 2, 3],输入为二维Replicated状态,in_tensor_shape = [4, 8],in_dims_mapping = [-1, -1],输出为二维Shard状态。