-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PIR & Inference] Refactor fused_weight_only_linear_pass #59792
[PIR & Inference] Refactor fused_weight_only_linear_pass #59792
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
✅ This PR's description meets the template requirements! |
if (sm_vesion != 70 && sm_vesion != 80 && sm_vesion != 86 && | ||
sm_vesion != 75) { | ||
// TODO(Wanglongzhi2001): only support sm80 for now | ||
if (sm_vesion != 80) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
当前develop版本已经支持了70 75 80 86这四个架构
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
@@ -30,5 +30,7 @@ bool DtypeInterface::operator==(const DtypeInterface& other) const { | |||
return *dtype_ == *other.dtype_; | |||
} | |||
|
|||
IrDtype DtypeInterface::dtype() const { return *(this->dtype_); } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这里返回类型直接用pir::Type
吧,设计早起本来是不希望暴露太多pir的概念到上层,目前看pir::Type
在drr中的使用场景比较有限,可以直接用
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
是指像下面这样吗
pir::Type DtypeInterface::dtype() const { return this->dtype_->get(); }
PR types
Others
PR changes
Others
Description
Pcard-71502
修改了之前的 fused_weight_only_linear_pass 的 PR #59366