Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[XPU] refine scale_value in AdamW #58805

Merged
merged 9 commits into from
Nov 13, 2023

Conversation

houj04
Copy link
Contributor

@houj04 houj04 commented Nov 8, 2023

PR types

Function optimization

PR changes

APIs

Description

在之前的PR #57077 中,给XPU下的DenseTensor新增了scale_value属性。

该PR有两个问题:(1)对DenseTensor的内存布局有影响(2)没有支持优化器的state_dictset_state_dict功能,因此在使用checkpoint进行训练状态保存和恢复的时候会有问题。

本PR对上述两个问题进行了优化和修复。

@houj04 houj04 changed the title [XPU] fix adamw optimizer state_dict [XPU] access scale_value of Tensor in python Nov 9, 2023
@houj04 houj04 changed the title [XPU] access scale_value of Tensor in python [XPU] refine scale_value in AdamW Nov 12, 2023
Copy link
Contributor

@chenwhql chenwhql left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Contributor

@lj970926 lj970926 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@houj04 houj04 merged commit 93b4f21 into PaddlePaddle:develop Nov 13, 2023
28 checks passed
danleifeng pushed a commit to danleifeng/Paddle that referenced this pull request Nov 14, 2023
* [XPU] fix adamw optimizer state_dict

* fix typo

* rename scale to scale_value

* revert unnecessary codes.

* remove scale_value from DenseTensorMeta

* fix typo
SecretXV pushed a commit to SecretXV/Paddle that referenced this pull request Nov 28, 2023
* [XPU] fix adamw optimizer state_dict

* fix typo

* rename scale to scale_value

* revert unnecessary codes.

* remove scale_value from DenseTensorMeta

* fix typo
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants