Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make note about grad accum and prec in performance documentation #1296

Merged
merged 1 commit into from
Apr 6, 2023

Conversation

muellerzr
Copy link
Collaborator

Helps #1277 by adding a note based on my tests yesterday. tl;dr: When trying to compare performance you won't see a 1:1 if you use gradient accumulation or mixed precision due to averaging of the gradients and their overall precision, though the end result should be similar.

@muellerzr muellerzr added the documentation Improvements or additions to documentation label Apr 6, 2023
@muellerzr muellerzr requested a review from sgugger April 6, 2023 14:20
@HuggingFaceDocBuilderDev
Copy link

HuggingFaceDocBuilderDev commented Apr 6, 2023

The documentation is not available anymore as the PR was closed or merged.

Copy link
Collaborator

@sgugger sgugger left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the note!

@muellerzr muellerzr merged commit 419ecf3 into main Apr 6, 2023
@muellerzr muellerzr deleted the performance-docs branch April 6, 2023 15:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants