Add inference kv cache support for transformer TE path #6627
Commits on Jun 2, 2023
-
Add kv cache support for transformer TE path
Signed-off-by: Yen-Shi Wang <yenshiw@nvidia.com>
Yen-Shi Wang committed Jun 2, 2023
Commit 5cb0bb5
-
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Commit 6b817eb
-
Mark get_data_parallel_group as WAR
Signed-off-by: Yen-Shi Wang <yenshiw@nvidia.com>
Yen-Shi Wang committed Jun 2, 2023
Commit a770914
-
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Commit 512e6ee
-
Initialize process group for FP8 training
Signed-off-by: Tim Moon <tmoon@nvidia.com>
Commit cb4ee8d
-
Update Megatron GPT eval script for non-FP8 path
Signed-off-by: Yen-Shi Wang <yenshiw@nvidia.com>
Yen-Shi Wang committed Jun 2, 2023
Commit c047d8c
Commits on Jun 3, 2023
-
Merge branch 'main' into dev-yenshiw-te-fp8-inference
Signed-off-by: Yen-Shi Wang <6960565+yen-shi@users.noreply.github.com>
Commit 2134a9a
-
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Commit 9e5518e
Commits on Jun 4, 2023
-
Commit cc26e5b
Commits on Jun 6, 2023
-
Commit e2a52be