Support FP8 mixed precision training for Ada Lovelace GPUs #1348
As mentioned in NVIDIA/TransformerEngine#15 (comment),
GPUs with compute capability 8.9 (Ada Lovelace, including the RTX 40 series and the RTX 6000 Ada) can now benefit from FP8 acceleration with CUDA Toolkit 12.1 Update 1 and Transformer Engine 0.7.
This pull request reflects that change.
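For illustration, a minimal sketch of what gating FP8 on Ada Lovelace typically looks like is shown below. The capability check and the `fp8_autocast` context follow Transformer Engine's documented PyTorch API; the model, optimizer, and recipe settings are assumptions for the example, not code from this PR.

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

def fp8_available() -> bool:
    # FP8 is supported on compute capability 8.9 (Ada Lovelace)
    # and 9.0 (Hopper); older architectures fall back to BF16/FP16.
    return torch.cuda.get_device_capability() >= (8, 9)

# Hypothetical model/optimizer setup, for illustration only.
model = te.Linear(768, 768).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
fp8_recipe = recipe.DelayedScaling(margin=0, fp8_format=recipe.Format.HYBRID)

inp = torch.randn(16, 768, device="cuda")
with te.fp8_autocast(enabled=fp8_available(), fp8_recipe=fp8_recipe):
    out = model(inp)
out.sum().backward()
optimizer.step()
```

On a pre-8.9 GPU, `fp8_available()` returns `False` and the autocast context is a no-op, so the same training script runs unmodified in higher precision.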