
Checking we use fused kernels to compute scaled masked softmax on prefix lm #209

Merged: 3 commits into main on Nov 26, 2021

Conversation

thomasw21 (Member) commented Nov 26, 2021

There seem to be no issues with the CUDA kernels, as the tests pass locally.

@thomasw21 thomasw21 changed the title [WIP] Checking when we use fused kernels to compute scaled masked softmax Checking we use fused kernels to compute scaled masked softmax on prefix lm Nov 26, 2021
@thomasw21 thomasw21 marked this pull request as ready for review November 26, 2021 16:02
@thomasw21 thomasw21 merged commit b227590 into main Nov 26, 2021
stas00 (Contributor) commented Nov 27, 2021

@thomasw21, this PR appears to have broken the test suite:

>               self.assertIn("Using fused softmax", cs.out)
E               AssertionError: 'Using fused softmax' not found in 'using world size: 1, data-parallel-size: 1, tensor-model-parallel size: 1, pipeline-model-parallel size: 1 \nusing torch.float16 for parameters ...\n------------------------ arguments ------------------------\n  accumulate_allreduce_grads_in_fp32 .......
=========================== short test summary info ============================
FAILED tests/test_model.py::MyTestCase::test_prefix_lm_wo_reset_attention_mask
FAILED tests/test_training.py::MegDSTestTraining::test_training_prefix_lm_all_0
FAILED tests/test_training.py::MegDSTestTraining::test_training_prefix_lm_all_1
FAILED tests/test_training.py::MegDSTestTraining::test_training_prefix_lm_all_2
FAILED tests/test_training.py::MegDSTestTraining::test_training_prefix_lm_all_3

Full logs:

https://github.com/bigscience-workshop/Megatron-DeepSpeed/runs/4339199658?check_suite_focus=true

In general, please try to run the test suite locally if AWS doesn't give resources to run the CI (which unfortunately sucks :( ).
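
For reference, the failing assertion follows a simple capture-and-check pattern: redirect stdout while the model runs, then assert that the kernel-selection marker was printed. Below is a minimal, self-contained sketch of that pattern; `run_attention_forward` is a hypothetical stand-in for the real Megatron-DeepSpeed forward pass, and the actual suite captures output through its own `cs.out` helper rather than `redirect_stdout`:

```python
import io
import unittest
from contextlib import redirect_stdout

def run_attention_forward():
    # Hypothetical stand-in for the code under test. In the real suite,
    # the test drives a prefix-LM batch and the model prints this marker
    # when the fused scaled-masked-softmax CUDA kernel is selected.
    print("Using fused softmax")

class FusedSoftmaxLoggingTest(unittest.TestCase):
    def test_reports_fused_softmax(self):
        # Capture everything written to stdout during the run, then assert
        # the kernel-selection marker appeared, mirroring the assertion in
        # the failing tests above.
        buf = io.StringIO()
        with redirect_stdout(buf):
            run_attention_forward()
        self.assertIn("Using fused softmax", buf.getvalue())

if __name__ == "__main__":
    unittest.main()
```

To reproduce a single failure locally, narrowing the run with pytest's keyword filter (e.g. `pytest tests/test_model.py -k prefix_lm`) avoids running the whole suite.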

thomasw21 added a commit that referenced this pull request Nov 27, 2021
thomasw21 (Member, Author) commented:

Hmm, reverted in c9afebc, though this passed locally. I'll have a second look at it on Monday.

stas00 (Contributor) commented Nov 27, 2021

Thank you, Thomas! That's helpful to other PRs.

Merging this pull request may close: [PrefixLM] Improve test to test out custom cuda kernel