metrics.Accuracy is not calculated correctly when the first argument is of type float16 #4840

pgagarinov · 2020-11-24T21:36:03Z

🐛 Bug

To Reproduce

>a = tensor([0.5015, 0.5068, 0.4597, 0.5176, 0.5063, 0.4873, 0.5073, 0.5049, 0.4871,
        0.4939, 0.5132, 0.5151, 0.5269, 0.5229, 0.4797, 0.5435],
       device='cuda:0', dtype=torch.float16)
>b = tensor([1., 0., 1., 1., 1., 0., 1., 0., 0., 0., 0., 1., 1., 0., 0., 0.],
       device='cuda:0')

>acc = metrics.Accuracy(compute_on_step = True).to('cuda')

>acc(a, b)
tensor(0., device='cuda:0')

Expected behavior

Same as for float32:

>a = tensor([0.5015, 0.5068, 0.4597, 0.5176, 0.5063, 0.4873, 0.5073, 0.5049, 0.4871,
        0.4939, 0.5132, 0.5151, 0.5269, 0.5229, 0.4797, 0.5435],
       device='cuda:0')
>b = tensor([1., 0., 1., 1., 1., 0., 1., 0., 0., 0., 0., 1., 1., 0., 0., 0.],
       device='cuda:0')

>acc(a, b)
tensor(0.6250, device='cuda:0')

Additional context

I run training with precision=16, so the model outputs float16 logits, which I pass to F.softmax and then to metrics.Accuracy.
The dtype of F.softmax(logit) depends on whether precision=16 is specified.
Precision, recall and F1 metrics seem to be calculated correctly.


SkafteNicki · 2020-11-25T07:20:19Z

The problem is this check:
https://github.com/PyTorchLightning/pytorch-lightning/blob/78076ea0d99e4ba1f76a7992b7090812258c0d4d/pytorch_lightning/metrics/utils.py#L98-L100
It assumes preds are float. This will probably be solved by PR #4837, but I need to check up on that.
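
A minimal sketch of how such a check could produce the reported numbers, assuming the linked lines threshold preds only when `preds.dtype == torch.float` (in PyTorch, `torch.float` is an alias for float32). The helpers `format_preds_float32_only`, `format_preds_any_float` and `accuracy` are hypothetical names for illustration and are not the library's code. It runs on CPU so that no GPU is needed.

```python
import torch


def format_preds_float32_only(preds, threshold=0.5):
    # Assumed shape of the check: only float32 preds are thresholded.
    if preds.dtype == torch.float:
        preds = (preds >= threshold).long()
    return preds


def format_preds_any_float(preds, threshold=0.5):
    # Hypothetical variant: threshold every floating-point dtype.
    if preds.is_floating_point():
        preds = (preds.float() >= threshold).long()
    return preds


def accuracy(preds, target):
    # Fraction of elementwise matches between preds and target.
    return (preds == target).float().mean()


probs = [0.5015, 0.5068, 0.4597, 0.5176, 0.5063, 0.4873, 0.5073, 0.5049,
         0.4871, 0.4939, 0.5132, 0.5151, 0.5269, 0.5229, 0.4797, 0.5435]
target = torch.tensor([1., 0., 1., 1., 1., 0., 1., 0.,
                       0., 0., 0., 1., 1., 0., 0., 0.])

half = torch.tensor(probs, dtype=torch.float16)
full = torch.tensor(probs, dtype=torch.float32)

# float16 preds skip thresholding, so raw probabilities are compared to 0/1.
acc_half = accuracy(format_preds_float32_only(half), target)
# float32 preds are thresholded as intended.
acc_full = accuracy(format_preds_float32_only(full), target)
# Possible workaround on the caller's side: cast to float32 first.
acc_cast = accuracy(format_preds_float32_only(half.float()), target)
# Dtype-agnostic check.
acc_any = accuracy(format_preds_any_float(half), target)

print(acc_half.item(), acc_full.item(), acc_cast.item(), acc_any.item())
```

Until the fix is released, casting the predictions with `.float()` before calling the metric, as in `acc_cast` above, may be enough to avoid the problem.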

tchaton · 2020-11-27T12:25:38Z

Hey @SkafteNicki,

Any update on this?

Best,
T.C

SkafteNicki · 2020-11-30T09:52:54Z

@tchaton I just confirmed that this will be solved when PR #4837 is merged.

pgagarinov added the bug and help wanted labels Nov 24, 2020

awaelchli added the Metrics label Nov 25, 2020

SkafteNicki linked a pull request Nov 30, 2020 that will close this issue

Classification metrics overhaul: input formatting standardization (1/n) #4837

Merged

tchaton assigned SkafteNicki Nov 30, 2020

Borda added the good first issue label Dec 1, 2020

Borda closed this as completed in #4837 Dec 7, 2020

luzuku mentioned this issue Dec 8, 2020

Accuracy metric for preds at half precision is zero with pl=1.0.8 #5013

Closed
