Fix gradients with ignore_idx in softmax_with_cross_entropy #28622

guoshengCS · 2020-11-14T07:32:22Z

PR types

Bug fixes

PR changes

OPs

Describe

Fix gradient calculation with ignore_idx in softmax_with_cross_entropy.

Example code that reproduces the incorrect gradients for elements whose label equals ignore_index:

import paddle
import numpy as np

paddle.seed(123)
np.random.seed(123)

class Net(paddle.nn.Layer):
    def __init__(self):
        super(Net, self).__init__()
        self.embedder = paddle.nn.Embedding(100, 64)
        self.linear = paddle.nn.Linear(64, 5)

    def forward(self, x, y):
        x = self.embedder(x)
        self.logits = logits = self.linear(x)
        print(y)
        loss = paddle.nn.functional.softmax_with_cross_entropy(logits, y, ignore_index=1)
        loss = paddle.mean(loss)
        return loss

x_data = np.random.randint(0, 5, (4,)).astype("int64")
x = paddle.to_tensor(x_data)
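# Cast explicitly: the default integer dtype of np.random.randint is platform-dependent.
y_data = np.random.randint(0, 5, (4, 1)).astype("int64")
y_data[0, 0] = 1  # label equal to ignore_index, so sample 0 is ignored by the loss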
y = paddle.to_tensor(y_data)

net = Net()
loss = net(x, y)
loss.backward()
print(net.logits.grad)  # the row for the ignored sample (row 0) is expected to be all zeros
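For comparison, the gradient that the script above is expected to print can be sketched in plain Python. This is only a reference sketch, not the operator's kernel code: the helper names `softmax` and `reference_logits_grad` are hypothetical, and it assumes hard labels, the usual softmax cross-entropy gradient (softmax minus one-hot), and a per-sample upstream gradient of 1/n coming from `paddle.mean` over all n samples, ignored ones included.

```python
import math


def softmax(row):
    """Numerically stable softmax of one row of logits."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def reference_logits_grad(logits, labels, ignore_index, upstream=None):
    """Expected d(mean loss)/d(logits) for hard-label softmax cross entropy.

    Hypothetical helper for illustration only.
    logits: list of rows, labels: list of ints, upstream: per-sample gradient.
    """
    n = len(logits)
    if upstream is None:
        # Assumption: the per-sample losses are averaged over all n samples.
        upstream = [1.0 / n] * n
    grads = []
    for row, label, g in zip(logits, labels, upstream):
        if label == ignore_index:
            # An ignored sample contributes nothing to the loss,
            # so its logits should receive a zero gradient.
            grads.append([0.0] * len(row))
            continue
        probs = softmax(row)
        grads.append(
            [g * (p - (1.0 if j == label else 0.0)) for j, p in enumerate(probs)]
        )
    return grads


# Two samples with five classes; the first label equals ignore_index.
logits = [[0.5, -1.0, 2.0, 0.0, 0.3], [1.0, 1.0, 1.0, 1.0, 1.0]]
labels = [1, 3]
grad = reference_logits_grad(logits, labels, ignore_index=1)
print(grad[0])  # all zeros for the ignored sample
print(grad[1])  # softmax minus one-hot, scaled by 1/2
```

Comparing the row of `net.logits.grad` that belongs to the ignored sample against a zero row is a quick way to see whether the bug is present.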

test=develop

paddle-bot-old · 2020-11-14T07:32:29Z

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

paddle-bot-old · 2020-11-14T07:32:30Z

✅ This PR's description meets the template requirements!
Please wait for other CI results.

Remove softmax_with_cross_entropy from op_threshold_white_list. test=develop

test=develop

Xreki

LGTM for the change of max_relative_error

Fix gradients with ignore_idx in softmax_with_cross_entropy.

6ca6aaa

test=develop

guoshengCS added 2 commits November 14, 2020 20:28

Fix gradients with ignore_idx in softmax_with_cross_entropy on cpu.

332141f

Remove softmax_with_cross_entropy from op_threshold_white_list. test=develop

Fix test_softmax_cross_entropy_op.py.

7e590ed

test=develop

PaddlePaddle locked and limited conversation to collaborators Nov 14, 2020

PaddlePaddle unlocked this conversation Nov 14, 2020

guoshengCS requested review from ZeyuChen, heavengate and Xreki November 16, 2020 05:32

Xreki approved these changes Nov 16, 2020

heavengate approved these changes Nov 16, 2020

guoshengCS merged commit 110febd into PaddlePaddle:develop Nov 16, 2020
