Decouple the program desc from batch_size in Transformer. #783
Conversation
… fix-transformer-batchsize-dev
```diff
@@ -273,6 +301,9 @@ def main():
     trg_idx2word = paddle.dataset.wmt16.get_dict(
         "de", dict_size=ModelHyperParams.trg_vocab_size, reverse=True)
+    # Append the <pad> token since the dict provided by dataset.wmt16 does
+    # not include it.
+    trg_idx2word[ModelHyperParams.trg_pad_idx] = "<pad>"
```
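For context, the inference code converts predicted token indices back to words through trg_idx2word, so every index that can appear in the output, including the pad index, needs an entry. A minimal, self-contained sketch of the failure this change avoids (the toy dict and indices are hypothetical stand-ins for the real wmt16 dict):

```python
# Toy stand-in for paddle.dataset.wmt16.get_dict("de", ..., reverse=True),
# which returns {index: word} without an entry for <pad>.
trg_pad_idx = 0                          # hypothetical pad index
trg_idx2word = {1: "hallo", 2: "welt"}   # hypothetical dict contents

# The fix: register the pad index explicitly, as the PR does with
# trg_idx2word[ModelHyperParams.trg_pad_idx] = "<pad>".
trg_idx2word[trg_pad_idx] = "<pad>"

# Decoding a padded prediction now works instead of raising KeyError.
pred_ids = [1, 2, trg_pad_idx]
print(" ".join(trg_idx2word[idx] for idx in pred_ids))  # hallo welt <pad>
```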
Please fix this in the next PR.
Got it.
```diff
@@ -138,12 +144,14 @@ def test(exe):
     test_avg_costs = []
     for batch_id, data in enumerate(val_data()):
         if len(data) != TrainTaskConfig.batch_size:
```
Please fix this; validation should not simply skip batches smaller than batch_size.
Done. Refined the validation to use global statistics instead of per-batch averages.
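A minimal, self-contained sketch of what "use the global statistics" can look like: accumulate a summed cost and a token count per batch, then divide once at the end, so batches smaller than TrainTaskConfig.batch_size are weighted correctly instead of being skipped (the per-batch numbers below are hypothetical):

```python
# Hypothetical (summed_cost, token_count) pairs, one per validation
# batch; the last batch is shorter than the configured batch size.
batch_stats = [(12.6, 40), (9.3, 32), (2.1, 7)]

# Global statistics: sum first, divide once at the end.
total_cost = sum(cost for cost, _ in batch_stats)
total_tokens = sum(num for _, num in batch_stats)
global_avg_cost = total_cost / total_tokens  # every token counts equally

# Averaging per-batch averages would over-weight the short batch:
naive_avg = sum(c / n for c, n in batch_stats) / len(batch_stats)
print(global_avg_cost, naive_avg)
```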
LGTM
Decouple the program desc from batch_size in Transformer. The inference program has been validated to generate the same sentences for different batch sizes.
It relies on PaddlePaddle/Paddle#9008.
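The cross-batch-size validation mentioned above can be expressed as a small check like the following sketch; infer_sentences is a hypothetical wrapper that runs the compiled inference program on a list of source sentences with a given batch size and returns the generated target sentences in order:

```python
def check_batch_size_invariance(sentences, infer_sentences):
    # Reference output from the smallest batch size.
    ref = infer_sentences(sentences, batch_size=1)
    for bs in (2, 4, 8):
        out = infer_sentences(sentences, batch_size=bs)
        # With the program desc decoupled from batch_size, the generated
        # sentences must not change when only the batch size changes.
        assert out == ref, "generation differs at batch_size=%d" % bs
    print("Generated sentences are identical across batch sizes.")
```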