Showing equivalency b/w two branches. #1

taylanbil · 2019-11-16T01:33:32Z

No description provided.

optimizer fix progress bar comment out temporarily some changes to train_tpu int mask instead of float

* Loss function replaced with an equivalent logic that doesn't resize tensors. * cli args changed to guarantee consistency * collate_tokens function in fairseq/data/data_utils.py overwritten to guarantee consistency

some irrelevant changes to train_tpu.py

+ Tried to include more explanation why skip optimizer step this time

Delete optimizer step in Fairseq's trainer

deleted obsolete file

…torch changes (#6)

* Adding tpu capabilities to train.py * flush when printing for better user experience * separated cli_main into parse_args, maingpu and maintpu deleted unused line in datautils.py

* Adding tpu capabilities to train.py * flush when printing for better user experience * separated cli_main into parse_args, maingpu and maintpu deleted unused line in datautils.py * Enumerate the loader * enumerate the loader

…arch#10) * Add option to assert on training and/or validation loss * applied suggestion

* initial commit for multiprocess api * indentation fixes and import fix * no need to softlink, fix save/load * Remove the hacks to only save from master ordinal as xm.save takes care of that * fix indentation; 3 -> 4 spaces * Moved xu.eprints after spawn and dropping last batches better

…earch#15)

…acebookresearch#17)

… taylanbil-tpu-rebase-master

…d the multihead attention switch case

taylanbil and others added 30 commits June 13, 2019 23:23

Making tpu training work

420a3b0

optimizer fix progress bar comment out temporarily some changes to train_tpu int mask instead of float

pfpfpfpf

8a61e9f

fix

26c8256

printing device index per loop

9698a1a

Merge branch 'master' of github.com:pytorch/fairseq into xla

fea239f

bkpt to investigate resize_ call

1d65da9

attempting to init buffer size to 2*dim

20d92b8

attempting to init buffer size to 2*dim

9e7aee2

Merge branch 'xla' of github.com:taylanbil/fairseq into xla

35891c3

Merge branch 'xla' of github.com:taylanbil/fairseq into xla

cf19243

Merge branch 'xla' of github.com:taylanbil/fairseq into xla

c74706c

bkpt

174762e

blabla

51a3496

better print

22d5f8f

do not drop records when computing loss

7fa3d95

Merge branch 'master' of github.com:pytorch/fairseq into xla

ddb3586

Merge branch 'xla' of github.com:taylanbil/fairseq into xla

61ef1fe

Changes that reduce graph compiles.

918207d

* Loss function replaced with an equivalent logic that doesn't resize tensors. * cli args changed to guarantee consistency * collate_tokens function in fairseq/data/data_utils.py overwritten to guarantee consistency

undoing some changes made while debugging

396e614

Merge branch 'master' of github.com:pytorch/fairseq into xla

aed593b

progress_bar implements len

7f3097b

some irrelevant changes to train_tpu.py

Merge branch 'xla' of github.com:taylanbil/fairseq into xla

3401e23

Merge branch 'master' of github.com:pytorch/fairseq into xla

442b532

Merge branch 'xla' of github.com:taylanbil/fairseq into xla

11bff1e

Merge branch 'master' of github.com:pytorch/fairseq into xla

a9ac079

Merge branch 'master' of github.com:pytorch/fairseq into xla

8e1178a

Merge branch 'master' of github.com:pytorch/fairseq into xla

8507019

Merge branch 'master' of github.com:pytorch/fairseq into xla

6c92931

Merge branch 'master' of github.com:pytorch/fairseq into xla

b46e018

Merge branch 'master' of github.com:pytorch/fairseq into xla

51bcfe5

taylanbil and others added 24 commits July 26, 2019 16:30

removing the last batch that is of diferent size from the iterator

2ee5407

delete optimizer step in fairseq s trainer

7229556

Added self.xla flag that controls if Trainer includes optimizer step

0301fa4

+ Tried to include more explanation why skip optimizer step this time

Merge pull request #1 from taylanbil/deloptstep

03970bb

Delete optimizer step in Fairseq's trainer

deleted obsolete file

f97bc92

Merge pull request #2 from taylanbil/deloldtrain

5a872af

deleted obsolete file

isolated chkp path to a util function so I can use it in the runner (#3)

c72d8af

add norm clipping count back in (#4)

7a47018

remove grad norm clip count (#5)

d3bfbda

Change masked_fill_ input in loss in order to accomodate necessary py…

15901c0

…torch changes (#6)

Adding tpu capabilities to train.py (facebookresearch#8)

7a9e90e

* Adding tpu capabilities to train.py * flush when printing for better user experience * separated cli_main into parse_args, maingpu and maintpu deleted unused line in datautils.py

Add option to assert on training and/or validation loss (facebookrese…

63c19e9

…arch#10) * Add option to assert on training and/or validation loss * applied suggestion

None loss should be filled to inf (facebookresearch#11)

6ce1fad

trainers->trainer (facebookresearch#13)

22c4e9b

fix bug in assert_on_losses

87e0cf6

Replace usage of unsqueeze with transpose + broadcasting (facebookres…

01d2c0f

…earch#15)

master ordinal makedirs (facebookresearch#16)

92f19a2

Enable usage of batch_by_size for other code paths than just train (f…

aa2c3b3

…acebookresearch#17)

option to suppress loss report

fd7dcac

send meters to device

f17ad03

Merge branch 'tpu-rebase-master' of github.com:taylanbil/fairseq into…

d370e6b

… taylanbil-tpu-rebase-master

git wtf

043b6a9

taylanbil mentioned this pull request Nov 16, 2019

Rebasing tpu branch on a more recent fairseq upstream commit pytorch-tpu/fairseq#19

Merged

taylanbil added 4 commits November 18, 2019 17:56

Clean up comments, unused imports, and reuse var in checkpoint saving

12aaf54

Added comments to various places of tpu related code change, and fixe…

8de1826

…d the multihead attention switch case

Added comments to various places of tpu related code change, and fixe…

5120a2b

…d the multihead attention switch case

More documentation for sequence padding

bbfeec9

taylanbil closed this Dec 27, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Showing equivalency b/w two branches. #1

Showing equivalency b/w two branches. #1

taylanbil commented Nov 16, 2019

Showing equivalency b/w two branches. #1

Showing equivalency b/w two branches. #1

Conversation

taylanbil commented Nov 16, 2019