[TTS] Add Mixed Representation Training #3473

Merged: 35 commits, Feb 11, 2022


blisc
Collaborator

@blisc blisc commented Jan 20, 2022

  • Adds phoneme probability to our current g2p module.
  • Updates FastPitch yaml to use this functionality.
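To illustrate the idea behind a phoneme probability, here is a minimal sketch and not the code from this PR. It assumes that each in-dictionary word is converted to phonemes with probability `phoneme_prob` and otherwise left as graphemes. The names `TOY_DICT`, `mixed_g2p`, and `phoneme_prob` are hypothetical, and the two-word dictionary stands in for a real pronunciation dictionary such as CMUdict.

```python
import random

# Hypothetical toy pronunciation dictionary; a real setup would load CMUdict.
TOY_DICT = {
    "hello": ["HH", "AH0", "L", "OW1"],
    "world": ["W", "ER1", "L", "D"],
}


def mixed_g2p(text, phoneme_prob, rng=None):
    """Return a token list that mixes phonemes and graphemes.

    Each word found in TOY_DICT becomes phonemes with probability
    `phoneme_prob`; every other word is spelled out as characters.
    """
    if not 0.0 <= phoneme_prob <= 1.0:
        raise ValueError("phoneme_prob must be in [0, 1]")
    if rng is None:
        rng = random.Random()

    tokens = []
    for word in text.lower().split():
        pron = TOY_DICT.get(word)
        # rng.random() is in [0, 1), so prob=1.0 always picks phonemes
        # and prob=0.0 never does.
        if pron is not None and rng.random() < phoneme_prob:
            tokens.extend(pron)
        else:
            tokens.extend(list(word))
        tokens.append(" ")  # word separator
    # Drop the trailing separator, if any.
    return tokens[:-1] if tokens else tokens


print(mixed_g2p("hello world", phoneme_prob=0.5, rng=random.Random(0)))
```

With `phoneme_prob=1.0` the sketch yields only phonemes for known words, and with `phoneme_prob=0.0` only characters. Values in between give the model both representations during training.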

redoctopus and others added 11 commits January 14, 2022 14:25
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
@blisc blisc changed the title from "Add Mixed Representation Training" to "[TTS] Add Mixed Representation Training" Jan 20, 2022
blisc added 3 commits January 21, 2022 09:19
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
@blisc blisc changed the base branch from r1.6.0 to main February 1, 2022 14:54
blisc added 3 commits February 1, 2022 06:57
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
@Oktai15
Contributor

Oktai15 commented Feb 3, 2022

See this bug: #3593

@Oktai15
Contributor

Oktai15 commented Feb 3, 2022

@blisc

blisc added 7 commits February 3, 2022 08:13
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
@blisc blisc requested a review from Oktai15 February 4, 2022 16:37
@blisc blisc marked this pull request as ready for review February 4, 2022 16:38
Contributor

@Oktai15 Oktai15 left a comment

Great PR, thanks for this feature. Please address the review comments.

@@ -10,8 +10,8 @@ sup_data_path: ???
sup_data_types: [ "align_prior_matrix", "pitch" ]
Contributor

Some important notes on mixed representation training for FastPitch + Aligner overall:

  1. Did you try training this model? The FastPitch paper says, "However, the mixed approach of training on phonemes and graphemes introduced unpleasant artifacts". Have you checked that these artifacts do not appear? If they do not, that is a very significant result!
  2. Although this config is very cool, I would prefer a separate config for mixed representation training, because it is not the ordinary case. Moreover, beta_binomial_interpolator is not very accurate. What do you think? I suggest keeping this file unchanged and creating conf/fastpitch_align_mixed_representation_v1.05.yaml (we will rename it to conf/fastpitch_align_mixed_representation.yaml in 1.8.0).

Collaborator Author

I think this will be the default FastPitch config moving forward, as it is what will be used in Riva.

examples/tts/conf/fastpitch_align_v1.05.yaml (outdated, resolved)
examples/tts/conf/fastpitch_align_v1.05.yaml (outdated, resolved)
nemo/collections/tts/torch/data.py (outdated, resolved)
nemo/collections/tts/torch/data.py (resolved)
nemo/collections/tts/torch/data.py (outdated, resolved)
nemo/collections/tts/torch/g2ps.py (outdated, resolved)
nemo/collections/tts/torch/g2ps.py (outdated, resolved)
nemo/collections/tts/torch/tts_tokenizers.py (resolved)
nemo/collections/tts/torch/tts_tokenizers.py (resolved)
blisc added 3 commits February 7, 2022 09:11
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
@lgtm-com

lgtm-com bot commented Feb 7, 2022

This pull request introduces 2 alerts when merging 4f09e27 into 8313b01 - view on LGTM.com

new alerts:

  • 1 for Non-callable called
  • 1 for Unused local variable

Signed-off-by: Jason <jasoli@nvidia.com>
Jenkinsfile (outdated, resolved)
nemo/collections/tts/models/fastpitch.py (resolved)
nemo/collections/tts/models/fastpitch.py (outdated, resolved)
nemo/collections/tts/models/fastpitch.py (resolved)
nemo/collections/tts/models/mixer_tts.py (resolved)
nemo/collections/tts/torch/g2ps.py (resolved)
nemo/collections/tts/torch/tts_tokenizers.py (resolved)
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
nemo/collections/tts/models/fastpitch.py (outdated, resolved)
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
@Oktai15 Oktai15 self-requested a review February 11, 2022 20:58
Contributor

@Oktai15 Oktai15 left a comment

Great PR, thanks.

We need to think more about the accuracy of the beta binomial interpolator, and also about a potential bug in TTSDataset.

@blisc blisc merged commit c645c4c into NVIDIA:main Feb 11, 2022
@blisc blisc deleted the tts_finetuning_updates branch February 11, 2022 22:26
fayejf pushed a commit that referenced this pull request Mar 2, 2022
* Update CMUdict with ADLR version pronunciations

Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>

* minor updates for finetuning

Signed-off-by: Jason <jasoli@nvidia.com>

* update conf

Signed-off-by: Jason <jasoli@nvidia.com>

* merge

Signed-off-by: Jason <jasoli@nvidia.com>

* update

Signed-off-by: Jason <jasoli@nvidia.com>

* update

Signed-off-by: Jason <jasoli@nvidia.com>

* bug fixes

Signed-off-by: Jason <jasoli@nvidia.com>

* update config

Signed-off-by: Jason <jasoli@nvidia.com>

* bf16 support

Signed-off-by: Jason <jasoli@nvidia.com>

* bf16 support

Signed-off-by: Jason <jasoli@nvidia.com>

* bugfix

Signed-off-by: Jason <jasoli@nvidia.com>

* update

Signed-off-by: Jason <jasoli@nvidia.com>

* finalize changes

Signed-off-by: Jason <jasoli@nvidia.com>

* undo notebook 1.6.0 pins

Signed-off-by: Jason <jasoli@nvidia.com>

* more 1.6.0 undos

Signed-off-by: Jason <jasoli@nvidia.com>

* wip

Signed-off-by: Jason <jasoli@nvidia.com>

* update num_workers

Signed-off-by: Jason <jasoli@nvidia.com>

* update hypers

Signed-off-by: Jason <jasoli@nvidia.com>

* revert to main _align yamls

Signed-off-by: Jason <jasoli@nvidia.com>

* update yamls

Signed-off-by: Jason <jasoli@nvidia.com>

* cleanup

Signed-off-by: Jason <jasoli@nvidia.com>

* remove unnecessary line

Signed-off-by: Jason <jasoli@nvidia.com>

* address comments

Signed-off-by: Jason <jasoli@nvidia.com>

* update vocoder mel uploading; add contextmanager to mixed g2p

Signed-off-by: Jason <jasoli@nvidia.com>

* update comments; make prob required argument

Signed-off-by: Jason <jasoli@nvidia.com>

* added val check

Signed-off-by: Jason <jasoli@nvidia.com>

* update message

Signed-off-by: Jason <jasoli@nvidia.com>

* update

Signed-off-by: Jason <jasoli@nvidia.com>

* revert num workers

Signed-off-by: Jason <jasoli@nvidia.com>

Co-authored-by: Jocelyn Huang <jocelynh@nvidia.com>
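
The commit list above mentions adding a context manager to the mixed g2p and making the probability a required argument. The sketch below shows what such a pattern could look like in general; it is not the code merged in this PR. The class `MixedG2P` and the method `override_prob` are hypothetical names. Using the override to force a single representation during validation is an assumption about the intended use.

```python
from contextlib import contextmanager


class MixedG2P:
    """Hypothetical holder for a phoneme probability (not NeMo's class)."""

    def __init__(self, phoneme_prob):
        # No default value: the caller must choose a probability explicitly.
        self._check(phoneme_prob)
        self.phoneme_prob = phoneme_prob

    @staticmethod
    def _check(prob):
        if not 0.0 <= prob <= 1.0:
            raise ValueError("probability must be in [0, 1]")

    @contextmanager
    def override_prob(self, prob):
        """Temporarily replace phoneme_prob and restore it on exit."""
        self._check(prob)
        previous = self.phoneme_prob
        self.phoneme_prob = prob
        try:
            yield self
        finally:
            # Runs even if the body raises, so the training-time
            # probability is never left overwritten.
            self.phoneme_prob = previous


g2p = MixedG2P(phoneme_prob=0.5)
with g2p.override_prob(1.0):
    # Assumed use: e.g. validate with phonemes only.
    print(g2p.phoneme_prob)
print(g2p.phoneme_prob)
```

The `try`/`finally` keeps the override scoped to the `with` block, which avoids manual save-and-restore code at each call site.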
3 participants