ForwardTTSE2E implementations and related API changes #1510

erogol · 2022-04-19T07:25:51Z

No description provided.

e0xextazy · 2022-04-19T09:01:44Z

Do I understand correctly that these changes were made for the only VITS model (YourTTS) from the repository? Since it is the only End2End you have?

WeberJulian · 2022-04-19T12:48:09Z

TTS/tts/layers/feed_forward/decoder.py

@@ -191,6 +191,9 @@ def __init__(
    ):
        super().__init__()

+        if c_in_channels and c_in_channels != 0:


c_in_channels != 0 is redundant since the first clause of the and would be evaluated to False if c_in_channels == 0

WeberJulian · 2022-04-19T13:14:06Z

TTS/tts/layers/feed_forward/decoder.py

@@ -225,6 +228,9 @@ def forward(self, x, x_mask, g=None):  # pylint: disable=unused-argument
            x_mask: [B, 1, T]
            g: [B, C_g, 1]
        """
-        # TODO: implement multi-speaker
-        o = self.decoder(x, x_mask, g)
+        # multi-speaker conditioning


won't that break the vctk/fast_pitch released model?

Not sure. Even if it does, I think this is the right way to go, similar to the VITS model.

WeberJulian · 2022-04-19T15:46:38Z

TTS/tts/models/forward_tts_e2e.py

+            return outputs, loss_dict
+
+        if optimizer_idx == 1:
+            mel = batch["mel_input"].transpose(1, 2)


Maybe implementing steps_to_start_discriminator would allow for faster training

Good idea! I'll try.

WeberJulian · 2022-04-19T17:28:43Z

TTS/utils/audio/torch_transforms.py

+import torch
+from torch import nn
+
+


I don't know how useful it would be, but maybe it would be nice to have a differentiable equivalent of functions in numpy_transform here when possible.

erogol · 2022-04-19T22:57:18Z

Thx for the review @WeberJulian

Update CI badges

stale · 2022-07-27T17:23:32Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

erogol changed the base branch from main to dev April 19, 2022 07:26

erogol requested review from WeberJulian and Edresson April 19, 2022 07:26

WeberJulian reviewed Apr 19, 2022

View reviewed changes

WeberJulian approved these changes Apr 19, 2022

View reviewed changes

erogol mentioned this pull request Apr 20, 2022

🐸 TTS roadmap #378

Closed

58 tasks

erogol and others added 20 commits May 13, 2022 14:58

Merge pull request #1574 from coqui-ai/update_badge

f237e4c

Update CI badges

Implement ForwardTTSE2Eg

fccda5a

Implement FastPitchE2EConfig

8573148

Implement ForwardTTSE2E tests

2a61b8f

Implement FastPitchE2E LJSpeech recipe

aea8cb7

Implement ForwardTTSE2E Loss

b16613c

Implement BaseTTSE2E

c125024

Refactor multi-speaker init in ForwardTTS

28a53c7

Add cond layer in decoder

775a6ab

Rename vars in VITS

760f045

Fix Vocoder logging

0738cb0

Remove redundancy

9f8d86b

Update import statements

5f9d559

Update fastpitche2e recipe

4556c61

Remove AP from FastPitchE2e

231c69b

Add missing kernel size attr to transformer layer

e7c5db0

Make plot results more general

cc57c20

Refactor ForwardTTS to skip decoder

c3fb49b

Add numpy and torch transforms

6a53b77

Make AP optional in BaseTTS

dbe5eb9

erogol added 12 commits May 17, 2022 13:44

Update ForwardTTSE2eLoss

4171f4e

Refactor TTSDataset to use numpy transforms

0b585b4

Update ForwardTTSe2e tests

edd59c8

Make style

9291d13

Return duration by ForwardTTS inference

96779e7

Remove remaned trainer functions

ce4f962

Implement get_state_dict

b3fb0e1

Fix audio_config handling

a05c82f

Fix dirt

c437db1

Make hifigan discriminator configurable

8e915b7

Fix up

2d29e82

Rename g as spk_emb

8adcd1d

erogol force-pushed the unified_api_forwardtts2 branch from b6073d1 to 8adcd1d Compare May 17, 2022 11:57

WeberJulian force-pushed the dev branch from 74f5c3f to ee99a6c Compare May 20, 2022 13:53

stale bot added the wontfix This will not be worked on but feel free to help. label Jun 19, 2022

stale bot closed this Jun 27, 2022

coqui-ai deleted a comment from stale bot Jun 27, 2022

erogol reopened this Jun 27, 2022

stale bot removed the wontfix This will not be worked on but feel free to help. label Jun 27, 2022

stale bot added the wontfix This will not be worked on but feel free to help. label Jul 27, 2022

stale bot closed this Aug 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ForwardTTSE2E implementations and related API changes #1510

ForwardTTSE2E implementations and related API changes #1510

erogol commented Apr 19, 2022

e0xextazy commented Apr 19, 2022

WeberJulian Apr 19, 2022

WeberJulian Apr 19, 2022

erogol Apr 19, 2022

WeberJulian Apr 19, 2022

erogol Apr 19, 2022

WeberJulian Apr 19, 2022

erogol commented Apr 19, 2022

stale bot commented Jul 27, 2022

ForwardTTSE2E implementations and related API changes #1510

ForwardTTSE2E implementations and related API changes #1510

Conversation

erogol commented Apr 19, 2022

e0xextazy commented Apr 19, 2022

WeberJulian Apr 19, 2022

Choose a reason for hiding this comment

WeberJulian Apr 19, 2022

Choose a reason for hiding this comment

erogol Apr 19, 2022

Choose a reason for hiding this comment

WeberJulian Apr 19, 2022

Choose a reason for hiding this comment

erogol Apr 19, 2022

Choose a reason for hiding this comment

WeberJulian Apr 19, 2022

Choose a reason for hiding this comment

erogol commented Apr 19, 2022

stale bot commented Jul 27, 2022