Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ForwardTTSE2E implementations and related API changes #1510

Closed
wants to merge 32 commits into from

Conversation

erogol
Copy link
Member

@erogol erogol commented Apr 19, 2022

No description provided.

@erogol erogol changed the base branch from main to dev April 19, 2022 07:26
@erogol erogol requested review from WeberJulian and Edresson April 19, 2022 07:26
@e0xextazy
Copy link

Do I understand correctly that these changes were made for the only VITS model (YourTTS) from the repository? Since it is the only End2End you have?

@@ -191,6 +191,9 @@ def __init__(
):
super().__init__()

if c_in_channels and c_in_channels != 0:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

c_in_channels != 0 is redundant since the first clause of the and would be evaluated to False if c_in_channels == 0

@@ -225,6 +228,9 @@ def forward(self, x, x_mask, g=None): # pylint: disable=unused-argument
x_mask: [B, 1, T]
g: [B, C_g, 1]
"""
# TODO: implement multi-speaker
o = self.decoder(x, x_mask, g)
# multi-speaker conditioning
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

won't that break the vctk/fast_pitch released model?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure. Even if it does, I think this is the right way to go, similar to the VITS model.

return outputs, loss_dict

if optimizer_idx == 1:
mel = batch["mel_input"].transpose(1, 2)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe implementing steps_to_start_discriminator would allow for faster training

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good idea! I'll try.

import torch
from torch import nn


Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't know how useful it would be, but maybe it would be nice to have a differentiable equivalent of functions in numpy_transform here when possible.

@erogol
Copy link
Member Author

erogol commented Apr 19, 2022

Thx for the review @WeberJulian

@erogol erogol mentioned this pull request Apr 20, 2022
58 tasks
@erogol erogol force-pushed the unified_api_forwardtts2 branch from b6073d1 to 8adcd1d Compare May 17, 2022 11:57
@stale stale bot added the wontfix This will not be worked on but feel free to help. label Jun 19, 2022
@stale stale bot closed this Jun 27, 2022
@coqui-ai coqui-ai deleted a comment from stale bot Jun 27, 2022
@erogol erogol reopened this Jun 27, 2022
@stale stale bot removed the wontfix This will not be worked on but feel free to help. label Jun 27, 2022
@stale
Copy link

stale bot commented Jul 27, 2022

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

@stale stale bot added the wontfix This will not be worked on but feel free to help. label Jul 27, 2022
@stale stale bot closed this Aug 3, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
wontfix This will not be worked on but feel free to help.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants