A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
Updated Dec 6, 2018 - Python
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
This repository provides a multi-mode, multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building a prosody encoder.
Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)
PyTorch implementation of Tacotron and Tacotron2
Training GST-Tacotron for the Persian language as a multi-speaker Persian speech synthesis system.