🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
Updated
Aug 16, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
Ultrafast GAN based Vocoder for Text to Speech
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.
zero-shot realtime TTS system, fully offline, free and open source
MelGAN Multi GPU Implementation.
MelGAN with catalyst framework
Catalan Text to Speech
Unofficial implementation of Multi-band MelGAN
A neural network (GAN) trained to apply metal screaming effects, turning vocals from songs, speeches or whispers into realistic screams and growls.
Add a description, image, and links to the melgan topic page so that developers can more easily learn about it.
To associate your repository with the melgan topic, visit your repo's landing page and select "manage topics."