A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including the official PyTorch implementations of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Updated Apr 2, 2023 - Python
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Singing Voice Synthesis based on VITS, different from VISinger
Multispeaker Community Vocoder Model for DiffSinger
A Python GUI toolkit for creating/editing Aesthetic YAML dictionaries for OpenUtau
🎼🎵 Expressive | An expression-parameter import tool for DiffSinger singers in OpenUtau. It extracts expressions from a real singer's vocals and imports them into the corresponding tracks of the project. Migrate expressions from real singers to DiffSingers
Converts a UTAU voicebank into a configuration compatible with the DiffSinger dataset format
A fork of genon2nnsvs with modifications made for English speakers and DiffSinger users
A Streamlit-based web application that converts Japanese text (romaji/hiragana/katakana) into MIDI files with customizable parameters. Suited to UTAU/DiffSinger and other voice synthesis development workflows.
Singing Voice Synthesis via Shallow Diffusion Mechanism: explore phoneme-mapped cross-lingual transfer learning using minimal target language data (English to German)