Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Updated Apr 24, 2024 - Python
🔊😊 A FastAPI voice-assistant framework for quickly prototyping LLM-powered voice assistants in under 5 minutes.
Real-time audio-to-audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you like.
Fine-tune SpeechT5 for non-English text-to-speech tasks, implemented in PyTorch.
This repository contains code related to AI-generated audio, such as voice cloning, text-to-speech, audio classification, and speech recognition.
Different Task Guides for Audio Data