Real-Time Voice Translator is a machine learning project that aims to provide a seamless and natural experience of cross-lingual communication. It uses deep neural networks to translate voice from one language to another in real time while preserving the tone and emotion of the speaker. It is a desktop application that supports Windows, Linux, and Mac operating systems.
The application is easy to use: simply select the languages you want to translate between and start speaking. The application will listen to your voice and provide instant translations in real-time. You can also use the application to translate conversations between two or more people.
<=Python3.11, gTTS, pyaudio, playsound==1.2.2, deep-translator, SpeechRecognition, google-transliteration-api, cx-Freeze
-
Clone this project and create virtualenv (recommended) and activate virtualenv.
# Create virtualenv python -m venv env # Linux/MacOS source env/bin/activate # Windows env\Scripts\activate
-
Install require dependencies.
pip install --upgrade wheel pip install -r requirements.txt
-
Run code and speech (have fun).
python main.py
I am using cx_Freeze to build executable file of this app. The build settings can be changed by modifying the setup.py file.
- Windows:
python setup.py bdist_msi
- Linux:
python setup.py bdist_rpm
- Mac:
python setup.py bdist_mac