This is a Flask web app that allows you to translate audio and video files to an audio output in any languag of your choice using OpenAI's whisper, gpt-3.5-turbo and Eleven labs. It uses the pytube, moviepy, and pydub libraries to download and process the videos, and nltk for tokenizing the text.
6428e6f5e849b34fcd6d5b53_webm.1.mp4
Clone this repository and navigate to the project directory:
git clone https://github.com/<username>/<project-name>.git
cd <project-name>
Create a virtual environment and install the required packages:
python -m venv venv
source venv/bin/activate # on Windows, use "venv\Scripts\activate"
pip install -r requirements.txt
Run the app locally using Flask:
export FLASK_APP=app.py
export FLASK_ENV=development
flask run
Navigate to http://localhost:5000/
in your web browser to access the app.
- Upload audio or video files or links.
- Transcribe audio or video and generate a text output.
- Chuck words of over 3000 tokens.
- Use the OpenAI API to translate.
- Use Elevenlabs API to convert text to audio.
- Flask
- Flask-SocketIO
- Flask-Bootstrap
- PyTube
- moviepy
- pydub
- nltk
- OpenAI API key
- Elevenlabs API key
Contributions to this project are welcome. To contribute, please follow these steps:
- Fork this repository.
- Create a new branch:
git checkout -b my-new-branch
- Make your changes and commit them:
git commit -m "Add some feature"
- Push to the branch:
git push origin my-new-branch
- Create a new pull request.
- Please include a clear description of your changes and their purpose.