This application transcribes and translates audio files using the Whisper library.
- Install PyTorch from pytorch.org/get-started/locally.
- Install ffmpeg based on your operating system.
- Run
pip install -U openai-whisper
to install the Whisper library. - Install Streamlit with
pip install streamlit
.
Note: Refer to Whisper Repository README for detailed installation procedure.
- Upload an audio file in one of the supported formats: mp3, mp4, mpeg, mpga, m4a, wav, webm, flac.
- Click on "Transcribe Audio" to transcribe the uploaded audio file.
- Click on "Translate Audio" to translate the transcribed text to English.
- The application displays the detected language, transcribed or translated text, and text with timestamps.
- Optionally, specify the desired model type in the
app.py
script to download and use a model of your choice.
- The default model for this app is the small Multilingual model. It is downloaded the first time the script is run.
- Users can specify the model type in the
app.py
script to download and use a model of their choice.