Notes on how to setup dependencies; In general, create a new env with conda:
conda env create --file=environments-cp311.yml
Python 3.11 is the only version that works with all dependencies at this point.
Some dependencies need manual setup:
One time, in Python repl:
import openwakeword
openwakeword.utils.download_models()
Needs a CUDA capabale NVidia GPU to run fast whisper.
conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
https://pypi.org/project/flash-attn/
Notes:
- Get C++ build tools: https://visualstudio.microsoft.com/visual-cpp-build-tools/
- MSVC C++ 2022 build tools latest
- Windows 11 SDK
- Get ninja:
pip install ninja
- Set MAX_JOBS=4 if less than 100Gb of RAM
Install tesseract with language data. (Windows: https://github.com/UB-Mannheim/tesseract/wiki). If installed to non-default location adjust tessdata_dir in config.
external/fast_ssim requires the Microsoft Visual C++ Redistributable packages for Visual Studio 2013.
https://learn.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=msvc-170
Install with your preferred engine, for example realtimetts[system,coqui]
Requires C++ redistributable