Skip to content

Latest commit

 

History

History
55 lines (33 loc) · 1.35 KB

Installation.md

File metadata and controls

55 lines (33 loc) · 1.35 KB

Installation

Notes on how to setup dependencies; In general, create a new env with conda:

conda env create --file=environments-cp311.yml

Python 3.11 is the only version that works with all dependencies at this point.

Some dependencies need manual setup:

Download openwakeword models

One time, in Python repl:

import openwakeword
openwakeword.utils.download_models()

pytorch with CUDA

Needs a CUDA capabale NVidia GPU to run fast whisper.

conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia

Flash attention

https://pypi.org/project/flash-attn/

Notes:

tesseract

Install tesseract with language data. (Windows: https://github.com/UB-Mannheim/tesseract/wiki). If installed to non-default location adjust tessdata_dir in config.

Fast-SSIM

external/fast_ssim requires the Microsoft Visual C++ Redistributable packages for Visual Studio 2013.

https://learn.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=msvc-170

Realtime-TTS

Install with your preferred engine, for example realtimetts[system,coqui]

Requires C++ redistributable