Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
roatienza authored May 17, 2023
1 parent ad56cef commit cdc0682
Showing 1 changed file with 26 additions and 5 deletions.
31 changes: 26 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -33,10 +33,10 @@ python3 demo.py --checkpoint https://github.com/roatienza/efficientspeech/releas
--infer-device cpu --text "the quick brown fox jumps over the lazy dog" --wav-filename fox.wav
```

Output file is under `wav_outputs`. Play the wav file:
Output file is under `outputs`. Play the wav file:

```
ffplay wav_outputs/fox.wav-1.wav
ffplay outputs/fox.wav
```

After downloading the weights, it can be reused:
Expand All @@ -50,7 +50,7 @@ python3 demo.py --checkpoint tiny_eng_266k.ckpt --infer-device cpu \
Playback:

```
ffplay wav_outputs/color.wav-1.wav
ffplay outputs/color.wav
```

**Small ES**
Expand All @@ -65,7 +65,7 @@ python3 demo.py --checkpoint https://github.com/roatienza/efficientspeech/releas
Playback:

```
ffplay wav_outputs/color-small.wav-1.wav
ffplay outputs/color-small.wav
```


Expand All @@ -80,7 +80,7 @@ python3 demo.py --checkpoint https://github.com/roatienza/efficientspeech/relea
Playback:

```
ffplay wav_outputs/bees-base.wav-1.wav
ffplay outputs/bees-base.wav
```

**GPU** for Inference
Expand All @@ -100,6 +100,27 @@ python3 demo.py --checkpoint https://github.com/roatienza/efficientspeech/releas
pip3 install torch==1.12 torchvision==0.13
```

### ONNX

Only supports fixed input phoneme length. Padding or truncation is applied if needed. Modify using `--onnx-insize=<desired valu>`.

```
python3 convert.py --checkpoint tiny_eng_266k.ckpt --onnx tiny_eng_266k.onnx
```

Might need to downgrade `protobuf` if an error occurs.

```
pip install protobuf==3.20
```

Usage:

```
python3 demo.py --checkpoint tiny_eng_266k.onnx --infer-device cpu \
--text "the primary colors are red, green, and blue." --wav-filename primary.wav
```

### Train

**Data Preparation**
Expand Down

0 comments on commit cdc0682

Please sign in to comment.