[Bug] KeyError with Tortoise and custom speaker directory #2745

tazz4843 · 2023-07-07T02:36:25Z

Describe the bug

The docs do not properly explain how to run Tortoise with a custom voice directory. Following them immediately throws a KeyError.

To Reproduce

Download the following file and unzip it somewhere (contains sample files):
jfk.zip
Run the following command or execute the following code (both produce the same result), replacing the voice directory with your own as required.

tts --model_name tts_models/en/multi-dataset/tortoise-v2 \
  --text "This is an example." \
  --out_path "output.wav" \
  --voice_dir /home/zero/data/audio-samples/ \
  --speaker_idx "jfk" \
  --progress_bar True

from TTS.api import TTS

# Load the model
tortoise = TTS("tts_models/en/multi-dataset/tortoise-v2")
print(" - Loaded `tortoise`")

speaker = "jfk"
src_dir = "/home/zero/data/audio-samples/"
output_base = "/home/zero/data/audio-samples/{}/quick-brown-fox-{}.wav"
text_sample = "The quick brown fox jumps over the lazy dog."

tortoise.tts_to_file(text=text_sample,
                     file_path=output_base.format(speaker, "tortoise-ultra_fast"),
                     voice_dir=src_dir,
                     speaker=speaker,
                     preset="ultra_fast")

Expected behavior

No KeyError exception.

Logs

zero@zero-desktop ~/PycharmProjects/scripty-tts-server > tts --model_name  tts_models/en/multi-dataset/tortoise-v2 \
                                                             --text "This is an example." \
                                                             --out_path "output.wav" \
                                                             --voice_dir /home/zero/data/audio-samples/ \
                                                             --speaker_idx "jfk" \
                                                             --progress_bar True
 > tts_models/en/multi-dataset/tortoise-v2 is already downloaded.
 > Model's license - apache 2.0
 > Check https://choosealicense.com/licenses/apache-2.0/ for more info.
 > Using model: tortoise
 > Text: This is an example.
 > Text splitted to sentences.
['This is an example.']
Traceback (most recent call last):
  File "/home/zero/PycharmProjects/scripty-tts-server/venv/bin/tts", line 8, in <module>
    sys.exit(main())
             ^^^^^^
  File "/home/zero/PycharmProjects/scripty-tts-server/venv/lib/python3.11/site-packages/TTS/bin/synthesize.py", line 447, in main
    wav = synthesizer.tts(args.text, speaker_name=args.speaker_idx)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/zero/PycharmProjects/scripty-tts-server/venv/lib/python3.11/site-packages/TTS/utils/synthesizer.py", line 365, in tts
    outputs = self.tts_model.synthesize(
              ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/zero/PycharmProjects/scripty-tts-server/venv/lib/python3.11/site-packages/TTS/tts/models/tortoise.py", line 520, in synthesize
    voice_samples, conditioning_latents = load_voice(speaker_id)
                                          ^^^^^^^^^^^^^^^^^^^^^^
  File "/home/zero/PycharmProjects/scripty-tts-server/venv/lib/python3.11/site-packages/TTS/tts/layers/tortoise/audio_utils.py", line 122, in load_voice
    paths = voices[voice]
            ~~~~~~^^^^^^^
KeyError: 'jfk'
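
One possible reading of the traceback above: `load_voice(speaker_id)` is called with only the speaker name, so the lookup table `voices` may never include the custom `--voice_dir`. This is an assumption, not something confirmed in this thread. The sketch below reproduces that failure pattern with hypothetical stand-in functions (`scan_voices` and `lookup_voice` are illustrative names, not the TTS library's code) and a temporary directory in place of the real sample files.

```python
import os
import tempfile


def scan_voices(extra_voice_dirs=()):
    """Hypothetical stand-in: map each subdirectory name to the audio files in it."""
    voices = {}
    for root in extra_voice_dirs:
        for name in sorted(os.listdir(root)):
            sub = os.path.join(root, name)
            if os.path.isdir(sub):
                voices[name] = [
                    os.path.join(sub, f)
                    for f in sorted(os.listdir(sub))
                    if f.endswith((".wav", ".mp3"))  # assumed extensions
                ]
    return voices


def lookup_voice(voice, extra_voice_dirs=()):
    """Hypothetical stand-in for the lookup that fails in the traceback."""
    voices = scan_voices(extra_voice_dirs)
    return voices[voice]  # KeyError if the speaker's directory was never scanned


with tempfile.TemporaryDirectory() as voice_dir:
    # Build <voice_dir>/jfk/1.wav as an empty placeholder file.
    os.makedirs(os.path.join(voice_dir, "jfk"))
    open(os.path.join(voice_dir, "jfk", "1.wav"), "wb").close()

    # Custom directory not forwarded: the speaker is unknown.
    try:
        lookup_voice("jfk")
        lookup_failed = False
    except KeyError:
        lookup_failed = True

    # Custom directory forwarded: the speaker is found.
    found = lookup_voice("jfk", [voice_dir])
```

If this reading is right, the speaker folder can exist on disk and still produce `KeyError: 'jfk'`, because the failure depends on which directories get scanned and not on the folder's contents.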

Environment

{
    "CUDA": {
        "GPU": [],
        "available": false,
        "version": null
    },
    "Packages": {
        "PyTorch_debug": false,
        "PyTorch_version": "2.0.1+cpu",
        "TTS": "0.15.5",
        "numpy": "1.24.1"
    },
    "System": {
        "OS": "Linux",
        "architecture": [
            "64bit",
            "ELF"
        ],
        "processor": "",
        "python": "3.11.3",
        "version": "#1 SMP PREEMPT_DYNAMIC Sat, 01 Jul 2023 16:17:21 +0000"
    }
}

Additional context

No response


manmay-nakhashi · 2023-07-07T02:56:09Z

@tazz4843 does your voice dir have a folder called jfk ?

tazz4843 · 2023-07-07T03:00:34Z

It does
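
For anyone else checking the same thing, the layout question above can be verified with a short script. This is a minimal sketch, assuming the expected layout is `<voice_dir>/<speaker>/` containing audio clips. The helper name `list_speaker_clips` and the accepted extensions are assumptions for illustration, and the demo uses a temporary directory so it can run anywhere.

```python
import tempfile
from pathlib import Path


def list_speaker_clips(voice_dir, speaker, extensions=(".wav", ".mp3")):
    """Return the sorted clip file names in <voice_dir>/<speaker>, or [] if the folder is missing."""
    speaker_dir = Path(voice_dir) / speaker
    if not speaker_dir.is_dir():
        return []
    return sorted(
        p.name
        for p in speaker_dir.iterdir()
        if p.is_file() and p.suffix.lower() in extensions
    )


with tempfile.TemporaryDirectory() as tmp:
    # Placeholder layout: <tmp>/jfk/1.wav and <tmp>/jfk/notes.txt
    (Path(tmp) / "jfk").mkdir()
    (Path(tmp) / "jfk" / "1.wav").touch()
    (Path(tmp) / "jfk" / "notes.txt").touch()

    clips_present = list_speaker_clips(tmp, "jfk")
    clips_missing = list_speaker_clips(tmp, "nixon")
```

An empty list means either the speaker folder does not exist or it holds no files with the assumed extensions. In this issue the folder did exist, so the check would pass and the KeyError would have to come from somewhere else.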

erogol · 2023-07-07T10:22:46Z

@tazz4843 can you try #2748

tazz4843 · 2023-07-07T17:09:15Z

Seems to solve this issue, but I get a different, unrelated error now. I'll open a new issue for it.

fyunusa · 2024-01-31T21:29:34Z

Hi, I've been trying to locate the fix, but I can't figure out what the proposed solution is.
Can you please point me to it? I'm having the same issue here.
@erogol

tazz4843 added the "bug" label (Something isn't working) Jul 7, 2023

erogol added a commit that referenced this issue Jul 7, 2023

Fix #2745

2e69e94

This was referenced Jul 7, 2023

Fix #2745 #2748

Merged

[Bug] RuntimeError: Attempting to deserialize object on a CUDA device but torch.cuda.is_available() is False. #2749

Closed

erogol closed this as completed Jul 7, 2023

erogol added a commit that referenced this issue Jul 7, 2023

Fix #2745 (#2748)

a2984fb

FeatureSpitter mentioned this issue Jul 19, 2023

[Bug] Bark examples not working out of the box? #2781

Closed

Tindell pushed a commit to pugtech-co/TTS that referenced this issue Sep 4, 2023

Fix coqui-ai#2745 (coqui-ai#2748)

6ae29ed
