Accelerated train using deepspeed and use an enlarged CommonVoice dataset #1

bl4dylion4ik · 2023-07-14T10:29:51Z

Add train script that uses deepspeed.
Add deepspeed config
Make script for process raw CommonVoice dataset which is designed to increase train dataset
Edit run_speech_recognition_seq2seq_streaming.py to use dataset from disk

bl4dylion4ik added 6 commits July 14, 2023 13:15

Add script for prepare raw CV dataset

36a3eaa

Add test function

3974704

Add train script by using deepspeed

5b07930

Init deepspeed config

2848c5b

Add option '--from_disk', that load dataset to train from disk

2300b8e

add deepspeed and accelerate library

38ab6a2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accelerated train using deepspeed and use an enlarged CommonVoice dataset #1

Accelerated train using deepspeed and use an enlarged CommonVoice dataset #1

bl4dylion4ik commented Jul 14, 2023

Accelerated train using deepspeed and use an enlarged CommonVoice dataset #1

Are you sure you want to change the base?

Accelerated train using deepspeed and use an enlarged CommonVoice dataset #1

Conversation

bl4dylion4ik commented Jul 14, 2023