Real-Time-Speech-Recognition

Overview

This project includes a system that can record live speech using the microphone and then transcribe it using speech recognition.
This can be used to automatically record and transcribe meetings, lectures, and other events.
This repository contains all the codes and resources of this project.

Steps

Creating Jupyter widgets to record audio and stop recording
Using pyaudio to record microphone audio
Creating a speech recognition system using vosk
Adding punctuation to the text transcript using recasepunc

Code

You can find the code for this project here.

microphone.ipynb.

Technologies/Tools

Jupyter Notebook / JupyterLab
Python 3.10.12
Pytorch pip install torch -f https://download.pytorch.org/whl/torch_stable.html
Python packages
- vosk pip install vosk
- pydub pip install pydub
- transformers pip install transformers
- pyaudio pip install pyaudio
- ipywidgets pip install ipywidgets

Installation Guidelines

Vosk

You need to download a model file to run vosk properly. This automatically downloads when you run this code:

from vosk import Model
Model(model_name="vosk-model-small-en-us-0.15")

The full vosk model is large (1GB+). If you want to use it, just specify vosk-model-en-us-0.22 as the model name.

If the models don't automatically download, you can find them here.

Punctuation

By default, vosk outputs text with no punctuation. To add in punctuation, we need a different model. To get this, follow these steps:

Download the model here - caution: 1GB+ in size.
Extract the zip file into the same directory as your code.

Pyaudio

Pyaudio can be a little tricky to install, since it depends on system packages. Check the homepage for specific instructions for each OS.

You also want to figure out the right device to record from. Run this code to find the index of your microphone:

# Find audio device index
import pyaudio
p = pyaudio.PyAudio()
for i in range(p.get_device_count()):
    print(p.get_device_info_by_index(i))

p.terminate()

Data

All audio will come from the microphone.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
microphone.ipynb		microphone.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-Time-Speech-Recognition

Overview

Steps

Code

Technologies/Tools

Installation Guidelines

Vosk

Punctuation

Pyaudio

Data

About

Releases

Packages

Languages

LasithaAmarasinghe/Real-Time-Speech-Recognition

Folders and files

Latest commit

History

Repository files navigation

Real-Time-Speech-Recognition

Overview

Steps

Code

Technologies/Tools

Installation Guidelines

Vosk

Punctuation

Pyaudio

Data

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages