Deep-Learning-Speaker Classification

Project Description

This project explores speech recognition, covering both speaker diarization and other speech recognition applications.

The first objective will be to implement Speaker Classification using an SVM.
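As a rough illustration of the SVM approach, the sketch below trains a binary linear SVM with Pegasos-style stochastic subgradient descent on hinge loss. It is not the project's implementation. The helper names (`make_utterances`, `train_linear_svm`, `predict`), the 13-dimensional synthetic feature vectors, and all hyperparameters are assumptions chosen for illustration. In practice the features would more likely be extracted from audio (for example, averaged MFCCs) and the classifier would likely come from an established library.

```python
import random


def make_utterances(center, n, dim, rng, noise=0.3):
    """Synthetic stand-ins for per-utterance feature vectors (hypothetical data)."""
    return [[rng.gauss(center, noise) for _ in range(dim)] for _ in range(n)]


def train_linear_svm(features, labels, lam=0.01, epochs=20, seed=0):
    """Binary linear SVM via Pegasos-style subgradient descent; labels are +1 or -1."""
    rng = random.Random(seed)
    # Append a constant 1.0 so the bias is learned as an extra weight.
    # Note: this means the bias is regularized along with the other weights.
    rows = [x + [1.0] for x in features]
    w = [0.0] * len(rows[0])
    order = list(range(len(rows)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            x, y = rows[i], labels[i]
            margin = y * sum(wj * xj for wj, xj in zip(w, x))
            # L2 regularization shrinks the weights on every step.
            w = [(1.0 - eta * lam) * wj for wj in w]
            # Hinge-loss subgradient: only margin violators move the weights.
            if margin < 1.0:
                w = [wj + eta * y * xj for wj, xj in zip(w, x)]
    return w


def predict(w, x):
    """Return +1 (speaker A) or -1 (speaker B) for one feature vector."""
    score = sum(wj * xj for wj, xj in zip(w, x + [1.0]))
    return 1 if score >= 0 else -1


rng = random.Random(42)
DIM = 13  # assumed feature size, chosen only for this example

# Two synthetic "speakers" whose features cluster around different centers.
train_x = make_utterances(2.0, 40, DIM, rng) + make_utterances(-2.0, 40, DIM, rng)
train_y = [1] * 40 + [-1] * 40
test_x = make_utterances(2.0, 20, DIM, rng) + make_utterances(-2.0, 20, DIM, rng)
test_y = [1] * 20 + [-1] * 20

weights = train_linear_svm(train_x, train_y)
accuracy = sum(predict(weights, x) == y for x, y in zip(test_x, test_y)) / len(test_y)
print(f"held-out accuracy: {accuracy:.2f}")
```

The synthetic clusters are well separated, so this only demonstrates the mechanics. Real speaker data would need multi-class handling (for instance one-vs-rest) and probably a non-linear kernel.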

Current Data Set

The data used for this project can be found at http://www.openslr.org/12/. OpenSLR is an open-source project dedicated to hosting speech and language resources, with the aim of supporting progress in speech recognition.

Previous Work Done

Pannous is a project working on implementing speech recognition in Google's TensorFlow.

GitHub link: https://github.com/pannous/tensorflow-speech-recognition/

For a more in-depth walkthrough of how Pannous approaches the speaker classification problem, see Pannous-Walkthrough.md.

Installation Requirements

- Librosa

- Pydub

- TFLearn

AKBoles/Deep-Learning-Speech-Recognition
