Project: Recognizing Spoken Arabic Digits
Uses signal-processing and statistical methods, including Mel-frequency cepstral coefficients (MFCCs), K-Means clustering, and Gaussian Mixture Models, to classify audio recordings of spoken Arabic digits from male and female speakers.
Results: concatenating a normalized time variable to each frame (0 indicating a frame at the start of the audio sample, 1 indicating a frame at the end) produces a model whose accuracy is 5 percentage points higher than all other configurations tested (90% -> 95%).
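A minimal sketch of this idea, assuming each utterance is a 2-D array of MFCC frames; the function name and frame dimensions below are illustrative, not the code in GMMPredictTime.py:

```python
import numpy as np

def add_time_feature(frames: np.ndarray) -> np.ndarray:
    """Append a normalized time value (0.0 for the first frame,
    1.0 for the last frame) as an extra column on each MFCC frame."""
    n_frames = frames.shape[0]
    t = np.linspace(0.0, 1.0, n_frames).reshape(-1, 1)
    return np.hstack([frames, t])

# Example: a 40-frame utterance with 13 MFCCs per frame becomes 40 x 14.
mfcc_frames = np.random.rand(40, 13)
print(add_time_feature(mfcc_frames).shape)  # (40, 14)
```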
Please see GMMPredictTime.py and confusionmatrixTimeGMM.png (time-aware Gaussian Mixture Model classification) for the training and testing procedure and results of the highest-accuracy model.
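For orientation, a hedged sketch of GMM-based digit classification: fit one mixture per digit on its pooled (time-augmented) frames, then label a test utterance by the class whose model gives the highest total log-likelihood. The function names, component count, and covariance type below are assumptions for illustration, not necessarily what GMMPredictTime.py uses:

```python
from sklearn.mixture import GaussianMixture

def train_gmms(frames_by_digit, n_components=16):
    """frames_by_digit: dict mapping digit label -> (n_frames, n_features) array."""
    gmms = {}
    for digit, frames in frames_by_digit.items():
        gmm = GaussianMixture(n_components=n_components,
                              covariance_type="diag", random_state=0)
        gmm.fit(frames)
        gmms[digit] = gmm
    return gmms

def predict_digit(gmms, utterance_frames):
    """Score one utterance against every class model; return the best label."""
    scores = {digit: gmm.score_samples(utterance_frames).sum()
              for digit, gmm in gmms.items()}
    return max(scores, key=scores.get)
```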
Credit to the UCI Machine Learning Repository (https://archive.ics.uci.edu/dataset/195/spoken+arabic+digit) for the dataset.
A more detailed description is in progress.