This project develops an emotion recognition system for Urdu speech. The pipeline begins by organizing the audio dataset and extracting acoustic features such as MFCCs, chroma, and zero-crossing rate. Data augmentation techniques, including time stretching and pitch shifting, expand the training data. Deep learning models, namely a CNN, an LSTM, and a hybrid CNN-LSTM, are trained and evaluated on the extracted features, with performance analyzed through accuracy, confusion matrices, and classification reports.
URDU-Dataset: https://github.com/siddiquelatif/URDU-Dataset
Audio emotion classification
Multiple neural network models: CNN, LSTM, CNN-LSTM
93.75% peak accuracy
Python | TensorFlow | Librosa | scikit-learn | Pandas | NumPy | Matplotlib | Seaborn