This is a simple web application that can be used to upload an audio file containing vocals and detect the emotion of the vocals. The uploaded audio file will be chopped into mini clips and analyze each mini clip for the emotion. The accuracy is not very good as this was done within a limited time period and with a limited dataset provided.
The project was completed according to the requirements of a friend of mine. Frontend of the application was provided and I assume it is a template based design taken from somewhere.
Try using .wav audio files when testing because it might give an error if you try to process other types of files due to a ffmpeg issue. There are many more improvements that can be done in this project and is not guranteed to be 100% error free.
- Create a Virtual Environement (Optional)
- Delete the .gitignore file inside the src/SER/Audio_Speech_Actors_01-24
- Delete the .gitignore file inside the src/SER/RAV
- Download the RAR file using https://drive.google.com/file/d/1l3VQngVHnoKAWDopD34uFszILNt74yCJ/view?usp=sharing
- Download the RAR file using https://drive.google.com/file/d/1GX5k0IF5PXmBFILPw0L-7IlDLAlqrW71/view?usp=sharing
- Extract the content inside the root directory of Audio_Speech_Actors_01-24.rar into src/SER/Audio_Speech_Actors_01-24
- Extract the content inside the root directory of RAV.rar into src/SER/RAV
- Install the requirements inside the requirements.in & requirements.txt using PIP (Ex: pip install -r requirements.txt)
- Run the app.py (Ex: Python app.py)
- The webpage can be loaded using the emotion-detect.html file inside src/web/emotion-detection/