This project aims to recognize the identity of a speaker from a voice recording. It uses an audio dataset of 37 speakers, each speaking 37 scripts in a variety of ways, such as whispering and speaking fast. A multi-layer neural network was modeled in TensorFlow and achieved an accuracy of 91%.
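As a rough illustration of the kind of model described above, here is a minimal sketch of a 37-way speaker classifier in TensorFlow. It is not the architecture used in this project (see the report for that). The feature size `NUM_FEATURES`, the layer widths, the optimizer, and the helper name `build_speaker_classifier` are all assumptions made for the example, and the input is random data rather than real audio features.

```python
import numpy as np
import tensorflow as tf

NUM_SPEAKERS = 37   # number of speakers in the dataset (from the description)
NUM_FEATURES = 40   # hypothetical: length of one per-recording feature vector


def build_speaker_classifier(num_features=NUM_FEATURES, num_speakers=NUM_SPEAKERS):
    """Build a small dense network that outputs one probability per speaker.

    Layer sizes and optimizer are illustrative assumptions, not the
    project's actual configuration.
    """
    model = tf.keras.Sequential([
        tf.keras.Input(shape=(num_features,)),
        tf.keras.layers.Dense(128, activation="relu"),
        tf.keras.layers.Dense(64, activation="relu"),
        # Softmax turns the final scores into a distribution over speakers.
        tf.keras.layers.Dense(num_speakers, activation="softmax"),
    ])
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",  # labels are integer speaker ids
        metrics=["accuracy"],
    )
    return model


if __name__ == "__main__":
    # Random stand-in for extracted audio features (8 recordings).
    rng = np.random.default_rng(0)
    features = rng.normal(size=(8, NUM_FEATURES)).astype("float32")

    model = build_speaker_classifier()
    probabilities = model.predict(features, verbose=0)

    print(probabilities.shape)           # (8, 37)
    print(probabilities.argmax(axis=1))  # predicted speaker id per recording
```

The model here is untrained, so its predictions are meaningless. In practice it would be fitted with `model.fit(features, speaker_ids, ...)` on features extracted from the recordings, and accuracy would be measured on held-out recordings.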
Refer to the report (PDF or DOCX file) for in-depth details.