Skip to content

xyxCalvin/awesome-speaker-diarization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 

Repository files navigation

awesome-speaker-diarization

Content

Publications

Reviews

Audio-only Speaker Diarization

2021

2020

2019

2018

2017

Multimodal Speaker Diarization

2020

2019

2018

Other Audio-Visual Related Work

2020

2019

2018

2017

Datasets

Audio-Visual Datasets

  • Spot the conversation: speaker diarisation in the wild (Chung J S, Huh J, Nagrani A, et al.(VGG)) A free speaker diarization dataset.(Large dataset with overlapping speeches and background noise)

  • VoxConverse VoxConverse is an audio-visual diarisation dataset consisting of over 50 hours of multispeaker clips of human speech, extracted from YouTube videos.

Tutorials

Books

Acknowledgement

Quan Wang's repo inspires us a lot. Many thanks!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published