audio-datasets

Here are 7 public repositories matching this topic...

ynop / audiomate

Python library for handling audio datasets.

audio music speech speech-recognition dataset-filtering noise dataset-creation dataset-manager corpus-tools data-loader audio-datasets

Updated Jul 6, 2023
Python

Audio-WestlakeU / RealMAN

Star

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

multi-channel speech-enhancement microphone-array-processing doa-estimation audio-datasets sound-source-localization microphone-audio-capture real-world-datasets

Updated Apr 29, 2025
Python

Audio-WestlakeU / audiossl

Star

A library built for easier audio self-supervised training, downstream tasks evaluation

pytorch audio-classification audioset nsynth speech-commands audio-datasets self-supervised-learning voxceleb1 urbansound8k pytorch-lightning audio-representation audio-self-supervised-learning audio-pretraining

Updated Aug 27, 2024
Python

MorenoLaQuatra / audioset-download

Star

This package aims at simplifying the download of the AudioSet dataset.

downloader audioset audio-datasets audioset-download

Updated Jul 17, 2025
Python

silenterus / deepspeech-cleaner

Star

Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework

machine-learning mozilla speech-recognition dataset-filtering dataset-creation dataset-manager multilanguage corpus-tools deepspeech audio-datasets

Updated May 22, 2023
Python

A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code

python data-science data machine-learning deep-learning data-collection dataset-generation text-datasets audio-datasets scarper image-data-generator

Updated Nov 19, 2023
Python

freds0 / katube

Star

KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube will generate dataset with audios and texts.

audio-datasets

Updated Jul 27, 2024
Python

Improve this page

Add a description, image, and links to the audio-datasets topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the audio-datasets topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

audio-datasets

Here are 7 public repositories matching this topic...

ynop / audiomate

Audio-WestlakeU / RealMAN

Audio-WestlakeU / audiossl

MorenoLaQuatra / audioset-download

silenterus / deepspeech-cleaner

nuhmanpk / Webtrench

freds0 / katube

Improve this page

Add this topic to your repo