audio-augmentation

hperer02 / Bird-sound-classification

This repository contains the code and methodology used for the BirdCLEF 2024 Kaggle competition, where I achieved a rank of 55th out of 974 participants, earning a bronze medal. The goal of this competition was to build a model that can accurately classify bird sounds.

pytorch librosa audio-processing torchaudio mel-spectrogram audio-augmentation efficientnet

Updated Jun 20, 2024
Jupyter Notebook

Lallapallooza / fast-audiomentations

⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.

audio python machine-learning gpu dsp pytorch triton data-augmentation audio-effects audio-augmentation augmentations audio-data-augmentation

Updated Jan 19, 2024
Python

imane-ayouni / Text-to-Speech-using-Tacotron2

Converting text to audio and applying audio augmentation

text-to-speech audio-data audio-augmentation tacotron2

Updated Oct 28, 2023
HTML

zabir-nabil / torch-speech-dataloader

A ready-to-use PyTorch dataloader for audio classification, speech classification, speaker recognition, and similar tasks, with in-GPU augmentations

speech torch audio-augmentation torch-dataloader pytorch-speech-dataloader gpu-augmentation speech-augmentation-gpu

Updated Nov 6, 2022
Python

KentoNishi / torch-time-stretch

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

torch pytorch sound-processing augmentation gpu-support torchaudio time-stretch audio-augmentation

Updated Sep 5, 2022
Python

zabir-nabil / audioperm

A Python library for generating different permutations of audible segments from audio files.

audio-classification speaker-recognition audio-processing augmentation speech-augmentation audio-augmentation

Updated Jun 13, 2022
Jupyter Notebook

lucas-fpaiva / survey-audio-aug

Implementation of audio, image, and spectrogram augmentation techniques provided by librosa, Keras, and audiomentations

music-information-retrieval automatic-speech-recognition data-augmentation audio-augmentation environmental-sound-classification

Updated May 24, 2022
Jupyter Notebook

AndreasScharnetzki / EmotionClassifier

A convolutional neural network that distinguishes between a speaker's emotions. Comes with multiple preprocessors to improve the model's performance.

natural-language-processing supervised-learning convolutional-neural-networks transfer-learning preprocessing human-computer-interaction audio-processing multi-class-classification audio-augmentation variable-length-data speech-emotion-classification

Updated Jan 20, 2022
Python

laurencecliffe / SoundScaper

SoundScaper is an audio augmented reality mobile application that lets users author, save, and reload virtual, spatially interactive, three-dimensional binaural soundscapes within physical, real-world spaces.

augmented-reality mobile-app soundscapes augmented-reality-applications audio-augmentation

Updated Jan 1, 2021

zhaoyi2 / audio_augment

A tool/script for batch speech data augmentation with speed/volume/RIRS/MUSAN

speed optional volume musan audio-augmentation rirs

Updated Jun 28, 2020
Shell

Here are 13 public repositories matching this topic...

DBraun / audiotree

KentoNishi / torch-pitch-shift

AgaMiko / data-augmentation-review

hperer02 / Bird-sound-classification

Lallapallooza / fast-audiomentations

imane-ayouni / Text-to-Speech-using-Tacotron2

zabir-nabil / torch-speech-dataloader

KentoNishi / torch-time-stretch

zabir-nabil / audioperm

lucas-fpaiva / survey-audio-aug

AndreasScharnetzki / EmotionClassifier

laurencecliffe / SoundScaper

zhaoyi2 / audio_augment
