Converting text to audio and applying audio augmentation
-
Updated
Oct 28, 2023 - HTML
Converting text to audio and applying audio augmentation
SoundScaper is an audio augmented reality mobile application that allows users to author, save and reload virtual, and spatially interactive, three-dimensional binaural soundscapes within physical, real world spaces.
A Convolutional Neural Network that distinguishes between the speakers emotions. Comes with multiple preprocessors to improve the models performance.
This repository contains the code and methodology used for the BirdCLEF 2024 Kaggle competition, where I achieved a rank of 55th out of 974 participants, earning a bronze medal. The goal of this competition was to build a model that can accurately classify bird sounds.
Implementation of audio, image, and spectrogram augmentation techniques provided by the librosa, Keras and audiomentations
Audio data loading and augmentations in JAX
A ready-to-use pytorch dataloader for audio classification, speech classification, speaker recognition, etc. with in-GPU augmentations
A python library for generating different permutations of audible segments from audio files.
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
Add a description, image, and links to the audio-augmentation topic page so that developers can more easily learn about it.
To associate your repository with the audio-augmentation topic, visit your repo's landing page and select "manage topics."