ulmo

Large acoustic models

Things to investigate

- Alternatives to K-means and better tokenization
- Self-supervised pretext tasks such as next-frame prediction, infilling, and ABX (STFT, mel spectrogram, CQT, CWT, DWT, WPT choice tasks)
- Long context to discover repeated impulses
- Adding an token
- Try a denoising autoencoder in place of K-means to create labels
- Data augmentation using sets of spectrogram parameters (e.g. vary the sample rate by ±10%)
- Self-supervision methods such as SimCLR, wav2vec 2.0, and BYOL
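
As a starting point for the data augmentation idea above, here is a minimal NumPy-only sketch. It is not code from this repo. The helper names (`resample_linear`, `magnitude_stft`, `augmented_views`) and the `SPEC_PARAMS` values are hypothetical, and the sample-rate variation is approximated by resampling the waveform with linear interpolation, which shifts speed and pitch by roughly the same factor. A real pipeline would likely use a proper resampler and mel or CQT front ends.

```python
import numpy as np


def resample_linear(wave, factor):
    """Resample by linear interpolation; factor > 1 yields more samples."""
    n_out = max(2, int(round(len(wave) * factor)))
    old_t = np.linspace(0.0, 1.0, num=len(wave))
    new_t = np.linspace(0.0, 1.0, num=n_out)
    return np.interp(new_t, old_t, wave)


def magnitude_stft(wave, n_fft, hop):
    """Magnitude STFT with a Hann window; shape is (frames, n_fft // 2 + 1)."""
    if len(wave) < n_fft:
        # Zero-pad short inputs so that at least one frame exists.
        wave = np.pad(wave, (0, n_fft - len(wave)))
    n_frames = 1 + (len(wave) - n_fft) // hop
    window = np.hanning(n_fft)
    frames = np.stack(
        [wave[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1))


# Hypothetical (n_fft, hop) pairs; these values are illustrative assumptions.
SPEC_PARAMS = [(256, 64), (512, 128), (1024, 256)]


def augmented_views(wave, n_views, rng, max_sr_shift=0.10):
    """Draw a random rate factor in [1 - shift, 1 + shift] and a random
    (n_fft, hop) pair for each view, then compute its spectrogram."""
    views = []
    for _ in range(n_views):
        factor = rng.uniform(1.0 - max_sr_shift, 1.0 + max_sr_shift)
        n_fft, hop = SPEC_PARAMS[rng.integers(len(SPEC_PARAMS))]
        spec = magnitude_stft(resample_linear(wave, factor), n_fft, hop)
        views.append({"factor": factor, "n_fft": n_fft, "hop": hop, "spec": spec})
    return views


# Usage: four augmented views of a one-second 440 Hz tone at 16 kHz.
rng = np.random.default_rng(0)
wave = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)
views = augmented_views(wave, n_views=4, rng=rng)
```

Each view records the parameters it was drawn with, so pairs of views of the same clip could serve as positives for a contrastive objective such as SimCLR or BYOL.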
