Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].
-
Updated
Nov 2, 2024 - Python
Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].
The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.
DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging (PyTorch implementation)
Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)
Generating Diverse Audio-Visual 360º Soundscapes for Sound Event Localization and Detection
Latent Acoustic Mapping (LAM) for Direction of Arrival Estimation
A cost effective, wearable device that can help individuals who are hard of hearing navigate their environment via visual cues. Project was awarded highest honors at the NYS Science Congress.
Sound source localization by locally fitting autonomous state space models (LSSMs)
Software for Hard of Hearing
Sound event localization and detection in 360-degree audio-visual soundscapes.
A modular toolkit for NMF-based sound source localization
Add a description, image, and links to the sound-localization topic page so that developers can more easily learn about it.
To associate your repository with the sound-localization topic, visit your repo's landing page and select "manage topics."