Audio Similarity Search

A Python library for audio similarity search using wav2vec2 embeddings and FAISS indexing (https://engineering.fb.com/2017/03/29/data-infrastructure/faiss-a-library-for-efficient-similarity-search/). This library provides efficient audio similarity search with support for multiple index types and built-in visualization tools. The library uses wav2vec2 (https://huggingface.co/docs/transformers/en/model_doc/wav2vec2) to get the embeddings from the audio files and these embeddings are indexed by FAISS to do similarity search.

Enter docs folder and run "make livehtml" to view the full documentation on your local machine, if you don't see the entire documentation in the below Read docs link.

Features

🎵 Audio similarity search using wav2vec2 embeddings
🚀 Multiple FAISS index types (Flat, IVF, HNSW, PQ)
📊 Built-in visualization tools
🔄 Batch processing support
💾 Save and load indices

Installation

Prerequisites

Python 3.10 or later
conda package manager

For M1/M2 Mac Users

# Create conda environment
conda create -n audio_sim python=3.10
conda activate audio_sim

# Install PyTorch ecosystem
pip3 install --pre torch torchaudio --index-url https://download.pytorch.org/whl/nightly/cpu

# Install FAISS
conda install -c conda-forge faiss

# Install the package
pip install anirudhp-audio-similarity

For Other Platforms

# Create conda environment
conda create -n audio_sim python=3.12
conda activate audio_sim

# Install dependencies
conda install -c pytorch pytorch torchaudio faiss-cpu

# Install the package
pip install anirudhp-audio-similarity

Development Installation

# Clone the repository
git clone https://github.com/AnirudhPraveen/audio_similarity.git
cd audio-similarity

# Create conda environment
conda create -n audio_sim python=3.12
conda activate audio_sim

# Install dependencies
conda install -c pytorch pytorch torchaudio
conda install -c conda-forge faiss

# Install in development mode
pip install -e .

Example code

from audio_similarity import AudioSimilaritySearch, IndexType
from pathlib import Path

def main():
    # Initialize
    searcher = AudioSimilaritySearch(index_type=IndexType.FLAT)
    
    # Set up dataset
    dataset_dir = Path("dataset_directory").expanduser()
    query_file = Path("query_directory").expanduser()
    
    # Get audio files
    audio_files = list(dataset_dir.glob("**/*.wav"))
    print(f"Found {len(audio_files)} audio files")
    
    # Add batch to Index files
    #searcher.add_batch(audio_files)

    saved_index_dir = Path("./saved_index_folder").expanduser() 
    # do not include the index.faiss file in the directory

    # Load saved index
    searcher.load(saved_index_dir)
    
    # 1. Get Search Results
    print("\n1. Search Results:")
    print("-" * 50)
    results = searcher.search(str(query_file), k=5)
    for i, (file_path, distance) in enumerate(results, 1):
        print(f"{i}. File: {Path(file_path).name}")
        print(f"   Distance: {distance:.4f}")
    
    # 2. Visualize Search Results
    searcher.visualize_search_results(
        query_path=str(query_file),
        results=results,
        save_path="search_results.png",
        show=True
    )

    print(results)

Advanced Usage

Batch Processing

from pathlib import Path

# Get all audio files in a directory
audio_dir = Path("path/to/audio/files")
audio_files = list(audio_dir.glob("*.wav"))

# Add files in batch
searcher.add_batch(audio_files)

Different Index Types

# Exact search (slower but accurate)
searcher = AudioSimilaritySearch(index_type=IndexType.FLAT)

# Approximate search (faster)
searcher = AudioSimilaritySearch(
    index_type=IndexType.IVF,
    index_params={'nlist': 100}
)

# Graph-based search (memory intensive but fast)
searcher = AudioSimilaritySearch(
    index_type=IndexType.HNSW,
    index_params={'M': 16}
)

Benchmarking

# Compare different index types
configs = [
    {'type': IndexType.FLAT},
    {'type': IndexType.IVF, 'params': {'nlist': 100}},
    {'type': IndexType.HNSW, 'params': {'M': 16}},
]

results = searcher.benchmark(
    compare_with=configs,
    num_samples=1000,
    num_queries=100,
    k=5
)

# Visualize benchmark results
searcher.visualize_benchmarks()

Documentation

Full documentation is available at Read the Docs.

Contributing

We welcome contributions! Please follow these steps:

Fork the repository
Create a new branch: git checkout -b feature-name
Make your changes and commit: git commit -am 'Add new feature'
Push to the branch: git push origin feature-name
Submit a Pull Request

Running Tests

# Install development dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run tests with coverage
pytest --cov=audio_similarity tests/

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation

If you use this library in your research, please cite:

@software{audio_similarity2024,
  author = {Anirudh Praveen},
  title = {Audio Similarity Search},
  year = {2024},
  publisher = {GitHub},
  url = {https://github.com/AnirudhPraveen/audio_similarity}
}

Acknowledgments

Facebook AI Research for wav2vec2
Facebook Research for FAISS
PyTorch team for torch and torchaudio

Contact

GitHub Issues: Project Issues
Email: anirudhpraveen2000@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.github/workflows		.github/workflows
audio_similarity		audio_similarity
conda-recipe		conda-recipe
docs		docs
examples		examples
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
audiosim_example.py		audiosim_example.py
environment.yml		environment.yml
mappings.json		mappings.json
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Similarity Search

Features

Installation

Prerequisites

For M1/M2 Mac Users

For Other Platforms

Development Installation

Example code

Advanced Usage

Batch Processing

Different Index Types

Benchmarking

Documentation

Contributing

Running Tests

License

Citation

Acknowledgments

Contact

About

Releases

Packages

Languages

License

AnirudhPraveen/audio_similarity

Folders and files

Latest commit

History

Repository files navigation

Audio Similarity Search

Features

Installation

Prerequisites

For M1/M2 Mac Users

For Other Platforms

Development Installation

Example code

Advanced Usage

Batch Processing

Different Index Types

Benchmarking

Documentation

Contributing

Running Tests

License

Citation

Acknowledgments

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages