mmseqs2

Here is 1 public repository matching this topic...

protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.

  • Updated Mar 21, 2025
  • Python
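
The cluster-aware splitting mentioned in the description can be illustrated with a minimal sketch. This is not protclust's API: the function name `cluster_aware_split`, its parameters, and the example cluster labels are all hypothetical, and the cluster labels are assumed to have been computed already (for example by a clustering tool such as MMseqs2). The idea is only that whole clusters, never individual sequences, are assigned to the train or test set, so similar sequences cannot end up on both sides.

```python
import random
from collections import defaultdict


def cluster_aware_split(cluster_ids, test_fraction=0.2, seed=0):
    """Split item indices into train/test so that no cluster spans both sets.

    Hypothetical helper for illustration; cluster_ids[i] is the cluster
    label of item i, assumed to come from a prior clustering step.
    """
    # Group item indices by their cluster label.
    clusters = defaultdict(list)
    for idx, cid in enumerate(cluster_ids):
        clusters[cid].append(idx)

    # Shuffle the cluster order reproducibly.
    order = sorted(clusters)
    random.Random(seed).shuffle(order)

    # Assign whole clusters to the test set until it reaches the target size,
    # then send all remaining clusters to the training set.
    target = test_fraction * len(cluster_ids)
    train, test = [], []
    for cid in order:
        if len(test) < target:
            test.extend(clusters[cid])
        else:
            train.extend(clusters[cid])
    return train, test


if __name__ == "__main__":
    # Made-up cluster labels for ten sequences.
    labels = ["c1", "c1", "c2", "c3", "c3", "c3", "c4", "c5", "c5", "c6"]
    train_idx, test_idx = cluster_aware_split(labels, test_fraction=0.2)
    print("train:", sorted(train_idx))
    print("test: ", sorted(test_idx))
```

Because clusters are kept intact, the test set can come out somewhat larger than the requested fraction when a large cluster is the last one added.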
