KISS

Klustering Images for Subset Selection. This project studies how to select a representative training subset for convolutional neural network classifiers, using clustering and difficulty estimation to choose samples. The repository collects the techniques, experiments, and tools developed for this problem.

About

In this project, our goal is to design and implement efficient methods for selecting a training subset from a larger image dataset. The chosen subsets should be relatively small while preserving classification accuracy or reducing it only slightly.

To accomplish this, we use DINOv2 as a feature extractor that turns each image into a feature vector. We then use the FAISS library to run similarity searches and clustering over these vectors and determine the most representative subset of elements.
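The idea of clustering feature vectors and keeping the most central sample of each cluster can be sketched as follows. This is a minimal illustration, not the repository's implementation: it uses a plain NumPy k-means as a stand-in for FAISS clustering, random vectors as a stand-in for DINOv2 embeddings, and the function name `select_representatives` and its parameters are hypothetical.

```python
import numpy as np


def select_representatives(features, n_clusters, n_iter=20, seed=0):
    """Return indices of the samples closest to each k-means centroid.

    Hypothetical helper: plain NumPy k-means stands in for FAISS here.
    """
    rng = np.random.default_rng(seed)
    features = np.asarray(features, dtype=np.float32)
    # Initialise centroids from randomly chosen distinct samples.
    start = rng.choice(len(features), n_clusters, replace=False)
    centroids = features[start].copy()

    for _ in range(n_iter):
        # Squared Euclidean distance from every sample to every centroid.
        dists = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for c in range(n_clusters):
            members = features[labels == c]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[c] = members.mean(axis=0)

    # For each final centroid, pick the single nearest sample.
    dists = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    # np.unique drops duplicates in case two centroids share a nearest sample.
    return np.unique(dists.argmin(axis=0))


# Random vectors stand in for image embeddings (assumption, not real data).
rng = np.random.default_rng(42)
embeddings = rng.normal(size=(500, 64)).astype(np.float32)
subset = select_representatives(embeddings, n_clusters=10)
print(len(subset), "representative samples selected")
```

The returned indices identify which images to keep in the training subset. With real data, the number of clusters controls the subset size, and a library such as FAISS would replace the brute-force distance computation for large datasets.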

The repository includes the methods and the experiments that validate their performance, so the project can serve as both a practical tool and a research framework.

P.S. We are aware it's clustering, not klustering.

Setup

To use our project, follow these simple steps:

Install PyTorch

Visit the PyTorch site to select your system configuration. We recommend using conda and the Nightly build:

conda install pytorch-nightly::pytorch torchvision torchaudio -c pytorch-nightly

Clone the GitHub repository:

git clone https://github.com/Drske/KISS.git

Build the `kiss` package

cd KISS
pip install .

For an editable installation, run:

pip install -e .

Test

Finally, run the kiss hello command to verify that the package has been installed correctly.

kiss hello

Repository structure

data

This directory serves as a placeholder for any dataset that should be downloaded while using the repository.

experiments

All experiment configurations and results are stored here.

src

Contains the source code of the kiss project.

notebooks

Contains useful notebooks, including examples and prototypes.

checkpoints

All pretrained model weights are stored here.

License

Our package has been released under the MIT License. Refer to LICENSE for more details.
