VIT (Vision Transformer)

This is a simple implementation of the Vision Transformer (ViT), a state-of-the-art architecture for image classification. ViT applies the Transformer to the image domain, with only slight modifications to the implementation to handle the different data modality.
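The main adaptation for images is how the input becomes a token sequence: the image is cut into fixed-size patches and each patch is flattened into a vector. The sketch below illustrates that step in plain Python. It is not taken from this repository's code; the function name `image_to_patches` and the sizes in the usage lines are assumptions chosen for illustration.

```python
def image_to_patches(image, patch_size):
    """Split an H x W image (a list of rows) into flattened patches.

    Returns the patches in row-major order; each patch is a flat list
    of patch_size * patch_size pixel values.
    """
    height = len(image)
    width = len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image sides must be divisible by patch_size")

    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Read the patch row by row and flatten it into one list.
            patch = [
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ]
            patches.append(patch)
    return patches


# Toy 4x4 single-channel image cut into 2x2 patches (hypothetical sizes).
toy = [[r * 4 + c for c in range(4)] for r in range(4)]
toy_patches = image_to_patches(toy, 2)
```

With a 384×384 input and an assumed patch size of 16, the same function would yield (384 / 16)² = 576 patches, which is the sequence length the Transformer would then process.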

Authors

Djim Momar Lo

Architecture

How to use it

This project is designed to make training intuitive. You can modify some parameters directly in the code. A later version will add GPU support; until then, you can enable it yourself by modifying the code. Input images must be 384×384 squares to match the values used in the article. Here is the article: click here
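Because inputs must be 384×384 squares, images of other shapes need preparing first. Below is a minimal sketch, assuming that a center crop followed by a resize is acceptable for your data. `center_square_box` and `needs_resize` are hypothetical helpers, not part of this repository. The box uses the `(left, upper, right, lower)` order that, for example, Pillow's `Image.crop` expects.

```python
TARGET_SIZE = 384  # side length required for input images


def center_square_box(width, height):
    """Return (left, upper, right, lower) of the largest centered square."""
    side = min(width, height)
    left = (width - side) // 2
    upper = (height - side) // 2
    return (left, upper, left + side, upper + side)


def needs_resize(box):
    """True if the cropped square is not already TARGET_SIZE pixels wide."""
    left, _, right, _ = box
    return (right - left) != TARGET_SIZE


# A 640x480 image (hypothetical size) is cropped to a centered 480x480 square,
# which would then still have to be resized to 384x384.
box = center_square_box(640, 480)
```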

Installation

  git clone https://github.com/lodjim/VIT
  cd VIT

  python3 main.py --help

