Skip to content

This repository contains an implementation of the Vision Transformer (ViT) model as described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale".

Notifications You must be signed in to change notification settings

dusky04/vit-pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

vit-pytorch

This repository contains an implementation of the Vision Transformer (ViT) model as described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale". The Vision Transformer leverages the power of transformers, typically used in NLP, to achieve state-of-the-art results in image classification tasks.

About

This repository contains an implementation of the Vision Transformer (ViT) model as described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale".

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published