A simple PyTorch implementation of the Vision Transformer (ViT), described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale".
Updated Mar 6, 2023 - Python
A light implementation of RF-Solver-Edit (image version) with Diffusers