Update 23/06/2025: Added gradient accumulation logic for the training stage
This repository is a collection of VAEs (Variational Autoencoders) implemented in PyTorch. It aims to provide a convenient and simple way to work with the VAEs available today (still being updated). Three datasets are provided, Flowers102, MNIST, and FashionMNIST, to allow consistent comparisons in the future.
- Python == 3.10.14
- PyTorch >= 2.5
- CUDA-enabled computing device
To install, follow these steps:
- Clone the repository:
$ git clone https://github.com/quyongkeomut/Variational-Autoencoder
- Navigate to the project directory:
$ cd Variational-Autoencoder
- Install dependencies:
$ pip install -r requirements.txt
Base training arguments and their functionality are provided below:
python train_vae.py --help
usage: train_vae.py [-h] [--model MODEL] [--dataset DATASET] [--is_ddp] [--img_size IMG_SIZE] [--epochs EPOCHS] [--batch BATCH] [--lr LR] [--seed SEED]
Training args
options:
-h, --help show this help message and exit
--model MODEL Type of model, valid values are one of ['VAE', 'VQ-VAE']
--dataset DATASET Dataset to train model on, valid values are one of ['flowers102', 'mnist', 'fashion_mnist']
--is_ddp Option for choosing training in DDP or normal training criteria
--img_size IMG_SIZE Image size
--epochs EPOCHS Num epochs
--batch BATCH Batch size
--lr LR Learning rate
--seed SEED Random seed for training
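For instance, a typical run using only the base arguments might look like the following (the values shown are placeholders rather than recommended settings):

python train_vae.py --model VAE --dataset mnist --img_size 64 --epochs 50 --batch 32 --lr 1e-4 --seed 42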
In addition to the base arguments, the program accepts extra arguments, which are forwarded as model-, criterion-, optimizer-, or trainer-dependent keyword arguments. An example is provided below:
python train_vae.py --prior_weight 1e-3 --reconstruction_method bce --is_ddp
In the example above, --prior_weight assigns the weight of the prior loss term in the VAE objective, and --reconstruction_method selects the type of loss used for the reconstruction term; in this case BCE (Binary Cross Entropy), which is only valid if the target image is scaled to the range [0, 1]. To start the training process, run:
$ cd Variational-Autoencoder
$ python train_vae.py # along with extra arguments
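For context, here is a minimal sketch of how such a weighted VAE objective could be computed. The function below is hypothetical (the repository's actual criterion may differ); it only illustrates how --prior_weight and --reconstruction_method might interact:

import torch
import torch.nn.functional as F

def vae_loss(recon, target, mu, logvar, prior_weight=1e-3, reconstruction_method="bce"):
    # Hypothetical helper, not the repository's actual criterion.
    if reconstruction_method == "bce":
        # BCE requires targets scaled to [0, 1]
        recon_loss = F.binary_cross_entropy(recon, target, reduction="sum")
    else:
        recon_loss = F.mse_loss(recon, target, reduction="sum")
    # KL divergence between N(mu, sigma^2) and the standard normal prior N(0, I)
    prior_loss = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon_loss + prior_weight * prior_loss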
To modify the backbone of a specific VAE, edit the config file template corresponding to that model, which can be found in the ./experiment_setup/configs directory. Each file follows a template like this:
NAME: "<NAME OF VAE MODEL>"
IMG_CHANNELS: 3
LATENT_DIM: ...
ENCODER_CONFIGS:
  down_channels: [32, 64, 128, 256]
  expand_factor: 3
  drop_p: ...
  activation: "<name of activation>"
  initializer: "<name of initializer>"
  dtype: null
  ...
DECODER_CONFIGS:
  up_channels: [256, 192, 96, 48, 32]
  expand_factor: 3
  drop_p: ...
  activation: "<name of activation>"
  initializer: "<name of initializer>"
  dtype: null
  ...
OPTIMIZER_NAME: "<name of optimizer>"
OPTIM_KWARGS:
  weight_decay: 0
  ...
LOGGING_KWARGS:
  save_dir: ./weights/VAE
  ...
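As a rough illustration, such a config could be loaded with PyYAML and its sections unpacked as keyword arguments. The snippet below is a sketch with a hypothetical file name; the repository's actual setup code may differ:

import yaml  # PyYAML

with open("./experiment_setup/configs/vae.yaml") as f:  # hypothetical file name
    cfg = yaml.safe_load(f)

encoder_kwargs = cfg["ENCODER_CONFIGS"]  # down_channels, expand_factor, ...
decoder_kwargs = cfg["DECODER_CONFIGS"]  # up_channels, expand_factor, ...
print(cfg["NAME"], cfg["OPTIMIZER_NAME"])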
Any contribution to the main VAEs is welcome. If you obtain a better model by fine-tuning the hyperparameters, changing the backbone, or modifying the training procedure, please let me know. It would be a pleasure to include and cite your work in this repository.
If you would like to contribute your model, feel free to submit a Pull Request.
This repository is released under the Apache License 2.0. See the LICENSE file for details.
This project was created by Phuoc-Thinh Nguyen.
@misc{quyongkeomutVAEs,
  author = {Nguyen, P.-T.},
  title = {Variational Autoencoders},
  year = {2025},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/quyongkeomut/Variational-Autoencoder}}
}