This is an implementation of the VQ-VAE (Vector Quantized Variational Autoencoder) from "Neural Discrete Representation Learning", along with a convolutional Variational Autoencoder, for compressing MNIST and CIFAR10. The code is based upon pytorch/examples/vae.
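The heart of the model is the vector quantization bottleneck described in the paper. Below is a minimal sketch of that step, assuming a `(k, d)` dictionary, the codebook/commitment losses, and the straight-through gradient estimator from the paper; the class name and interface are illustrative, not this repo's actual modules.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizer(nn.Module):
    """Minimal VQ bottleneck sketch (illustrative, not the repo's exact code).

    k: number of elements in the dictionary, d: dimension of each element.
    """
    def __init__(self, k, d, beta=0.25):
        super().__init__()
        self.embedding = nn.Embedding(k, d)  # the dictionary e_1 .. e_k
        self.embedding.weight.data.uniform_(-1.0 / k, 1.0 / k)
        self.beta = beta  # commitment cost

    def forward(self, z_e):
        # z_e: encoder output of shape (B, d, H, W)
        b, d, h, w = z_e.shape
        flat = z_e.permute(0, 2, 3, 1).reshape(-1, d)  # (B*H*W, d)
        # Squared Euclidean distance to every dictionary element: (B*H*W, k)
        dist = (flat.pow(2).sum(1, keepdim=True)
                - 2 * flat @ self.embedding.weight.t()
                + self.embedding.weight.pow(2).sum(1))
        idx = dist.argmin(dim=1)  # nearest-neighbor index per position
        z_q = self.embedding(idx).view(b, h, w, d).permute(0, 3, 1, 2)
        # Codebook loss (trains the dictionary) + commitment loss (trains encoder).
        vq_loss = F.mse_loss(z_q, z_e.detach()) + self.beta * F.mse_loss(z_e, z_q.detach())
        # Straight-through estimator: forward pass uses z_q, gradients flow to z_e.
        z_q = z_e + (z_q - z_e).detach()
        return z_q, vq_loss, idx.view(b, h, w)
```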
pip install -r requirements.txt
python main.py
- Python 3.6 (3.5 may work as well)
- PyTorch 0.4
- Additional requirements in requirements.txt
# For example
python3 main.py --dataset=cifar10 --model=vqvae --data-dir=~/.datasets --epochs=3
All images are taken from the test set. The top row shows the original images; the bottom row shows the reconstructions.
k is the number of elements in the dictionary; d is the dimension of each dictionary element (the number of channels in the bottleneck).
- MNIST (k=10, d=64)
- CIFAR10 (k=128, d=256)
- Imagenet (k=512, d=128)
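Concretely, each setting above fixes the shape of the dictionary and the number of bottleneck channels. A hypothetical instantiation, reusing the `VectorQuantizer` sketch from above (shapes are illustrative):

```python
import torch

mnist_vq = VectorQuantizer(k=10, d=64)    # dictionary weight shape: (10, 64)
cifar_vq = VectorQuantizer(k=128, d=256)  # dictionary weight shape: (128, 256)

# The encoder must therefore emit d channels at the bottleneck, e.g. for CIFAR10:
z_e = torch.randn(1, 256, 8, 8)           # (batch, d, H, W)
z_q, vq_loss, indices = cifar_vq(z_e)
print(z_q.shape, indices.shape)           # (1, 256, 8, 8) and (1, 8, 8)
```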
- Implement Continuous Relaxation Training of Discrete Latent Variable Image Models
- Sample using a PixelCNN prior
- Improve results on CIFAR10: perform the nearest-neighbor search over 10 dictionaries rather than 1 (one possible reading is sketched after this list)
- Improve results on CIFAR10: replace MSE with NLL
- Improve results on CIFAR10: measure bits/dim
- Compare the architecture with the official one
- Merge the VAE and VQ-VAE for MNIST and CIFAR10 into one script
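One possible reading of the multi-dictionary item above: split the d bottleneck channels into groups and quantize each group against its own dictionary. A hedged sketch of that idea, reusing the `VectorQuantizer` from earlier (this is one interpretation, not the planned implementation):

```python
import torch
import torch.nn as nn

class MultiVectorQuantizer(nn.Module):
    """Sketch: quantize groups of channels against separate dictionaries.

    Splits the d-channel bottleneck into n_dicts groups of d // n_dicts
    channels and runs an independent nearest-neighbor lookup per group.
    """
    def __init__(self, k, d, n_dicts=10, beta=0.25):
        super().__init__()
        assert d % n_dicts == 0
        self.quantizers = nn.ModuleList(
            VectorQuantizer(k, d // n_dicts, beta) for _ in range(n_dicts)
        )

    def forward(self, z_e):
        chunks = z_e.chunk(len(self.quantizers), dim=1)  # split along channels
        outs = [q(c) for q, c in zip(self.quantizers, chunks)]
        z_q = torch.cat([o[0] for o in outs], dim=1)     # reassemble channels
        vq_loss = sum(o[1] for o in outs)                # sum per-group losses
        return z_q, vq_loss
```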
See tf-vaevae for a good reference.