Jinwoo Shin1 · Anima Anandkumar4
1 KAIST 2NVIDIA Corporation 3UC Berkeley 4Caltech
conda create -n cmd python=3.8 -y
conda activate cmd
pip install -r requirements.txt
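If the environment was set up correctly, a quick import check should succeed (this assumes PyTorch is among the pinned requirements, which the torchrun-based training scripts below rely on):
python -c "import torch; print(torch.__version__)"   # verify PyTorch is importable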
Currently, we provide experiments for UCF-101. You can place the dataset anywhere you like and specify its location via the --data-path argument in the training scripts. The expected directory layout is shown below, followed by an optional sanity check.
UCF-101
|-- class1
    |-- video1.avi
    |-- video2.avi
    |-- ...
|-- class2
    |-- video1.avi
    |-- video2.avi
    |-- ...
|-- ...
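As an optional sanity check (not part of the provided scripts), you can verify the layout from the shell; the path below is only an example and should match whatever you pass to --data-path:
ls /data/UCF-101 | head                      # should list the class folders
find /data/UCF-101 -name "*.avi" | wc -l     # should count all video files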
torchrun --nnodes=[NUM_NODES] --nproc_per_node=[NUM_GPU] train_ae.py \
--dataset-name UCF101 \
--data-path /data/UCF-101 \
--global-batch-size [BATCH_SIZE] \
--results-dir [LOG_DIRECTORY] \
--mode pixel \
--ckpt-every 20000
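For example, a single-node run on 8 GPUs could be launched as follows; the batch size and log directory here are illustrative values only, not recommended settings from the paper:
torchrun --nnodes=1 --nproc_per_node=8 train_ae.py \
--dataset-name UCF101 \
--data-path /data/UCF-101 \
--global-batch-size 64 \
--results-dir ./results/ae \
--mode pixel \
--ckpt-every 20000
The same placeholder substitution applies to the two diffusion-model training commands below.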
torchrun --nnodes=[NUM_NODES] --nproc_per_node=[NUM_GPUS] train_motion_diffusion.py \
--dataset-name UCF101 \
--data-path /data/UCF-101 \
--global-batch-size [BATCH_SIZE] \
--results-dir [LOG_DIRECTORY] \
--mode pixel \
--ckpt-every 20000
torchrun --nnodes=[NUM_NODES] --nproc_per_node=[NUM_GPUS] train_content_diffusion.py \
--dataset-name UCF101 \
--data-path /data/UCF-101 \
--global-batch-size [BATCH_SIZE] \
--results-dir [LOG_DIRECTORY] \
--mode pixel \
--ckpt-every 20000 \
--motion-model-config [MOTION_MODEL_CONFIG]
These scripts will automatically create a folder under [LOG_DIRECTORY] to save logs and checkpoints.
This code may not exactly reproduce the results reported in the paper, due to possible human errors while preparing and cleaning the code for release. If you have trouble reproducing our results, please let us know. We also plan to run sanity-check experiments in the near future.
Please consider citing CMD if this repository is useful for your work.
@inproceedings{yu2024cmd,
  title={Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition},
  author={Sihyun Yu and Weili Nie and De-An Huang and Boyi Li and Jinwoo Shin and Anima Anandkumar},
  booktitle={International Conference on Learning Representations},
  year={2024}
}
Copyright © 2024, NVIDIA Corporation. All rights reserved.
This work is made available under the NVIDIA Source Code License-NC. Click here to view a copy of this license.
This code is mainly built upon the PVDM, DiT, and glide-text2im repositories. We also used code from the following repositories: StyleGAN-V and TATS.