VST: Video Summarisation Transformer

This repository implements the model from the project Generating Summarised Videos Using Transformers, which can be found on my website. It was my Masters project from 2020. The model is implemented in PyTorch, with the requirements listed below.

Requirements

Package	Version
Python	3.6.8
PyTorch	1.4.0
NumPy	1.18.4
h5py	2.10.0
ortools	7.5.7466
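As a quick sanity check before training, the pinned versions in the table above can be compared against what is installed. This is only a minimal sketch and not part of the repository: the helper names `installed_version` and `check_versions` are hypothetical, and it assumes each package exposes a `__version__` attribute under the import names `torch`, `numpy`, `h5py` and `ortools`.

```python
import importlib
import sys

# Pinned versions copied from the requirements table; keys are import names
# (assumption: PyTorch is imported as "torch").
REQUIRED = {
    "torch": "1.4.0",
    "numpy": "1.18.4",
    "h5py": "2.10.0",
    "ortools": "7.5.7466",
}
REQUIRED_PYTHON = "3.6.8"


def installed_version(module_name):
    """Return the module's __version__ string, or None if it is unavailable."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def check_versions(required):
    """Map each module name to a (wanted, found, matches) tuple."""
    report = {}
    for name, wanted in required.items():
        found = installed_version(name)
        # startswith tolerates local build suffixes such as "1.4.0+cpu".
        matches = found is not None and found.startswith(wanted)
        report[name] = (wanted, found, matches)
    return report


if __name__ == "__main__":
    python_found = ".".join(str(part) for part in sys.version_info[:3])
    print("python: wanted {}, found {}".format(REQUIRED_PYTHON, python_found))
    for name, (wanted, found, ok) in check_versions(REQUIRED).items():
        status = "ok" if ok else "MISMATCH"
        print("{}: wanted {}, found {} [{}]".format(name, wanted, found, status))
```

A mismatch does not necessarily mean the code will fail; the table only records the versions the project was developed with.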

Installation

Cloning this repository as is should give you most of what you need. You will also need the datasets as provided in VASnet or, alternatively, pytorch-vsumm-reinforce. Place these datasets in the datasets folder provided.
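To confirm the datasets ended up in the right place, something like the following can be run from the repository root. It is a sketch under assumptions: the helper `find_datasets` is hypothetical, and it assumes the dataset files use the `.h5` extension (they are read with h5py, but check the names of the files you actually downloaded).

```python
from pathlib import Path


def find_datasets(folder="datasets", suffix=".h5"):
    """Return the sorted names of files in `folder` ending with `suffix`.

    The ".h5" default is an assumption about how the datasets are named.
    """
    root = Path(folder)
    if not root.is_dir():
        return []
    return sorted(
        path.name
        for path in root.iterdir()
        if path.is_file() and path.suffix == suffix
    )


if __name__ == "__main__":
    found = find_datasets()
    if found:
        print("Found datasets:", ", ".join(found))
    else:
        print("No .h5 files found in datasets/")
```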

Split files have been provided, taken from VASnet. To generate your own, please use the guide given in VASnet.
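For orientation only, random train/test splits could be sketched as below. This is not the VASnet procedure and does not replace its guide: the function `make_splits`, the `train_keys`/`test_keys` field names, the `video_N` key pattern and the 80/20 ratio are all assumptions made for illustration, so compare the output with the provided split files before relying on it.

```python
import json
import random


def make_splits(video_keys, num_splits=5, train_fraction=0.8, seed=0):
    """Build `num_splits` random train/test partitions of `video_keys`.

    The "train_keys"/"test_keys" field names are assumed, not taken
    from this repository.
    """
    rng = random.Random(seed)  # seeded so the splits are reproducible
    keys = sorted(video_keys)
    num_train = int(len(keys) * train_fraction)
    splits = []
    for _ in range(num_splits):
        shuffled = keys[:]
        rng.shuffle(shuffled)
        splits.append({
            "train_keys": sorted(shuffled[:num_train]),
            "test_keys": sorted(shuffled[num_train:]),
        })
    return splits


if __name__ == "__main__":
    # Hypothetical video keys, used only to show the output shape.
    example_keys = ["video_{}".format(i) for i in range(1, 11)]
    print(json.dumps(make_splits(example_keys, num_splits=2), indent=2))
```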

Models produced by me and utilised for this project will be available to download shortly.

How to ...

Train

To train the model, make sure you have the datasets described above and some train/test splits. Running the following command trains on all splits. Details of the parameters can be found by running main.py with --help.

python3 main.py --train --model_dir models/

Evaluate

To evaluate the models you have created, run the following command. Details of parameters can be found by running main.py with --help. To utilise beam search, provide a non-zero --beam_width.

python3 main.py --model_dir models/
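For readers unfamiliar with beam search, the idea behind the --beam_width option can be illustrated with a small generic sketch. This is not the code in beam_search.py: the function `beam_search`, the `toy_model` scorer and its probabilities are hypothetical and exist only to show how keeping several partial hypotheses can beat a greedy choice.

```python
import math


def beam_search(step_log_probs, start_token, end_token, beam_width, max_len):
    """Generic beam search over token sequences.

    `step_log_probs(prefix)` must return {token: log_prob} for the next
    token given the tuple `prefix`. Returns (best_sequence, log_prob).
    """
    beams = [((start_token,), 0.0)]  # (sequence, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for token, log_p in step_log_probs(seq).items():
                candidates.append((seq + (token,), score + log_p))
        # Keep only the beam_width highest-scoring extensions.
        candidates.sort(key=lambda item: item[1], reverse=True)
        beams = []
        for seq, score in candidates[:beam_width]:
            if seq[-1] == end_token:
                finished.append((seq, score))
            else:
                beams.append((seq, score))
        if not beams:
            break
    finished.extend(beams)  # hypotheses cut off at max_len
    return max(finished, key=lambda item: item[1])


def toy_model(prefix):
    """Hypothetical next-token distribution keyed on the last token."""
    table = {
        "<s>": {"a": 0.6, "b": 0.4},
        "a": {"x": 0.5, "y": 0.5},
        "b": {"x": 0.9, "y": 0.1},
        "x": {"</s>": 1.0},
        "y": {"</s>": 1.0},
    }
    return {tok: math.log(p) for tok, p in table[prefix[-1]].items()}


if __name__ == "__main__":
    for width in (1, 2):
        seq, score = beam_search(toy_model, "<s>", "</s>", width, max_len=5)
        print(width, seq, round(math.exp(score), 2))
```

In this toy setting a width of 1 behaves greedily and commits to "a", ending with probability 0.3, while a width of 2 keeps "b" alive and finds the sequence with probability 0.36.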

Visualise

Limited visualisation examples can be found in the notebook visualisations.ipynb. Examples include how to select a specific output from the evaluation set to isolate a machine summary. Example visualisations include...

Visualisation of decoder attention heads

To generate actual summaries, pytorch-vsumm-reinforce provides details on how to generate an MP4 video from a set of frames using a machine summary produced by the model.

Use your own data

Although I have not used this myself, details can be found in pytorch-vsumm-reinforce.

Acknowledgement

This implementation was possible thanks to the work of Zhou et al., Fajtl et al., and OpenNMT. Where their code has been utilised, a reference should follow. Thank you also to Zhang et al. for contributing to the processing of the datasets referenced above, alongside Zhou et al. and Fajtl et al. Citations for their work can be found below. If I have missed a reference of any sort, please submit an issue.

@misc{fajtl2018summarizing,
    title={Summarizing Videos with Attention},
    author={Jiri Fajtl and Hajar Sadeghi Sokeh and Vasileios Argyriou and Dorothy Monekosso and Paolo Remagnino},
    year={2018},
    eprint={1812.01969},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

@inproceedings{opennmt,
  author    = {Guillaume Klein and
               Yoon Kim and
               Yuntian Deng and
               Jean Senellart and
               Alexander M. Rush},
  title     = {OpenNMT: Open-Source Toolkit for Neural Machine Translation},
  booktitle = {Proc. ACL},
  year      = {2017},
  url       = {https://doi.org/10.18653/v1/P17-4012},
  doi       = {10.18653/v1/P17-4012}
}

@article{zhou2017reinforcevsumm,
   title={Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward},
   author={Zhou, Kaiyang and Qiao, Yu and Xiang, Tao},
   journal={arXiv:1801.00054},
   year={2017}
}

@inproceedings{zhang2016video,
  title={Video summarization with long short-term memory},
  author={Zhang, Ke and Chao, Wei-Lun and Sha, Fei and Grauman, Kristen},
  booktitle={ECCV},
  year={2016}
}
