Siamese Network with Interactive Transformer for Video Object Segmentation

This repository is the code release of the paper Siamese Network with Interactive Transformer for Video Object Segmentation, accepted by AAAI 2022.

Requirements

Linux
Python >= 3.6
Pytorch >=1.5
CUDA>=9.0
Pillow, opencv-python, scipy

Training

Stage 1

Pretraining on MS-COCO.

python train_coco.py -Dcoco "path to coco"

Stage 2

Training on Davis & Youtube-VOS.

python train_davis.py -Ddavis "path to davis" -Dyoutube "path to youtube-vos" -resume "path to coco pretrained weights"

Evaluation

Evaluating on DAVIS val set.

python eval.py -p "path to weights"

Demo

Acknowledgement

This codebase borrows the code and structure from official STM repository

Citing SITVOS

@article{lan2021siamese,
  title={Siamese Network with Interactive Transformer for Video Object Segmentation},
  author={Lan, Meng and Zhang, Jing and He, Fengxiang and Zhang, Lefei},
  journal={arXiv preprint arXiv:2112.13983},
  year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
dataset		dataset
docs		docs
evaldavis2017		evaldavis2017
model		model
utils		utils
README.md		README.md
eval.py		eval.py
train_coco.py		train_coco.py
train_davis.py		train_davis.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Siamese Network with Interactive Transformer for Video Object Segmentation

Requirements

Training

Stage 1

Stage 2

Evaluation

Demo

Acknowledgement

Citing SITVOS

About

Releases

Packages

Languages

LANMNG/SITVOS

Folders and files

Latest commit

History

Repository files navigation

Siamese Network with Interactive Transformer for Video Object Segmentation

Requirements

Training

Stage 1

Stage 2

Evaluation

Demo

Acknowledgement

Citing SITVOS

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages