
CapVis

Outline

Notice:

Known minor bugs:

  1. Please remember to input the beam size.
  2. After masking a picture, it may take a while to generate the new caption; if the caption shown is still the old one, please click "update" again.

How to use it

Requirements:

python3, nltk, numpy, pytorch, flask
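
If you install the dependencies with pip, something like the following should work (this assumes the standard PyPI package names; PyTorch is published as "torch", and no specific versions are pinned here):

pip install nltk numpy torch flask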

Files to download:

After cloning this project, please download the pretrained files and put them in the "CapVis/pretrained" directory:

CapVis/pretrained
              |__ Flickr_Data
              |__ BEST_checkpoint_flickr30k_5_cap_per_img_5_min_word_freq.pth.tar
              |__ WORDMAP_flickr30k_5_cap_per_img_5_min_word_freq.json
              |__ All_vectors
              |__ center_100
              |__ ...

File links:

  1. Pretrained computation model:
    Trained with flickr30k: "https://drive.google.com/open?id=1V2PQ7uGgEKv2Wp91p1CAoUBVivcvCLqg" (trained by us)
    Trained with COCO: "https://drive.google.com/drive/folders/189VY65I_n4RTpQnmLGj7IzVnOF6dmePC" (trained by sgrvinod)

     Note: the code currently uses the flickr30k model. If you want to use the COCO model, remember to change the path in "caption.py" (see the sketch after this list).

  2. Other files: "https://drive.google.com/file/d/1YmbJQXCAv08mNnpmtKHjWV4s2rQ1C3QL/view?usp=sharing"
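
A minimal sketch of what switching models might look like; the variable names and the COCO checkpoint filenames below are assumptions and may differ from what "caption.py" actually uses:

# Hypothetical sketch only: variable names and COCO filenames are assumptions.
# Flickr30k model (the current default in this project):
model_path = 'pretrained/BEST_checkpoint_flickr30k_5_cap_per_img_5_min_word_freq.pth.tar'
word_map_path = 'pretrained/WORDMAP_flickr30k_5_cap_per_img_5_min_word_freq.json'

# To use the COCO model instead, point both paths at the downloaded COCO files, e.g.:
# model_path = 'pretrained/BEST_checkpoint_coco_5_cap_per_img_5_min_word_freq.pth.tar'
# word_map_path = 'pretrained/WORDMAP_coco_5_cap_per_img_5_min_word_freq.json'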

There are three ways to use it:

1. Visualization On Website (main part)

python App.py

Then open "http://127.0.0.1:5000/home/" in Chrome. Please upload an image from "image_for_test", or any other image, but generate a vector for it before uploading.

2. Test a photo and plot the image

python caption.py --model='pretrained/BEST_checkpoint_flickr30k_5_cap_per_img_5_min_word_freq.pth.tar' --word_map='pretrained/WORDMAP_flickr30k_5_cap_per_img_5_min_word_freq.json' --beam_size=5 --img='image_for_test/test7.jpg'

3. Simply print the caption in the console

python get_cap.py --model='pretrained/BEST_checkpoint_flickr30k_5_cap_per_img_5_min_word_freq.pth.tar' --word_map='pretrained/WORDMAP_flickr30k_5_cap_per_img_5_min_word_freq.json' --beam_size=5 --img='image_for_test/test4.jpg'

Related resources

Observable D3 notebooks used:

Static data already uploaded:

https://observablehq.com/@yq605879396/artsed-bubble
https://observablehq.com/@yq605879396/zoomable-sunburst/2

Fetching data while the server is running:

https://observablehq.com/@yq605879396/pie-chart
https://observablehq.com/@yq605879396/pie-chart/2
https://observablehq.com/@yq605879396/mona-lisa-histogram/2

References

Computation Module:

https://github.com/sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning: an outstanding PyTorch implementation of "Show, Attend and Tell".

HTML template:

https://codyhouse.co/gem/vertical-fixed-navigation-2/: we rebuilt our website based on this template.
