This repo was a course project for CS 598DL (Fall 2018) at UIUC. It is a PyTorch implementation of the paper Show and Tell: A Neural Image Caption Generator. Please refer to the Project Slides for more details. The model trained with our code (~130 hrs of training!) can perform better than Google's original implementation! See the MSCOCO Leaderboard for the detailed performance ranking: our team is siqimiao, and team OriolVinyals is Google's implementation.
If you find our repo useful, please star us or cite us. :)
@software{Miao_Show-and-tell_2018,
  author = {Miao, Siqi},
  doi = {10.5281/zenodo.1234},
  month = {11},
  title = {{Show-and-tell}},
  url = {https://github.com/siqim/Show-and-tell},
  version = {1.0.0},
  year = {2018}
}