Skip to content

Deep Learning course project: Generate English sentence describing a picture

Notifications You must be signed in to change notification settings

szhou42/Neural-Image-Caption-Generator

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Show and tell

This project implements the Show and Tell paper In simple words, it takes an image and produce an English sentence that describes the image.

Model Short Summary

Resnet: takes an image and produce a 300-dimension image embedding vector LSTM: takes the image embedding vector above and a start word token(STK) to produce a prediction for next word (as a probability distribution) Beam Search: Find the optimal sequence of words given by the probability distribution given at each time step(optimal only under some beam size, not optimal in general).

Some Screenshots

Alt text

About

Deep Learning course project: Generate English sentence describing a picture

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%