Image-Captioning

I implemented a full deep learning pipeline that generates captions for images using a CNN encoder and an RNN decoder.

All code was written in Jupyter Notebooks for Udacity's "Computer Vision" course. The notebooks are fully documented with visuals, answers, and descriptions.

0. Dataset

Explores the COCO Dataset.
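
As a rough illustration of what exploring the captions involves, the sketch below groups captions by image. It assumes the usual layout of a COCO captions annotation file (an `images` list and an `annotations` list linked by `image_id`); the `sample` data and the `captions_by_file` helper are made up for this example and are not taken from the notebook.

```python
from collections import defaultdict

# Tiny in-memory stand-in for a COCO captions annotation file.
# The file names and captions here are invented for illustration.
sample = {
    "images": [
        {"id": 1, "file_name": "000000000001.jpg"},
        {"id": 2, "file_name": "000000000002.jpg"},
    ],
    "annotations": [
        {"id": 10, "image_id": 1, "caption": "A dog runs on the beach."},
        {"id": 11, "image_id": 1, "caption": "A brown dog near the sea."},
        {"id": 12, "image_id": 2, "caption": "Two people ride bicycles."},
    ],
}


def captions_by_file(coco):
    """Map each image file name to the list of its captions."""
    names = {img["id"]: img["file_name"] for img in coco["images"]}
    grouped = defaultdict(list)
    for ann in coco["annotations"]:
        grouped[names[ann["image_id"]]].append(ann["caption"])
    return dict(grouped)


grouped = captions_by_file(sample)
for name, caps in grouped.items():
    print(name, len(caps), caps[0])
```

With a real annotation file, the same function could be applied to the dictionary returned by `json.load`.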

1. Preliminaries

Loads and pre-processes the COCO Dataset, then designs a CNN-RNN model that automatically generates image captions.
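
One common part of caption pre-processing is turning each caption into a list of integer ids. The sketch below shows that idea in plain Python. The special tokens, the `threshold` parameter, and the function names are assumptions for illustration; the repository's own vocabulary and data-loading code may work differently.

```python
import re
from collections import Counter

# Special tokens (assumed names, chosen for this sketch).
START, END, UNK = "<start>", "<end>", "<unk>"


def tokenize(caption):
    """Lowercase a caption and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", caption.lower())


def build_vocab(captions, threshold=2):
    """Keep words seen at least `threshold` times; rarer words map to UNK."""
    counts = Counter(tok for c in captions for tok in tokenize(c))
    kept = sorted(w for w, n in counts.items() if n >= threshold)
    return {w: i for i, w in enumerate([START, END, UNK] + kept)}


def encode(caption, word2idx):
    """Wrap the caption in START/END and replace each token by its id."""
    unk = word2idx[UNK]
    ids = [word2idx[START]]
    ids += [word2idx.get(tok, unk) for tok in tokenize(caption)]
    ids.append(word2idx[END])
    return ids


captions = [
    "A dog runs on the beach.",
    "A dog sits on the grass.",
    "The cat sleeps.",
]
word2idx = build_vocab(captions, threshold=2)
encoded = encode("A dog chases the cat.", word2idx)
print(word2idx)
print(encoded)
```

The id sequences produced this way are what an RNN decoder would typically consume as input and target tokens during training.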

2. Training

Selects the hyperparameters and trains the CNN-RNN model.
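
To make the hyperparameter step concrete, here is a minimal sketch of how such settings could be grouped in one place. Every field name and value below is a placeholder chosen for illustration, not the configuration used in the training notebook.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    # All defaults are illustrative placeholders, not the notebook's values.
    batch_size: int = 64
    vocab_threshold: int = 5    # minimum word count to enter the vocabulary
    embed_size: int = 256       # size of image and word embeddings
    hidden_size: int = 512      # size of the RNN hidden state
    num_epochs: int = 3
    learning_rate: float = 1e-3

    def steps_per_epoch(self, num_captions):
        """Number of batches needed to see every caption once."""
        return math.ceil(num_captions / self.batch_size)


config = TrainConfig()
steps = config.steps_per_epoch(num_captions=1000)
total_steps = steps * config.num_epochs
print(steps, total_steps)
```

Keeping the settings in a single immutable object makes it easier to log them next to each training run and to compare runs later.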

3. Inference

Uses the trained CNN-RNN model to generate captions for test images.
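
Caption generation at inference time is often done with greedy decoding: feed the previous word back into the decoder and take the highest-scoring next word until an end token appears. The sketch below shows only that loop. Greedy decoding is an assumption here (the notebook may use sampling or beam search instead), and `toy_step` is a hypothetical stand-in for a trained decoder.

```python
def greedy_decode(features, step_fn, start_id, end_id, max_len=20):
    """Pick the highest-scoring word at each step until end_id or max_len."""
    token, state = start_id, features
    output = []
    for _ in range(max_len):
        scores, state = step_fn(token, state)
        token = max(range(len(scores)), key=scores.__getitem__)
        if token == end_id:
            break
        output.append(token)
    return output


# Toy vocabulary and decoder, invented for this example.
idx2word = ["<start>", "<end>", "a", "dog", "runs"]
NEXT = {0: 2, 2: 3, 3: 4, 4: 1}  # fixed word-to-next-word transitions


def toy_step(token, state):
    """Stand-in for one decoder step: returns (scores, new_state)."""
    scores = [0.0] * len(idx2word)
    scores[NEXT[token]] = 1.0
    return scores, state


ids = greedy_decode(None, toy_step, start_id=0, end_id=1)
caption = " ".join(idx2word[i] for i in ids)
print(caption)
```

In a real pipeline, `features` would be the CNN encoder's output for the test image and `step_fn` would wrap one forward step of the RNN decoder.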
