This is a PyTorch implementation of image captioning, following the paper "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention".
- Python 3.6.8
- PyTorch 1.0.1
- Download the following images:
- train2014: http://images.cocodataset.org/zips/train2014.zip
- val2014: http://images.cocodataset.org/zips/val2014.zip
- Extract both archives into the same directory (a download/extract sketch follows below).
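A minimal sketch of the download-and-extract step using only the Python standard library; the `data/` target directory is an assumption, so point it wherever you keep your datasets.

```python
import os
import urllib.request
import zipfile

urls = [
    "http://images.cocodataset.org/zips/train2014.zip",
    "http://images.cocodataset.org/zips/val2014.zip",
]
data_dir = "data"  # assumed target directory; use whatever path you prefer
os.makedirs(data_dir, exist_ok=True)

for url in urls:
    archive = os.path.join(data_dir, url.split("/")[-1])
    urllib.request.urlretrieve(url, archive)  # large downloads (several GB each)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(data_dir)  # creates train2014/ and val2014/ side by side
```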
- Define the required paths in make_input_files.py
- Run:
python make_input_files.py
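The exact variables differ per script, but the paths typically point at the extracted image folders and an output directory. A hedged sketch, with assumed variable names (match them to the actual script):

```python
# Illustrative only: these names are assumptions, not the script's actual API.
train_image_folder = "data/train2014"  # extracted train2014 images
val_image_folder = "data/val2014"      # extracted val2014 images
output_folder = "data/processed"       # where the generated input files are written
```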
- There are four training scripts, one per loss function (see the sketch after this list):
  - train.py: cross-entropy loss
  - trainL2.py: L2 loss
  - trainL1.py: L1 loss
  - train_cosine.py: cosine similarity loss
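The scripts differ mainly in the PyTorch criterion they optimize. A minimal sketch of the correspondence; how each script applies its criterion to the decoder outputs is not shown here:

```python
import torch.nn as nn

criterion_ce = nn.CrossEntropyLoss()      # train.py: logits vs. target word indices
criterion_l2 = nn.MSELoss()               # trainL2.py: L2 (mean squared error)
criterion_l1 = nn.L1Loss()                # trainL1.py: L1 (mean absolute error)
criterion_cos = nn.CosineEmbeddingLoss()  # train_cosine.py: cosine similarity between embeddings
```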
- Train the model with the script for your desired loss function, e.g.:
python train.py
- Evaluate the model with:
python evaluate.py
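Captioning models are commonly scored with corpus-level BLEU-4; whether evaluate.py uses nltk or another scorer is an assumption. A minimal sketch of such an evaluation:

```python
from nltk.translate.bleu_score import corpus_bleu

# references: per image, a list of reference captions (each a list of tokens)
# hypotheses: per image, the generated caption as a list of tokens
references = [[["a", "dog", "runs"], ["the", "dog", "is", "running"]]]
hypotheses = [["a", "dog", "is", "running"]]
bleu4 = corpus_bleu(references, hypotheses)  # default weights give BLEU-4
print(f"BLEU-4: {bleu4:.4f}")
```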
- Caption a new image with:
python caption_image.py --img='path/to/image.jpeg' --model='path/to/BEST_checkpoint_coco_5_cap_per_img_5_min_word_freq.pth.tar' --word_map='path/to/WORDMAP_coco_5_cap_per_img_5_min_word_freq.json' --beam_size=5
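A hedged sketch of what the script does with its --model and --word_map arguments; the checkpoint keys ("encoder", "decoder") are assumptions borrowed from common PyTorch captioning setups, not a guarantee of this repo's checkpoint format.

```python
import json
import torch

# Assumed checkpoint layout: a dict holding the full encoder/decoder modules.
checkpoint = torch.load(
    "path/to/BEST_checkpoint_coco_5_cap_per_img_5_min_word_freq.pth.tar",
    map_location="cpu",
)
encoder = checkpoint["encoder"].eval()  # assumed key
decoder = checkpoint["decoder"].eval()  # assumed key

# The word map translates words to indices; invert it to decode predictions.
with open("path/to/WORDMAP_coco_5_cap_per_img_5_min_word_freq.json") as f:
    word_map = json.load(f)
rev_word_map = {v: k for k, v in word_map.items()}
```

A larger --beam_size keeps more candidate captions alive at each decoding step, usually improving caption quality at the cost of speed.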
The authors' original implementation: https://github.com/kelvinxu/arctic-captions