image-descriptions

Here are 14 public repositories matching this topic...

google / imageinwords

Data release for the ImageInWords (IIW) paper.

evaluation dataset image-captioning dataset-generation image-to-text image-descriptions image-text human-annotation t2i i2t detailed-descriptions detailed-annotations

Updated Nov 17, 2024
JavaScript

baaivision / DenseFusion

Star

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

vlm image-descriptions visual-perception mllm multimodal-large-language-models vision-language-models

Updated Dec 6, 2024
Python

meng1994412 / CBIR

Star

Content-Based Image Retrieval System

information-retrieval database computer-vision image-descriptions keypoints-detector

Updated Dec 18, 2018
Python

dhruvik-patel / image-description

Star

This repo represents our machine learning project Image Description which is used to generate a description of an image based on activities and objects detected in the image.

python flask machine-learning image tensorflow image-processing cnn lstm image-descriptions tflite-models image-descriptor

Updated Apr 8, 2024
CSS

Pavansomisetty21 / Image-Caption-Generation-using-LLMs-GEMINI-

Sponsor

Star

we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI

Updated Aug 24, 2024
Jupyter Notebook

alterism / mastodon-alt-text

Star

Experimenting with mastodon.social client alt-text usage dataset.

data-science university accessibility a11y university-project datascience mastodon image-descriptions fediverse alt-text alttext mastodon-social image-description aiss-master

Updated Dec 18, 2024
HTML

DevExpress-Examples / office-file-api-ai-implementation

Star

Integrate AI capabilities into a DevExpress-powered Office File API Web API application.

ai accessibility web-api devexpress image-descriptions word-processing office-file-api spreadsheet-document-api

Updated Mar 4, 2025
C#

aviralchharia / Neural-Image-Captioning

Star

In this project, we use a Deep Recurrent Architecture, which uses CNN (VGG-16 Net) pretrained on ImageNet to extract 4096-Dimensional image feature Vector and an LSTM which generates a caption from these feature vectors.

natural-language-processing computer-vision cnn lstm neural-networks image-captioning show-and-tell vgg16 bleu-score image-descriptions flickr8k-dataset feature-vectors

Updated Sep 9, 2020
Jupyter Notebook

TatjanaChernenko / image_description_generation

Star

NL Generation from structured inputs. Focuses on generating natural language descriptions for images by exploring the relationship between textual descriptions and image attributes. Leveraging an encoder-decoder architecture with LSTM cells, the system transforms normalized vector representations of attributes into fixed-length vector.

data-science lstm image-captioning natural-language-generation lstm-neural-networks encoder-decoder image-descriptions