Image identification with the Kosmos-2 model, plus drawing and cropping bounding boxes with object detection
Updated Jul 25, 2024 - Python
This project is a Streamlit web application that uses OpenAI's GPT-4o to generate descriptions for uploaded images.
AI Image Description Generator accurately extracts the key elements from images and interprets the creative purposes behind them, which can be applied in fields such as scientific research, artistic creation, and the mutual search between images and texts.
Experimenting with a mastodon.social client alt-text usage dataset.
An intelligent assistant powered by the ReAct framework, leveraging LangChain for tool-based reasoning and Gradio for a user-friendly interface. Supports tasks like weather queries, PDF summarization, image descriptions, and more.
It is an innovative repository housing a sophisticated Large Language Model (LLM) project, showcasing the intersection of advanced natural language processing and cutting-edge artificial intelligence. This repository serves as a comprehensive platform for the development, experimentation, and application of state-of-the-art language models.
AI-Powered-Solution-for-Assisting-Visually-Impaired-Individuals
A public repository with data and code for "When an Image Tells a Story: The Role of Visual and Semantic Information for Generating Paragraph Descriptions", Nikolai Ilinykh and Simon Dobnik. 2020. In Proceedings of the 13th International Conference on Natural Language Generation, pages 338–348, Dublin, Ireland.