Skip to content

This project leverages advanced AI models to generate captions for images and translate them into regional languages (Kannada and Hindi). Additionally, it offers text-to-speech conversion, making it accessible to a wider audience, specially those with visual impairments.

Notifications You must be signed in to change notification settings

LavanyaAN21/Depiction-of-image-features-with-audio-to-aid-visually-impaired-person

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Depiction of image features with audio to aid visually impaired person🖼️🔊

About

This project leverages advanced AI models to generate captions for images and translate them into regional languages (Kannada and Hindi). Additionally, it offers text-to-speech conversion, making it accessible to a wider audience, specially those with visual impairments.

🚀Key Features

Image Captioning: Generate meaningful captions based on the content of images. Language Translation: Translate captions from English to Kannada and Hindi. Speech Conversion: Convert captions to audio files using gTTS for ease of access. Multi-modal Application: Supports both visual and auditory outputs for different use cases.

🔍💡Use Cases

Accessibility Aid: Helps visually impaired users by describing images via audio. Language Learning Tool: Supports language translation for educational purposes. Interactive Learning: Enhances digital learning tools with multi-language support.

🎯 The goal of this project is to:

  1. Generate meaningful captions for images.
  2. Translate captions into regional languages (English,Kannada & Hindi).
  3. Convert captions to audio for accessibility.

This tool can be useful in various applications such as:

  • Assisting visually impaired individuals with image descriptions.
  • Learning language translations through images.
  • Enhancing interactive educational tools.

📸Outputs

Screenshot 2024-10-16 183155 Screenshot 2024-10-16 183215

About

This project leverages advanced AI models to generate captions for images and translate them into regional languages (Kannada and Hindi). Additionally, it offers text-to-speech conversion, making it accessible to a wider audience, specially those with visual impairments.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages