Skip to content

VisionMate is an innovative screen-reader friendly web application that empowers individuals with visual impairments to explore new environments confidently. By harnessing advanced artificial intelligence, including large language models (LLMs) for natural conversational interactions and OpenAI's Whisper model for robust speech recognition.

Notifications You must be signed in to change notification settings

zjandali/VisionMate

Repository files navigation

🎯 VisionMate: AI-Powered Navigation Assistant

VisionMate is an innovative screen-reader friendly web application designed to help individuals with visual impairments explore and navigate new places with confidence. Through natural conversation and real-time location awareness, VisionMate provides detailed descriptions of surroundings and helps users discover new destinations.

✨ Key Features

  • Natural Conversation Interface: Engage in natural dialogue to discover new places and get recommendations
  • Live Location Awareness: Receive real-time descriptions of notable places as you walk by
  • Voice-First Design: Fully accessible through voice commands and screen readers
  • Smart Recommendations: Context-aware suggestions based on user preferences and location
  • Voice Memo System: Record and share location-specific hazards or points of interest

🚀 Technologies Used

  • Frontend: HTML5, CSS3, JavaScript
  • Backend: Express
  • Speech Recognition: Web Speech API
  • Location Services: Geolocation API
  • APIs:
    • Yelp Fusion API for business data
    • OpenAI Whisper for speech-to-text
    • Custom LLM for natural language understanding

🛠️ Installation

  1. Clone the repository
git clone https://github.com/yourusername/visionmate.git
cd visionmate
  1. Install dependencies
npm install
  1. Install Whisper (Required for voice processing)
pip install -U openai-whisper
  1. Install ffmpeg For Ubuntu/Debian:
sudo apt update && sudo apt install ffmpeg

For other platforms, see the ffmpeg installation guide.

  1. Set up environment variables Create a .env file in the root directory:
YELP_API_KEY=your_yelp_api_key
OPENAI_API_KEY=your_openai_api_key
  1. Start the server
npm start

Visit http://localhost:3000 in your browser.

📱 Remote Access Setup

To make VisionMate accessible from other devices:

  1. Install ngrok:
npm install -g ngrok
  1. Create a tunnel:
ngrok http 3000

🎯 Future Roadmap

  • Offline mode support
  • Multi-language support
  • Community-driven hazard reporting system
  • Integration with public transit APIs
  • Customizable voice profiles

🙏 Acknowledgments

  • OpenAI for Whisper speech recognition
  • Yelp for their comprehensive business data API
  • The accessibility community for valuable feedback and guidance

About

VisionMate is an innovative screen-reader friendly web application that empowers individuals with visual impairments to explore new environments confidently. By harnessing advanced artificial intelligence, including large language models (LLMs) for natural conversational interactions and OpenAI's Whisper model for robust speech recognition.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published