This project combines audio transcription capabilities with AI-powered interaction, allowing users to transcribe audio and engage in conversations with an AI model.
## Features

- Audio transcription using Whisper
- Integration with Ollama for AI-powered conversations
- Real-time visual analysis using computer vision and AI
- Describe objects, scenes, and activities captured by webcam or screenshot
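As a rough illustration of how the first two pieces fit together, here is a minimal sketch of the transcription-to-conversation loop, assuming the `faster-whisper` package and Ollama's local REST API. The audio file name and model names are placeholders, not values taken from `bobo.py`.

```python
# Minimal sketch: transcribe an audio clip with faster-whisper, then send the
# text to a local Ollama server. File and model names are illustrative only.
import requests
from faster_whisper import WhisperModel

model = WhisperModel("base", device="cpu", compute_type="int8")

# faster-whisper yields transcription segments lazily; join their text.
segments, _info = model.transcribe("recording.wav")
transcript = " ".join(segment.text.strip() for segment in segments)

# Forward the transcript to Ollama's generate endpoint (default port 11434).
response = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama3", "prompt": transcript, "stream": False},
    timeout=120,
)
print(response.json()["response"])
```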
## Requirements

- Python 3.8+
- See `requirements.txt` for a full list of dependencies
## Installation

- Clone the repository
- Create and activate a Python virtual environment:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows use `venv\Scripts\activate`
  ```

- Install the required packages:

  ```bash
  pip install -r requirements.txt
  ```

- Install Ollama following the instructions at ollama.ai (a quick check that the server is running is sketched below)
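Before launching the app, you may want to confirm that the Ollama server is reachable. This is a hedged sketch using Ollama's standard model-listing endpoint on its default port; it is not part of the project itself.

```python
# Quick sanity check: list the models served by a local Ollama instance.
# Assumes the default Ollama port (11434); adjust if you changed it.
import requests

try:
    resp = requests.get("http://localhost:11434/api/tags", timeout=5)
    resp.raise_for_status()
    models = [m["name"] for m in resp.json().get("models", [])]
    print("Ollama is running. Available models:", models or "none pulled yet")
except requests.ConnectionError:
    print("Ollama does not appear to be running. Start it with `ollama serve`.")
```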
## Usage

- Ensure your virtual environment is activated
- Run the main script:

  ```bash
  python bobo.py
  ```
- Ask Bobo to describe what it sees through the webcam or to take a look at your screen!
- Customize the AI model prompts as needed (one illustrative approach is sketched below)
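How the prompts are structured depends on `bobo.py` itself, but as a general illustration of the pattern, here is a hedged sketch of steering the assistant's persona with a custom system prompt via Ollama's chat endpoint. The prompt text and model name are placeholders, not the project's actual values.

```python
# Illustrative only: one way to customize the assistant's behavior with a
# system prompt through Ollama's chat endpoint. The model name and prompt
# text below are placeholders, not values taken from bobo.py.
import requests

SYSTEM_PROMPT = "You are Bobo, a concise assistant that describes what it sees."

response = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": "llama3",
        "messages": [
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": "What do you see right now?"},
        ],
        "stream": False,
    },
    timeout=120,
)
print(response.json()["message"]["content"])
```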
## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
## License

This project is licensed under the MIT License.
## Acknowledgments

- OpenAI Whisper
- Faster Whisper
- Ollama
- All other open-source libraries used in this project