AI Buddy is a cross-platform mobile application that leverages the power of AI to provide users with a personal assistant capable of understanding and responding to various queries.
Chat with PDF | Ask Image | Text Chatbot |
---|---|---|
The architecture of the AI Buddy app is designed to efficiently process and interact with PDF documents. Here's a breakdown of the workflow:
The "Chat with PDF" feature of the AI Buddy app allows users to interact with the content of a PDF document in a conversational manner. The app extracts information from the PDF, processes it, and provides relevant responses to user queries.
-
PDF Extraction: The PDF document is processed to extract its pages.
-
Chunking: The extracted pages are then split into manageable chunks for processing.
-
Batching: Chunks are grouped into batches (e.g., 1 batch = 100 chunks) for parallel processing.
-
Embedding Generation: Each batch is sent to an Embeddings API with the task type set to 'Document'. The API generates a list of vector embeddings for each chunk.
-
Local Storage: The generated embeddings for each batch are split into individual chunk embeddings. These embeddings are stored locally using Hive (a key-value database).
-
User Interaction: When a user inputs an instruction (e.g., "Give me an intro to the book"), the app sends the query to the Embeddings API with the task type set to 'Query' to generate an embedding for the query.
-
Semantic Search: The app performs a semantic search by taking the embedding of the user's query and comparing it across the document embeddings stored in Hive.
-
Ranking Results: The chunks are ranked based on the Euclidean distance between the query embedding and document embeddings. The top-ranked chunks are used as the context for the language model (LLM).
-
Response Generation: The language model generates an answer based on the context provided by the top-ranked chunks.
- Flutter: For cross-platform mobile application development.
- Hive: For local storage of chunk embeddings.
- Gemini Embeddings API: For generating vector embeddings of text.
- Gemini (LLM): For generating responses based on context.
- Riverpod: For managing states across the app.
Need the latest flutter version & a gemini api key from here
flutter upgrade
- Clone the repository from GitHub:
git clone https://github.com/yatendra2001/ai_buddy.git
- Get all dependencies:
flutter pub get
- Run the app:
flutter run
Contributions are what makes the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
Contribution to the project can be made if you have some improvements for the project or if you find some bugs. You can contribute to the project by reporting issues, forking it, modifying the code and making a pull request to the repository.
Please make sure you specify the commit type when opening pull requests:
feat: The new feature you're proposing
fix: A bug fix in the project
style: Feature and updates related to UI improvements and styling
test: Everything related to testing
docs: Everything related to documentation
refactor: Regular code refactoring and maintenance
To know more extensively about how to contribute to this project, read our Contribution Guide.
Distributed under the AGPL-3.0 license. See LICENSE
for more information.
Twitter: @iamyatendrak
LinkedIn: linkedin