Here's a README file for the GitHub repository of the project you described:
This project provides a suite of solutions for analyzing documents (PDFs and images) using AI technology. Users can upload their documents, ask questions, and receive intelligent responses enhanced by various AI capabilities.
-
PDF Analysis
- Upload multi-page PDF documents for analysis
- Ask questions about the documents
- AI enhancements:
- Translate: Translate text to different languages
- Comprehend: Perform entity detection and text analysis
- Polly: Convert text to speech
-
Image Analysis
- Upload images for analysis
- Ask questions about the images
- Images are analyzed using Amazon Textract for image analysis
- AI enhancements:
- Translate: Translate text to different languages
- Comprehend: Perform entity detection and text analysis
- Polly: Convert text to speech
-
RAG with Kendra and Bedrock
- Ask questions, and responses are generated by retrieving relevant chunks from Amazon Kendra and summarized using Claude 2
- Example: "What is ABDM?"
-
RAG with OpenSearch and Bedrock
- Ask questions, and responses are generated by retrieving relevant chunks from OpenSearch and summarized using Claude 2
- Example: "How did Amazon perform during COVID-19?"
-
Claude 3
- Ask questions related to mathematics and probability
- Provide images and text in the same query for analysis
Here's the rewritten version for a GitHub README file:
This repository contains code for invoking a large language model (LLM) to process and analyze large documents.
Follow these steps to set up the project locally:
cd GenAIsolutions
python3 -m venv .llmv
source .llmv/bin/activate
.llmv/bin/python
Open a terminal and install the required packages from the requirements.txt
file:
pip install -r requirements.txt
To run the Streamlit application, execute the following command:
streamlit run WelcomePage.py --server.port 8080
Alternatively, you can run the main script directly:
python3 main.py
This project allows you to invoke a large language model to process and analyze large documents. The main functionality is provided by the main.py
script, which you can run directly or through the Streamlit application.
Contributions to this project are welcome. If you find any issues or have suggestions for improvements, please open an issue or submit a pull request.