A quantized Llama 2 based medical chatbot that uses a medical encyclopedia PDF as its vector database and Chainlit for the UI.
- The medical bot uses the PDF file 71763-gale-encyclopedia-of-medicine.-vol.-1.-2nd-ed.pdf to generate a vector database, which is then used as a reference when answering user questions.
- The vector database is generated from the PDF file by the script ingest.py (see the ingestion sketch after this list).
- HuggingFaceEmbeddings from LangChain, with the model sentence-transformers/all-MiniLM-L6-v2, produces the embeddings for the vector database.
- The main Python script for the Chainlit chatbot is model.py
- The chatbot LLM is TheBloke/Llama-2-7B-Chat-GGML, a quantized version of Llama 2 7B Chat, which requires fewer resources than the full-precision model.
- The chatbot responds by displaying both the answer to the user's query and the source of the information from the PDF file.
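
As an illustration, the ingestion step might look roughly like the sketch below: load the PDF, split it into chunks, embed the chunks with sentence-transformers/all-MiniLM-L6-v2, and save a local vector index. The use of a FAISS store, the chunk sizes, and the vectorstore/db_faiss path are assumptions for this sketch, not necessarily what ingest.py actually does.

```python
# ingest.py (illustrative sketch; FAISS store, chunk sizes, and paths are assumptions)
from langchain.document_loaders import PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS

DATA_PATH = "71763-gale-encyclopedia-of-medicine.-vol.-1.-2nd-ed.pdf"
DB_FAISS_PATH = "vectorstore/db_faiss"  # assumed output location for the index

def create_vector_db():
    # Load the encyclopedia PDF and split it into overlapping chunks
    documents = PyPDFLoader(DATA_PATH).load()
    splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
    texts = splitter.split_documents(documents)

    # Embed each chunk with the MiniLM sentence-transformer and index it in FAISS
    embeddings = HuggingFaceEmbeddings(
        model_name="sentence-transformers/all-MiniLM-L6-v2",
        model_kwargs={"device": "cpu"},
    )
    db = FAISS.from_documents(texts, embeddings)
    db.save_local(DB_FAISS_PATH)

if __name__ == "__main__":
    create_vector_db()
```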
Python version: 3.10
- Install dependencies (creating a separate virtual environment is recommended):
  pip install -r requirements.txt
- Run the ingestion script to create the vector database:
  python ingest.py
- Run the Chainlit-LangChain script to start the chatbot powered by quantized Llama 2 7B Chat (an illustrative sketch of model.py follows these steps):
  chainlit run model.py -w
- Note: Since this code runs on the CPU, responses from the bot take some time to generate.
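
For reference, model.py presumably wires the quantized GGML model into a LangChain RetrievalQA chain over the vector index and serves it through Chainlit, with return_source_documents=True so the sources can be displayed next to the answer. Below is a minimal sketch, assuming CTransformers for loading the model, the same vectorstore/db_faiss path as above, and a recent Chainlit message API; the parameter values are placeholders.

```python
# model.py (illustrative sketch; model settings, paths, and retriever parameters are assumptions)
import chainlit as cl
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS
from langchain.llms import CTransformers
from langchain.chains import RetrievalQA

DB_FAISS_PATH = "vectorstore/db_faiss"  # assumed location of the FAISS index built by ingest.py

def build_qa_chain():
    # Reload the FAISS index with the same embedding model used during ingestion
    embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
    db = FAISS.load_local(DB_FAISS_PATH, embeddings)

    # Quantized Llama 2 7B Chat loaded on the CPU via ctransformers
    llm = CTransformers(
        model="TheBloke/Llama-2-7B-Chat-GGML",
        model_type="llama",
        config={"max_new_tokens": 512, "temperature": 0.5},
    )

    # return_source_documents=True lets the bot show where the answer came from
    return RetrievalQA.from_chain_type(
        llm=llm,
        chain_type="stuff",
        retriever=db.as_retriever(search_kwargs={"k": 2}),
        return_source_documents=True,
    )

@cl.on_chat_start
async def start():
    # Build the chain once per chat session
    cl.user_session.set("chain", build_qa_chain())

@cl.on_message
async def main(message: cl.Message):
    chain = cl.user_session.get("chain")
    # Run the synchronous chain without blocking Chainlit's event loop
    res = await cl.make_async(chain)({"query": message.content})
    answer, sources = res["result"], res["source_documents"]
    await cl.Message(content=f"{answer}\n\nSources: {sources}").send()
```

Because the GGML weights are served through ctransformers on the CPU, inference is slow, which is why the note above warns that responses take some time to generate.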