RAG MVP #374

chidochipotle · 2024-04-24T04:57:32Z

Adding a minimum minimum viable RAG feature

apps/service_providers/llm_service/runnables.py

apps/experiments/views/experiment.py

apps/service_providers/llm_service/runnables.py

available_variables should be added as "source_material" using form_data , but I could not figure out how to od it with form_data.

chidochipotle · 2024-05-23T05:12:56Z

Hi Simon, added the commits address your previous comments.

Note the last one is unfinished, hope we can finish it in our meeting.
Unfinished tasks:

available_variables should be added as "source_material" using form_data , but I could not figure out how to do it with form_data.
A message hint should be added to the UI when files are attach to explain how to add the "context" and "input" variables. (Guess you will do this on your end)
Testing was minimal see screenshot below:

Totally independently:
I don't remember if I reported the bug where there are issues if you attach a file before creating the experiment.
It may not exist anymore.
I had to create an experiment every time I tested, sometimes if I reused old experiments after changing the code there was issues such as getting no answer with the system got in a bad state and started to throw lots of logs.

snopoke

Looking good. A few small comments but nothing major.

snopoke · 2024-05-23T11:51:52Z

apps/experiments/tasks.py

+def store_rag_embedding(self, experiment) -> None:
+    file_path = experiment.files.all().last().file.path
+    splits = load_rag_file(file_path)
+    embeddings_model = OpenAIEmbeddings()


Use embeddings from LLM service

Suggested change

embeddings_model = OpenAIEmbeddings()

embeddings_model = experiment.get_llm_service().get_openai_embeddings()

apps/experiments/tasks.py

apps/experiments/views/experiment.py

apps/experiments/tasks.py

chidochipotle · 2024-05-23T16:37:52Z

@snopoke, just pushed your fixes.

Tested with a simple pdf file:
RAG_test_file.pdf

The following prompt:

You are an assistant for question-answering tasks. Use the following pieces of retrieved context to answer the question. If you don't know the answer, just say that you don't know. Use three sentences maximum and keep the answer concise.

Question: {input} 

Context: {context} 

Answer:

The following questions:

Hello, you can ask me anything you want about last_test.

What is "Policy objective 1" ?

Policy objective 1 is to reduce newborn, child, and adolescent morbidity and mortality due to preventable communicable diseases.

What is the capital of France ?

I don't know.

Daniel Kornhauser added 3 commits April 24, 2024 00:23

Add RAG minimum viable feature

e3823b6

Remove debug leftover print statements

f451565

Remove untested html support

15d2aa2

snopoke reviewed Apr 26, 2024

View reviewed changes

apps/service_providers/llm_service/runnables.py Outdated Show resolved Hide resolved

apps/experiments/views/experiment.py Outdated Show resolved Hide resolved

apps/service_providers/llm_service/runnables.py Outdated Show resolved Hide resolved

snopoke changed the title ~~Sk/pg vector~~ RAG MVP May 16, 2024

Daniel Kornhauser added 3 commits May 22, 2024 17:24

Replacing files.all() by better suited files.exists()

ae638bf

Moving RAG embedding creation to celery task

9abe6c7

Unfinished improvement: Using prompt from the experiment

5645800

available_variables should be added as "source_material" using form_data , but I could not figure out how to od it with form_data.

snopoke reviewed May 23, 2024

View reviewed changes

Fixed several small issues commented by Simon in PR RAG MVP dimagi#374

22522a5

snopoke approved these changes May 28, 2024

View reviewed changes

snopoke merged commit 3eeae5b into dimagi:sk/pg-vector May 28, 2024
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RAG MVP #374

RAG MVP #374

chidochipotle commented Apr 24, 2024

chidochipotle commented May 23, 2024

snopoke left a comment

snopoke May 23, 2024

chidochipotle commented May 23, 2024

	embeddings_model = OpenAIEmbeddings()
	embeddings_model = experiment.get_llm_service().get_openai_embeddings()

RAG MVP #374

RAG MVP #374

Conversation

chidochipotle commented Apr 24, 2024

chidochipotle commented May 23, 2024

snopoke left a comment

Choose a reason for hiding this comment

snopoke May 23, 2024

Choose a reason for hiding this comment

chidochipotle commented May 23, 2024