updating RAG example to use IBM granite model #745

sujee · 2024-10-25T06:25:14Z

Why are these changes needed?

Updating RAG example to use IBM Granite 3.0 model

Related issue number (if any).

Signed-off-by: Sujee Maniyam <sujee@node51.com>

shahrokhDaijavad · 2024-10-25T17:43:46Z

@sujee I am getting an error while running the ray version of the Notebook (rag_1A_dpk_process_ray.ipynb) about not finding pdf2parquet_transform_ray. I had no problem with the Python version of this notebook. Here is the error:

ModuleNotFoundError Traceback (most recent call last)
File :13

ModuleNotFoundError: No module named 'pdf2parquet_transform_ray'

shahrokhDaijavad · 2024-10-25T18:09:19Z

@sujee One more thing: In rag_1D_query_replicate.ipynb file, in "Step-6: Initialize LLM", please add the choice of IBM Granite as one of the models in the comments (I know that you are picking the Granite model for the actual execution).

sujee · 2024-10-25T18:18:06Z

@sujee I am getting an error while running the ray version of the Notebook (rag_1A_dpk_process_ray.ipynb) about not finding pdf2parquet_transform_ray. I had no problem with the Python version of this notebook. Here is the error:

ModuleNotFoundError Traceback (most recent call last) File :13

ModuleNotFoundError: No module named 'pdf2parquet_transform_ray'

If you have created a custom kernel when you setup the python env, like this

python -m ipykernel install --user --name=data-prep-kit --display-name "dataprepkit"

Choose this kernel in jupyter when running this notebook. See if it fixes the error.

shahrokhDaijavad · 2024-10-25T19:31:43Z

@sujee Thanks for the suggestion. I was using a conda env. So, I removed that env and created and activated a new env. That fixed the problem! In any case, since I have now tested successfully by going all the way through the rag_1D notebook by starting from either Python or Ray versions, I am ready to approve this, after you make the "one more thing" change above. Thanks.

Signed-off-by: Sujee Maniyam <sujee@node51.com>

shahrokhDaijavad

LGTM. Thanks, @sujee

shahrokhDaijavad · 2024-10-29T18:35:21Z

examples/notebooks/rag/rag_1D_query_replicate.ipynb

+    "| meta/meta-llama-3-70b-instruct      | Meta      | 70 B   | $ 0.65       | $ 2.75        |                                                      |\n",
+    "\n",
+    "\n",
+    "(Prices are as of Oct 2024)\n",
    "\n",


@sujee Since the prices keep changing all the time, even though you have commented that these are Oct. 24 prices, please remove the two price columns from this table (and the comment about Oct. 2024). Thanks.

BTW, the Llama 8b price number for the Output in the table is wrong. It should have been 0.25 (as shown on the replicate.com) and not 0.05. Same as Granite 8b numbers.

fixed model table @shahrokhDaijavad

Signed-off-by: Sujee Maniyam <sujee@node51.com>

shahrokhDaijavad

Thanks for making the change, @sujee. I tested the whole set of notebooks through 1D again after the change.

touma-I

Please see if you want to act on comments and let me know.

touma-I · 2024-10-30T21:08:58Z

examples/notebooks/rag/rag_1A_dpk_process_ray.ipynb

@@ -303,7 +303,8 @@
    "    \"data_files_to_use\": ast.literal_eval(\"['.pdf']\"),\n",
    "    # orchestrator\n",
    "    \"runtime_worker_options\": ParamsUtils.convert_to_ast(worker_options),\n",
-    "    \"runtime_num_workers\": MY_CONFIG.RAY_RUNTIME_WORKERS,\n",
+    "    # \"runtime_num_workers\": MY_CONFIG.RAY_RUNTIME_WORKERS,\n",


Why is this commented out. Please remove or go back to using MY._CONFIG.RAY_RuNTIME_WORKERS = 1

Setting MY._CONFIG.RAY_RUNTIME_WORKERS = 1 breaks some other transforms: ededupe and fuzzy-dedupe.
They seem to need at least 2 workers.

So I am setting workers = 1 only for PDF2PQ step (so we don't run into the race condition of cleaning out the downloaded models.

This is a temp fix until other worker issues are resolved.

touma-I · 2024-10-30T21:18:54Z

examples/notebooks/rag/rag_1D_query_replicate.ipynb

+    "| ibm-granite/granite-3.0-8b-instruct | IBM       | 8 B    | IBM's newest Granite Model v3.0  (default)           |\n",
+    "| ibm-granite/granite-3.0-2b-instruct | IBM       | 2 B    | IBM's newest Granite Model v3.0                      |\n",
+    "| meta/meta-llama-3.1-405b-instruct   | Meta      | 405 B  | Meta's flagship 405 billion parameter language model |\n",
+    "| meta/meta-llama-3-8b-instruct       | Meta      | 8 B    |                                                      |\n",


Missing description for the last 2 rows

@sujee I think we can add these descriptions for the last 2 rows:

Meta's 8 billion parameter language model
Meta's 70 billion parameter language model

Signed-off-by: Sujee Maniyam <sujee@node51.com>

updating RAG example to use IBM granite model

78451ef

Signed-off-by: Sujee Maniyam <sujee@node51.com>

shahrokhDaijavad requested review from shahrokhDaijavad and touma-I October 25, 2024 18:09

Updated documentation for LLM choices at Replicate

0fc1dbc

Signed-off-by: Sujee Maniyam <sujee@node51.com>

shahrokhDaijavad approved these changes Oct 25, 2024

View reviewed changes

shahrokhDaijavad self-requested a review October 29, 2024 18:30

shahrokhDaijavad requested changes Oct 29, 2024

View reviewed changes

shahrokhDaijavad mentioned this pull request Oct 29, 2024

Updating RAG example to use Granite model #695

Closed

sujee added 2 commits October 29, 2024 21:40

Updating model table description

6f2f9af

Signed-off-by: Sujee Maniyam <sujee@node51.com>

Merge branch 'dev_upstream' into rag-example6-granite

db70073

shahrokhDaijavad approved these changes Oct 30, 2024

View reviewed changes

touma-I reviewed Oct 30, 2024

View reviewed changes

fixed model descriptions, clarified a comment

8155bb7

Signed-off-by: Sujee Maniyam <sujee@node51.com>

touma-I merged commit aa013d2 into IBM:dev Oct 31, 2024
1 check passed

sujee deleted the rag-example6-granite branch November 5, 2024 06:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

updating RAG example to use IBM granite model #745

updating RAG example to use IBM granite model #745

sujee commented Oct 25, 2024

shahrokhDaijavad commented Oct 25, 2024

shahrokhDaijavad commented Oct 25, 2024

sujee commented Oct 25, 2024

@sujee I am getting an error while running the ray version of the Notebook (rag_1A_dpk_process_ray.ipynb) about not finding pdf2parquet_transform_ray. I had no problem with the Python version of this notebook. Here is the error:

shahrokhDaijavad commented Oct 25, 2024

shahrokhDaijavad left a comment

shahrokhDaijavad Oct 29, 2024 •

edited

Loading

sujee Oct 30, 2024

shahrokhDaijavad left a comment

touma-I left a comment

touma-I Oct 30, 2024

sujee Oct 31, 2024

touma-I Oct 30, 2024

shahrokhDaijavad Oct 30, 2024

updating RAG example to use IBM granite model #745

updating RAG example to use IBM granite model #745

Conversation

sujee commented Oct 25, 2024

Why are these changes needed?

Related issue number (if any).

shahrokhDaijavad commented Oct 25, 2024

@sujee I am getting an error while running the ray version of the Notebook (rag_1A_dpk_process_ray.ipynb) about not finding pdf2parquet_transform_ray. I had no problem with the Python version of this notebook. Here is the error:

shahrokhDaijavad commented Oct 25, 2024

sujee commented Oct 25, 2024

@sujee I am getting an error while running the ray version of the Notebook (rag_1A_dpk_process_ray.ipynb) about not finding pdf2parquet_transform_ray. I had no problem with the Python version of this notebook. Here is the error:

shahrokhDaijavad commented Oct 25, 2024

shahrokhDaijavad left a comment

Choose a reason for hiding this comment

shahrokhDaijavad Oct 29, 2024 • edited Loading

Choose a reason for hiding this comment

sujee Oct 30, 2024

Choose a reason for hiding this comment

shahrokhDaijavad left a comment

Choose a reason for hiding this comment

touma-I left a comment

Choose a reason for hiding this comment

touma-I Oct 30, 2024

Choose a reason for hiding this comment

sujee Oct 31, 2024

Choose a reason for hiding this comment

touma-I Oct 30, 2024

Choose a reason for hiding this comment

shahrokhDaijavad Oct 30, 2024

Choose a reason for hiding this comment

shahrokhDaijavad Oct 29, 2024 •

edited

Loading