Commit

update readme
baptiste-pasquier committed Mar 14, 2024
1 parent a153ef5 commit f735b43
Showing 1 changed file with 5 additions and 3 deletions.
8 changes: 5 additions & 3 deletions README.md
@@ -58,9 +58,9 @@ Backend:
Parameters:

- `ingest.clear_database` : Whether to clear the database before ingesting new data.
- `ingest.metadata_keys` : Unstructured metadata to use.
- `ingest.table_format` : How to extract tables with Unstructured (`text`, `html` or `image`).
- `ingest.export_extracted` : Whether to export extracted elements to a local folder.
- `metadata_keys` : Unstructured metadata to use.

### RAG Option 3

@@ -75,17 +75,19 @@ Method:
Backend:

- Use the [multi-vector retriever](https://python.langchain.com/docs/modules/data_connection/retrievers/multi_vector)
with [Chroma](https://www.trychroma.com/) to store raw text and images along with their summaries for retrieval.
with [Chroma](https://www.trychroma.com/) to store raw text and images (in a docstore) along with their summaries (in a vectorstore) for retrieval.
- Use GPT-4V for both image summarization (for retrieval) and final answer synthesis from a joint review of images and texts (or tables).
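
The split between a docstore (raw elements) and a vectorstore (their summaries) can be illustrated with a minimal, dependency-free sketch. It is not the project's code and does not use LangChain or Chroma: the class `TinyMultiVectorStore`, its methods, and the word-overlap scoring (a stand-in for real embeddings) are all assumptions made for illustration.

```python
import uuid


def tokenize(text: str) -> set:
    """Lowercase and split on whitespace; a crude stand-in for an embedding."""
    return set(text.lower().split())


class TinyMultiVectorStore:
    """Hypothetical toy version of the multi-vector pattern.

    Summaries are searched; the raw element sharing the same id is returned.
    """

    def __init__(self):
        self.docstore = {}     # doc_id -> raw content (text, table, or image)
        self.vectorstore = []  # (doc_id, summary tokens) pairs used for search

    def add(self, raw: str, summary: str) -> str:
        doc_id = str(uuid.uuid4())
        self.docstore[doc_id] = raw
        self.vectorstore.append((doc_id, tokenize(summary)))
        return doc_id

    def retrieve(self, query: str, k: int = 1) -> list:
        query_tokens = tokenize(query)
        # Rank summaries by word overlap with the query (assumed scoring).
        ranked = sorted(
            self.vectorstore,
            key=lambda item: len(query_tokens & item[1]),
            reverse=True,
        )
        # Return the raw elements, not the summaries that matched.
        return [self.docstore[doc_id] for doc_id, _ in ranked[:k]]


store = TinyMultiVectorStore()
store.add("<base64 image bytes of a revenue chart>", "bar chart of quarterly revenue")
store.add("Full text of the methodology section.", "description of the survey methodology")
result = store.retrieve("quarterly revenue chart")
```

Here the query matches the first summary, so the raw image placeholder is what gets handed to the answer-synthesis step.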

Parameters:

- `ingest.clear_database` : Whether to clear the database before ingesting new data.
- `ingest.metadata_keys` : Unstructured metadata to use.
- `ingest.table_format` : How to extract tables with Unstructured (`text`, `html` or `image`).
- `ingest.summarize_text` : Whether to summarize texts with an LLM or use raw texts for retrieval.
- `ingest.summarize_table` : Whether to summarize tables with an LLM or use raw tables for retrieval.
- `ingest.vectorstore_source` : The field of documents to add into the vectorstore (`content` or `summary`).
- `ingest.docstore_source` : The field of documents to add into the docstore (`content` or `summary`).
- `ingest.export_extracted` : Whether to export extracted elements to a local folder.
- `metadata_keys` : Metadata keys from Unstructured to use.
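
One way to read `ingest.vectorstore_source` and `ingest.docstore_source` is as field selectors applied to each extracted document. The sketch below assumes each document is a dict with `content` and `summary` keys; the helper names `select_field` and `split_for_stores` are hypothetical and are not taken from the project.

```python
ALLOWED_SOURCES = ("content", "summary")


def select_field(document: dict, source: str) -> str:
    """Return the field named by `source` (assumed to be `content` or `summary`)."""
    if source not in ALLOWED_SOURCES:
        raise ValueError(f"source must be one of {ALLOWED_SOURCES}, got {source!r}")
    return document[source]


def split_for_stores(documents: list, vectorstore_source: str, docstore_source: str):
    """Pick, per document, what is indexed for search and what is kept for answers."""
    to_vectorstore = [select_field(doc, vectorstore_source) for doc in documents]
    to_docstore = [select_field(doc, docstore_source) for doc in documents]
    return to_vectorstore, to_docstore


documents = [
    {"content": "Raw table as HTML", "summary": "Table of sales by region"},
    {"content": "Raw paragraph text", "summary": "Overview of the product line"},
]
indexed, stored = split_for_stores(
    documents, vectorstore_source="summary", docstore_source="content"
)
```

With these settings, summaries are indexed for retrieval while the raw content is what gets returned for answer synthesis.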

## Installation
