refactor: Async batch processing, limits, and configuration #80

Open
ScarFX wants to merge 2 commits into main

Conversation

ScarFX (Collaborator) commented Sep 30, 2024

Added async API calls to embed (process) and add chunks (documents) to vector storage.

Added env variables MAX_CHUNKS, EMBEDDING_TIMEOUT, BATCH_SIZE, and CONCURRENT_LIMIT to configure and limit the batching API class; descriptions are in the README. A sketch of the intended batching behavior follows the checklist below.

Tested with PGVector using Bedrock and amazon.titan-embed-text-v2:0. Also rebuilt successfully in Docker.

  1. Verified BATCH_SIZE and CONCURRENT_LIMIT affect the process calls to embed correctly
  2. Verified MAX_CHUNKS works as a limit and throws an exception when exceeded, with the correct default behavior of ignoring the limit when unset
  3. Verified EMBEDDING_TIMEOUT works within the limit and throws an exception when exceeded
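
As referenced above, a minimal sketch of how the four variables interact, assuming a provider coroutine `embed_batch` (the function names and default values here are illustrative, not the PR's actual code):

import asyncio
import os

# Env-var driven limits (names from the README; the bodies below are a
# hypothetical sketch, not the PR implementation).
MAX_CHUNKS = int(os.getenv("MAX_CHUNKS", "0"))            # 0 = no limit (default ignores)
EMBEDDING_TIMEOUT = float(os.getenv("EMBEDDING_TIMEOUT", "120"))
BATCH_SIZE = int(os.getenv("BATCH_SIZE", "75"))
CONCURRENT_LIMIT = int(os.getenv("CONCURRENT_LIMIT", "10"))

async def embed_all(docs, embed_batch):
    # MAX_CHUNKS: refuse oversized inputs up front
    if MAX_CHUNKS and len(docs) > MAX_CHUNKS:
        raise ValueError(f"{len(docs)} chunks exceeds MAX_CHUNKS={MAX_CHUNKS}")

    semaphore = asyncio.Semaphore(CONCURRENT_LIMIT)  # at most N batches in flight

    async def run_batch(batch):
        async with semaphore:
            # EMBEDDING_TIMEOUT applies per embed call; raises TimeoutError if exceeded
            return await asyncio.wait_for(embed_batch(batch), timeout=EMBEDDING_TIMEOUT)

    batches = [docs[i : i + BATCH_SIZE] for i in range(0, len(docs), BATCH_SIZE)]
    results = await asyncio.gather(*(run_batch(b) for b in batches))
    return [vec for batch in results for vec in batch]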

ScarFX self-assigned this Sep 30, 2024
ScarFX (Collaborator, Author) commented Sep 30, 2024

The ideal BATCH_SIZE varies by embeddings provider, and likely by model and file size. Provider-specific default values should be added in the future; right now the default is BATCH_SIZE=75, which is ideal for OpenAI but not for Bedrock.
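
One possible shape for those provider-specific defaults (the mapping and the Bedrock value below are placeholders, not measured recommendations):

import os

# Hypothetical per-provider defaults; 75 is the current global default
# (good for OpenAI per the comment above). The Bedrock entry is a
# placeholder to be tuned, not a measured value.
PROVIDER_BATCH_DEFAULTS = {"openai": 75, "bedrock": 25}

def default_batch_size(provider: str) -> int:
    env = os.getenv("BATCH_SIZE")
    if env:  # an explicit env var always wins
        return int(env)
    return PROVIDER_BATCH_DEFAULTS.get(provider, 75)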

process_docs.py Outdated
start_time = time.perf_counter()
for i in range(0, len(docs), BATCH_SIZE):
    batch = docs[i : min(i + BATCH_SIZE, len(docs))]
    # logger.info(f"Sending batch {i} to {i+len(batch)} / {len(docs)}")
Owner commented:
add back as logger.debug()
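
The requested change would look like this (assuming the module already configures a `logger`):

for i in range(0, len(docs), BATCH_SIZE):
    batch = docs[i : min(i + BATCH_SIZE, len(docs))]
    logger.debug(f"Sending batch {i} to {i + len(batch)} / {len(docs)}")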
