-
Notifications
You must be signed in to change notification settings - Fork 319
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NaN Embedding Dimension Support #1792
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 Looks good to me! Reviewed everything up to 2fc7761 in 1 minute and 43 seconds
More details
- Looked at
355
lines of code in8
files - Skipped
0
files when reviewing. - Skipped posting
7
drafted comments based on config settings.
1. py/core/database/graphs.py:81
- Draft comment:
The_get_vector_column_str
function is correctly used here to handle vector column string generation, improving code readability and maintainability. - Reason this comment was not posted:
Confidence changes required:0%
The PR introduces a utility function_get_vector_column_str
to handle vector column string generation, which is used in multiple files. This function is correctly implemented to handle NaN dimensions by returning an empty string, allowing Postgres to handle any dimension. The function is used in various places to replace inline logic, improving code readability and maintainability.
2. py/core/database/graphs.py:529
- Draft comment:
The_get_vector_column_str
function is correctly used here to handle vector column string generation, improving code readability and maintainability. - Reason this comment was not posted:
Confidence changes required:0%
The PR introduces a utility function_get_vector_column_str
to handle vector column string generation, which is used in multiple files. This function is correctly implemented to handle NaN dimensions by returning an empty string, allowing Postgres to handle any dimension. The function is used in various places to replace inline logic, improving code readability and maintainability.
3. py/core/database/graphs.py:1017
- Draft comment:
The_get_vector_column_str
function is correctly used here to handle vector column string generation, improving code readability and maintainability. - Reason this comment was not posted:
Confidence changes required:0%
The PR introduces a utility function_get_vector_column_str
to handle vector column string generation, which is used in multiple files. This function is correctly implemented to handle NaN dimensions by returning an empty string, allowing Postgres to handle any dimension. The function is used in various places to replace inline logic, improving code readability and maintainability.
4. py/core/database/chunks.py:123
- Draft comment:
The_get_vector_column_str
function is correctly used here to handle vector column string generation, improving code readability and maintainability. - Reason this comment was not posted:
Confidence changes required:0%
The PR introduces a utility function_get_vector_column_str
to handle vector column string generation, which is used in multiple files. This function is correctly implemented to handle NaN dimensions by returning an empty string, allowing Postgres to handle any dimension. The function is used in various places to replace inline logic, improving code readability and maintainability.
5. py/core/database/chunks.py:155
- Draft comment:
The_get_vector_column_str
function is correctly used here to handle vector column string generation, improving code readability and maintainability. - Reason this comment was not posted:
Confidence changes required:0%
The PR introduces a utility function_get_vector_column_str
to handle vector column string generation, which is used in multiple files. This function is correctly implemented to handle NaN dimensions by returning an empty string, allowing Postgres to handle any dimension. The function is used in various places to replace inline logic, improving code readability and maintainability.
6. py/core/database/chunks.py:222
- Draft comment:
The_get_vector_column_str
function is correctly used here to handle vector column string generation, improving code readability and maintainability. - Reason this comment was not posted:
Confidence changes required:0%
The PR introduces a utility function_get_vector_column_str
to handle vector column string generation, which is used in multiple files. This function is correctly implemented to handle NaN dimensions by returning an empty string, allowing Postgres to handle any dimension. The function is used in various places to replace inline logic, improving code readability and maintainability.
7. py/core/database/chunks.py:327
- Draft comment:
The_get_vector_column_str
function is correctly used here to handle vector column string generation, improving code readability and maintainability. - Reason this comment was not posted:
Confidence changes required:0%
The PR introduces a utility function_get_vector_column_str
to handle vector column string generation, which is used in multiple files. This function is correctly implemented to handle NaN dimensions by returning an empty string, allowing Postgres to handle any dimension. The function is used in various places to replace inline logic, improving code readability and maintainability.
Workflow ID: wflow_kGlO0vQo1UOynxr6
You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet
mode, and more.
[embedding]
provider = "litellm"
concurrent_request_limit = 32 # Embedding concurrency limit
base_model = "openai/nebius/bge-en-icl"
base_dimension = nan my export OPENAI_API_KEY=sk-123
export OPENAI_API_BASE="https://litellm.mywebsite.com/v1"
export LITELLM_PROXY_API_KEY=sk-123
export LITELLM_PROXY_API_BASE="https://litellm.mywebsite.com/v1" Does not work.
When I try to ingest files, I again get the same error message that r2r is trying to set dimensionality parameters when posting to my LiteLLM Proxy.
|
@qdrddr we haven't yet pushed a release with this change. You'd have to build off of main for it to take effect. |
I tried to build from the main, but got errors with the container keep restarting. |
FYI this is the patch that was working for me when applied to v3.3.22 (but doesn't work in 3.3.24 anymore):
|
Allows for specifying
nan
dimensions for models that do not support dimension parameters.