Vector Search and Semantic Caching #417

slorello89 · 2023-11-02T20:35:13Z

Introduces Vector Search and Semantic caching to Redis OM .NET.

One breaking change - replace string[] with object[] in some key places (e.g. Execute/ExecuteAsync) as the byte arrays needed for VectorSearch need to be passed in raw. Should be transparent to anyone using the higher-level APIs within Redis OM, but anyone using those raw commands might need to make some adjustments. See README for details as to how to use the new API.

slorello89 · 2023-11-02T20:35:44Z

Some broken tests at first (need to obtain keys for the integration tests with OpenAI/HuggingFace/Azure)

slorello89 · 2023-11-02T20:40:03Z

FYI @Spartee, @tylerhutcherson, & @banker

bsbodden

L-very-GTM!

tylerhutcherson

Heck yeah. This is awesome Steve. I left a few readme suggestion/ideas, mainly focused on clarity and flow. Where will this be represented in the docs?

tylerhutcherson · 2023-11-03T14:05:08Z

README.md

+
+A `Vector<T>` is a representation of an object that can be transformed into a vector by a Vectorizer.
+
+A `VectorizerAttribute` is the abstract class you use to decorate your Vector fields, it is responsible for defining the logic to convert your Vectors into Embeddings. In the package `Redis.OM.Vectorizers` we provide vectorizers for HuggingFace, OpenAI, and AzureOpenAI to allow you to easily integrate them into your workflows.


the phrase "convert your Vectors into Embeddings" is a bit misleading as those two terms are relatively interchangeable. I think we're essentially talking about the definition of the various vector field attributes like distance metric, data type, dims, etc? Some of those subsumed by the choice of vectorizer for sure

changed the verbiage a bit - hopefully this is better?

tylerhutcherson · 2023-11-03T14:05:47Z

README.md

+    [RedisIdField]
+    public string Id { get; set; }
+
+    [Indexed(DistanceMetric = DistanceMetric.COSINE)]


Can you also show how a few other vector field attributes like index type (HNSW vs FLAT) and related args are set here?

Added a couple of other parameters for the index definition, and explained it a bit better in the modeling section.

tylerhutcherson · 2023-11-03T14:06:28Z

README.md

+
+With Redis OM, the embeddings can be completely transparent to you, they are generated and bound to the `Vector<T>` when you query/insert your vectors. If however you needed your embedding after the insertion/Query, they are available at `Vector<T>.Embedding`, and be queried either as the raw bytes, as an array of doubles or as an array of floats (depending on your vectorizer).
+
+#### Configuration


Add other vector field attribute level configuration details here too?

Added details about index definition in the modeling section as that's more or less where it belongs (the configuration section is talking about configuring the vectorizers)

README.md

tylerhutcherson · 2023-11-03T14:17:07Z

README.md

+With the vector defined in our model, all we need to do is create Vectors of the generic type, and insert them with our model. Using our `RedisCollection`, you can do this by simply using `Insert`:
+
+```cs
+var query = new OpenAIQuery


naming this query makes sense given the implied caching use case here. Maybe spell it out a bit so it's clear why we are inserting a "query" object into your vector database?

Query is confusing in this context, is OpenAICompletionResult & completionResult better? (It's something that's not query that you might actually do with these embeddings lol).

…ibution of the model files)

…otnet into feature/vectors

slorello89 added 18 commits October 6, 2023 07:41

starting vectors

e6f3627

working out vector strings

241645c

string arrays to object arrays

9150f41

seralization/deserialization of vectors

a377faf

initial queries working

bc6ce05

Vector Range, score binding

c257207

removing extra FromHashSet method

feb0946

hybrid queries

04fbb90

external-vectorizers

2a9ad73

semantic cache start

f2866f0

more vectorizers

21957f5

some semantic caching

4fbd0bf

azure openai vectorizer.

30aca01

merge

57f4d13

reorganizing vectorizers

06b888b

updating nuget packaging

65272dc

use Vector<T> instead of just the given type

16c694c

readme updates

73c6c05

slorello89 added breakingchange feature labels Nov 2, 2023

slorello89 requested review from bsbodden and shacharPash November 2, 2023 20:35

updating docker image to .NET 7

9a3a257

bsbodden approved these changes Nov 2, 2023

View reviewed changes

slorello89 added 2 commits November 3, 2023 08:52

test cleanup

ecc0a91

slight tweak to NearestNeighbors/VectorRange APIs

9d09c3a

tylerhutcherson approved these changes Nov 3, 2023

View reviewed changes

updates per Tylers comments

980e737

slorello89 added 2 commits November 3, 2023 16:06

changing query -> CompletionResult and queryPrompt -> prompt

efa7d33

adding setter for embedding

d33e935

slorello89 force-pushed the feature/vectors branch from a79689c to d33e935 Compare November 27, 2023 14:18

slorello89 and others added 15 commits November 27, 2023 09:20

Resnet18 and AllMiniLML6V2 vectorizers

d7cdae8

removing dependency on ML Resnet project (to support transitive distr…

81db70d

…ibution of the model files)

moving to file based validation

8d68528

semantic caching for native vectorizers

18e212a

Merge branch 'main' into feature/vectors

59c40f7

normalizing vectorization pipelines

22e4978

Merge branch 'feature/vectors' of https://github.com/redis/redis-om-d…

2e9264c

…otnet into feature/vectors

fixing csproj file

7b8f9c9

fixing test host for Vectorizer tests

1528684

lfs

e5aab0e

removing System.Drawing dep, adding onnxruntime

59b2b13

removing warnings

843a81d

adding MIT licenses for sources

3fc563f

readme updates

4411d02

Encode -> Vectorize

96eae51

slorello89 linked an issue Dec 5, 2023 that may be closed by this pull request

Enhancement: Add support for "vector similarity" search #342

Closed

slorello89 merged commit cf41ed7 into main Dec 5, 2023
1 check passed

VagyokC4 mentioned this pull request Dec 6, 2023

Can Redis OM support RESTful Webdis interface to avoid persistent connection issues? #352

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vector Search and Semantic Caching #417

Vector Search and Semantic Caching #417

slorello89 commented Nov 2, 2023

slorello89 commented Nov 2, 2023

slorello89 commented Nov 2, 2023

bsbodden left a comment

tylerhutcherson left a comment

tylerhutcherson Nov 3, 2023

slorello89 Nov 3, 2023

tylerhutcherson Nov 3, 2023

slorello89 Nov 3, 2023

tylerhutcherson Nov 3, 2023

slorello89 Nov 3, 2023

tylerhutcherson Nov 3, 2023

slorello89 Nov 3, 2023


		A `Vector<T>` is a representation of an object that can be transformed into a vector by a Vectorizer.

		A `VectorizerAttribute` is the abstract class you use to decorate your Vector fields, it is responsible for defining the logic to convert your Vectors into Embeddings. In the package `Redis.OM.Vectorizers` we provide vectorizers for HuggingFace, OpenAI, and AzureOpenAI to allow you to easily integrate them into your workflows.


		With Redis OM, the embeddings can be completely transparent to you, they are generated and bound to the `Vector<T>` when you query/insert your vectors. If however you needed your embedding after the insertion/Query, they are available at `Vector<T>.Embedding`, and be queried either as the raw bytes, as an array of doubles or as an array of floats (depending on your vectorizer).

		#### Configuration

Vector Search and Semantic Caching #417

Vector Search and Semantic Caching #417

Conversation

slorello89 commented Nov 2, 2023

slorello89 commented Nov 2, 2023

slorello89 commented Nov 2, 2023

bsbodden left a comment

Choose a reason for hiding this comment

tylerhutcherson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment