Add top_k parameter to ChatMessageRetriever #8258

Emil-io · 2024-08-20T14:10:32Z

Is your feature request related to a problem? Please describe.
The ChatMessageRetriever for RAG + Chat always provides the whole conversation history. That way, at some point the context window is exceeded.

Describe the solution you'd like
Something similar to a top_k parameter in the init of the component. That way, only a specified number of the latest chat messages is retrieved.
This could the also potentially be provided to a summary prompt node, that summarizes multiple ChatMessages.

Describe alternatives you've considered
Custom logic, where the last chat messages are stored separately and are concatenated to the query.

MetroCat69 · 2024-08-22T07:03:06Z

can I get this issue?

julian-risch · 2024-08-23T09:34:35Z

Thanks for the initiative @MetroCat69 Sure, please feel free to create a draft pull request early on so that we can give feedback.

vblagoje · 2024-09-02T13:20:25Z

What's the proper name here top_k, last_k or something else cc @TuanaCelik @Emil-io @julian-risch ?

Emil-io · 2024-09-02T16:39:01Z

What's the proper name here top_k, last_k or something else cc @TuanaCelik @Emil-io @julian-risch ?

Maybe last_k, so that people do not think it also filters down to the best matching text messages? Also it better describes what this parameter is doing, but this is just my opinion.

vblagoje · 2024-09-02T17:56:47Z

Yeah, good points @Emil-io thank you - I'll update the PR to use last_k as it better clarifies what's going on.

vblagoje · 2024-09-02T17:57:55Z

Hmm, but we also then deviate from retriever + top_k usage. Hmm, really not sure how to name this one. @TuanaCelik @julian-risch ?

TuanaCelik · 2024-09-04T13:05:05Z

I think we are making a mistake by calling this a retriever and this conversation made me come to this conclusion.

A retriever has too much connotation for something that will be retrieving based on similarity, embedding, etc. In which case top_k makes sense.
However, my understanding from the ChatMessageRetriever is that it simply fetches all of the messages in the store right? Would we have any use case where this retriever would be used to retrieve 'relevant messages' based on a similarity metirc? In which case, I think having both top_k and last_k may make sense but I'm a bit worried this mixes up concepts too much.

TLDR: I don't think we should use top_k to do something that is not actual 'top k'...

@anakin87 - What's your take on this?

anakin87 · 2024-09-04T13:17:41Z

I'm just commenting on the best name to give to this parameter (not the component name).

I think something like last_k, last_k_messages, or last_n_messages might work...

vblagoje · 2024-09-04T14:12:12Z

last_k it is, we can add top_k later when these become searching retrievers.

julian-risch added this to Haystack - Contributions wanted Aug 23, 2024

julian-risch moved this to In Progress in Haystack - Contributions wanted Aug 23, 2024

julian-risch added the P2 Medium priority, add to the next sprint if no P1 available label Aug 23, 2024

julian-risch changed the title ~~ChatMessageRetriever top_k~~ Add top_k parameter to ChatMessageRetriever Aug 23, 2024

julian-risch assigned vblagoje Sep 2, 2024

vblagoje mentioned this issue Sep 2, 2024

feat: Adds last_k parameter to ChatMessageRetriever init/run methods deepset-ai/haystack-experimental#68

Merged

anakin87 mentioned this issue Sep 4, 2024

ChatMessageRetriever flexibility issues deepset-ai/haystack-experimental#72

Closed

julian-risch removed the status in Haystack - Contributions wanted Sep 9, 2024

julian-risch removed this from Haystack - Contributions wanted Sep 9, 2024

vblagoje closed this as completed in deepset-ai/haystack-experimental#68 Sep 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add top_k parameter to ChatMessageRetriever #8258

Add top_k parameter to ChatMessageRetriever #8258

Emil-io commented Aug 20, 2024

MetroCat69 commented Aug 22, 2024

julian-risch commented Aug 23, 2024

vblagoje commented Sep 2, 2024

Emil-io commented Sep 2, 2024

vblagoje commented Sep 2, 2024

vblagoje commented Sep 2, 2024

TuanaCelik commented Sep 4, 2024

anakin87 commented Sep 4, 2024

vblagoje commented Sep 4, 2024 •

edited

Loading

Add top_k parameter to ChatMessageRetriever #8258

Add top_k parameter to ChatMessageRetriever #8258

Comments

Emil-io commented Aug 20, 2024

MetroCat69 commented Aug 22, 2024

julian-risch commented Aug 23, 2024

vblagoje commented Sep 2, 2024

Emil-io commented Sep 2, 2024

vblagoje commented Sep 2, 2024

vblagoje commented Sep 2, 2024

TuanaCelik commented Sep 4, 2024

anakin87 commented Sep 4, 2024

vblagoje commented Sep 4, 2024 • edited Loading

vblagoje commented Sep 4, 2024 •

edited

Loading