
community: add support for using GPUs with FastEmbedEmbeddings #29627

Merged: 2 commits into langchain-ai:master on Feb 6, 2025

Conversation

vemonet (Contributor) commented Feb 6, 2025

  • Description: adds a gpu: bool = False field to the FastEmbedEmbeddings class, which enables GPU use (through the ONNX CUDA provider) when generating embeddings with any fastembed model. It only requires the user to install a different dependency; we then pass a different provider when instantiating fastembed.TextEmbedding.
  • Issue: when generating embeddings for a very large number of documents, this drastically improves performance. In some situations it is a must-have: CPU-only embedding is simply too slow.
  • Dependencies: no direct change to the dependencies, but users will need to install fastembed-gpu instead of fastembed. I updated the init function and docstrings so the user knows which dependency to install depending on whether they enabled gpu.

cf. fastembed docs about GPU for more details: https://qdrant.github.io/fastembed/examples/FastEmbed_GPU/
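The selection logic the PR describes can be sketched as follows. This is illustrative, not the final langchain-community source: the function name and return shape are made up here, but the package names and the ONNX Runtime provider name match what the fastembed GPU docs describe.

```python
def fastembed_setup(gpu: bool = False):
    """Sketch of the gpu-flag behavior described in the PR (illustrative names).

    With gpu=True the user is expected to have installed the fastembed-gpu
    package, and the ONNX CUDA execution provider is passed to
    fastembed.TextEmbedding; otherwise the plain fastembed package and its
    default CPU provider are used.
    """
    if gpu:
        pkg_to_install = "fastembed-gpu"
        providers = ["CUDAExecutionProvider"]
    else:
        pkg_to_install = "fastembed"
        providers = None  # fastembed falls back to its default CPU provider
    return pkg_to_install, providers

# The embedding model would then be created roughly as:
#   TextEmbedding(model_name=..., providers=providers)
```

The point of returning both values together is that the install hint in the error message and the provider passed to fastembed must stay consistent with the same gpu flag.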

I did not add a test because it would require access to a GPU in the testing environment.

… greatly increase speed and scalability when generating embeddings for a large amount of documents with a GPU
@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Feb 6, 2025

@dosubot dosubot bot added community Related to langchain-community Ɑ: embeddings Related to text embedding models module labels Feb 6, 2025
-    if importlib.metadata.version("fastembed") < MIN_VERSION:
+    if importlib.metadata.version(pkg_to_import) < MIN_VERSION:
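The diff above generalizes the version check so it queries whichever distribution is actually installed. A minimal sketch of that check, assuming a MIN_VERSION placeholder (the real constant lives in the langchain-community source) and naive string comparison as in the original code:

```python
import importlib.metadata

MIN_VERSION = "0.2.0"  # illustrative placeholder, not the PR's actual value


def pkg_to_import(gpu: bool) -> str:
    # The importable module is "fastembed" either way; only the installed
    # distribution name differs, and importlib.metadata.version() needs
    # the distribution name.
    return "fastembed-gpu" if gpu else "fastembed"


def check_version(gpu: bool) -> None:
    pkg = pkg_to_import(gpu)
    try:
        installed = importlib.metadata.version(pkg)
    except importlib.metadata.PackageNotFoundError as exc:
        raise ImportError(f"Please install {pkg}>={MIN_VERSION}") from exc
    if installed < MIN_VERSION:  # naive string compare, as in the original
        raise ImportError(f"{pkg}>={MIN_VERSION} required, found {installed}")
```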
Collaborator commented:

Assuming fastembed and fastembed-gpu versions are kept in sync? (They appear on the same minor version now.)


@dosubot dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Feb 6, 2025
@ccurme ccurme merged commit 0ac5536 into langchain-ai:master Feb 6, 2025
19 checks passed
ccurme pushed a commit that referenced this pull request Feb 6, 2025
…bedEmbeddings (#29631)

Made a mistake in the module to import (the module stays the same; only the installed package changes); fixed it and tested it.

#29627

Labels: community (Related to langchain-community), Ɑ: embeddings (Related to text embedding models module), lgtm (PR looks good), size:S (PR changes 10-29 lines, ignoring generated files)