Can I feed 500K documents to rank_bm25? #27
Comments
@Witiko can you please provide any insight?
@ramsey-coding I don't see a reason why it shouldn't. Have you tried?
@Witiko the problem is the call to the It is taking ~5 seconds per call.
@dorianbrown the library is slow at retrieving from ~350K samples. Can you please advise on what to do here?
Hi @ramsey-coding, I have just released a new Python-based search engine called
Better to use Elasticsearch. The Python version can be slow enough to drive you crazy.
You should try my library
This library started as a side project, and gained a fair amount of traction organically. It was designed as a fairly simple implementation of these retrieval algorithms, but won't compare to something like the mentioned I've now also added a remark in the readme to direct users to
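For anyone wondering why per-query latency can grow with corpus size: a simple implementation that scores every document for every query term does work proportional to the whole corpus on each call. An inverted index only visits the documents that actually contain a query term. Below is a minimal pure-Python sketch of that idea. It is not taken from rank_bm25 or from any library mentioned in this thread; the class name `InvertedBM25`, its method names, the defaults `k1=1.5` and `b=0.75`, and the smoothed IDF variant are all assumptions made for illustration.

```python
import heapq
import math
from collections import Counter, defaultdict


class InvertedBM25:
    """Toy BM25 index (hypothetical; not part of rank_bm25)."""

    def __init__(self, tokenized_docs, k1=1.5, b=0.75):
        self.k1 = k1
        self.b = b
        self.n_docs = len(tokenized_docs)
        self.doc_len = [len(doc) for doc in tokenized_docs]
        self.avgdl = sum(self.doc_len) / self.n_docs if self.n_docs else 0.0
        # Inverted index: term -> list of (doc_id, term frequency in that doc).
        self.postings = defaultdict(list)
        for doc_id, doc in enumerate(tokenized_docs):
            for term, tf in Counter(doc).items():
                self.postings[term].append((doc_id, tf))

    def idf(self, term):
        # Smoothed IDF that stays non-negative (an assumed variant).
        df = len(self.postings.get(term, ()))
        return math.log(1.0 + (self.n_docs - df + 0.5) / (df + 0.5))

    def top_n(self, query_tokens, n=5):
        """Return up to n (doc_id, score) pairs, best first."""
        scores = defaultdict(float)
        # Each distinct query term is scored once.
        for term in set(query_tokens):
            idf = self.idf(term)
            # Only documents containing the term are touched.
            for doc_id, tf in self.postings.get(term, ()):
                length_ratio = self.doc_len[doc_id] / self.avgdl
                norm = self.k1 * (1 - self.b + self.b * length_ratio)
                scores[doc_id] += idf * tf * (self.k1 + 1) / (tf + norm)
        return heapq.nlargest(n, scores.items(), key=lambda item: item[1])


# Tiny usage example with a made-up corpus.
corpus = [
    "hello world".split(),
    "foo bar baz".split(),
    "hello there hello".split(),
]
index = InvertedBM25(corpus)
print(index.top_n("hello".split(), n=2))
```

With rare query terms the posting lists are short, so the per-query cost depends on how many documents match rather than on the total corpus size. Very common terms still produce long posting lists, which is one reason dedicated search engines add further optimizations on top of this.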
Thanks for this awesome library.
I am curious to know whether rank_bm25 can handle 500K documents. Each document has around 1000 words.
Looking forward to your feedback. I want to use the following functionality with rank_bm25: