Bug in bm25 implementation. #1718

Closed
se4u opened this issue Nov 15, 2017 · 3 comments
Labels: bug, difficulty easy, good first issue


se4u commented Nov 15, 2017

In line 52 of bm25.py the variable self.corpus_size should be replaced by len(document).

https://github.com/RaRe-Technologies/gensim/blob/82c394a9085d583e8a75c2bb32ecd37cf61236f0/gensim/summarization/bm25.py#L52
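For context, here is a minimal sketch of the standard Okapi BM25 score for one document. It is not the gensim code, and it illustrates why the length term matters. The function name `bm25_score` and the values of `K1` and `B` are assumptions chosen for illustration. In BM25 the normalization factor depends on the length of the document being scored relative to the average document length. If the corpus size sat in that position instead, every document would get the same factor and length normalization would have no effect on the ranking.

```python
import math
from collections import Counter

K1 = 1.5   # term-frequency saturation (illustrative value, an assumption)
B = 0.75   # strength of length normalization (illustrative value, an assumption)


def bm25_score(query, doc, corpus):
    """Score one tokenized document against a tokenized query.

    Hypothetical helper for illustration; not part of gensim.
    """
    n_docs = len(corpus)
    avgdl = sum(len(d) for d in corpus) / n_docs
    tf = Counter(doc)
    score = 0.0
    for term in query:
        if term not in tf:
            continue
        # Number of documents that contain the term.
        df = sum(1 for d in corpus if term in d)
        idf = math.log(n_docs - df + 0.5) - math.log(df + 0.5)
        # Length normalization uses the length of the scored document,
        # not the number of documents in the corpus.
        norm = 1 - B + B * len(doc) / avgdl
        score += idf * tf[term] * (K1 + 1) / (tf[term] + K1 * norm)
    return score


# Two documents with the same term frequency for "cat" but different lengths.
corpus = [
    ["cat", "sat"],
    ["cat", "sat", "on", "the", "mat", "today"],
    ["dog"],
    ["bird"],
    ["fish"],
]
short_score = bm25_score(["cat"], corpus[0], corpus)
long_score = bm25_score(["cat"], corpus[1], corpus)
```

With per-document length normalization, the shorter document scores higher than the longer one for the same term frequency.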

menshikh-iv (Contributor) commented

Nice catch @se4u, thanks for the report!

menshikh-iv added the bug, difficulty easy, and good first issue labels on Nov 16, 2017
souravsingh (Contributor) commented

@se4u Can you tell us how the bug affects the execution of the summarization module? A small code sample reproducing the bug would be nice.


se4u commented Nov 18, 2017 via email
