Installs gensim from fork #15

sidravi1 · 2022-10-20T13:51:24Z

Overview

Install gensim from our fork, which uses POT (Python Optimal Transport) instead of pyemd.

Reviewer: @lickem22
Estimate: ~30 mins

How tested

Unit tests pass!

Next steps

Once this PR is merged in, we may want to abandon our fork.

sidravi1 · 2022-10-20T14:23:13Z

faqt/model/faq_matching/keyed_vectors_scoring.py

-            corpus=preprocessed_content_tokens, kv_model=self.word_embedding_model
+            corpus=preprocessed_content_tokens,
+            kv_model=self.word_embedding_model,
+            chunksize=np.ceil(len(contents) / os.cpu_count()).astype(int),


The chunk size is now based on the number of CPUs, so the corpus is split into roughly one chunk per core.
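For reference, the chunk-size calculation in the diff above can be sketched in isolation. This is a minimal sketch, not the code in this PR: the helper names `chunk_size_per_cpu` and `split_into_chunks` are hypothetical, and the fallbacks (treating an undetermined CPU count as 1 and never returning a chunk size below 1) are assumptions added here, since `os.cpu_count()` can return `None` and an empty corpus would otherwise give a chunk size of 0.

```python
import math
import os
from typing import List, Optional, Sequence


def chunk_size_per_cpu(n_items: int, n_cpus: Optional[int] = None) -> int:
    """Return a chunk size that yields at most one chunk per CPU (hypothetical helper)."""
    if n_cpus is None:
        # os.cpu_count() returns None when the count cannot be determined;
        # falling back to 1 is an assumption of this sketch.
        n_cpus = os.cpu_count() or 1
    # Ceiling division so that n_cpus chunks cover every item; never below 1.
    return max(1, math.ceil(n_items / n_cpus))


def split_into_chunks(items: Sequence, chunksize: int) -> List[Sequence]:
    """Split items into consecutive slices of at most chunksize elements."""
    return [items[i:i + chunksize] for i in range(0, len(items), chunksize)]


if __name__ == "__main__":
    contents = [f"faq {i}" for i in range(10)]
    size = chunk_size_per_cpu(len(contents))
    print(size, len(split_into_chunks(contents, size)))
```

Using `math.ceil` on plain integers avoids the NumPy round trip (`np.ceil(...).astype(int)`) while giving the same value whenever the CPU count is a positive integer and the corpus is non-empty.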

sidravi1 added 7 commits October 18, 2022 15:36

using IDi gensim fork

38a2dd3

typo

9f2e66c

fixed link to release

72f4bd9

downgraded cython

6df0a94

fixed gensim ref

149747b

removed space

f21c59a

removed cython

bca8555


sidravi1 requested review from suzinyou and lickem22 October 20, 2022 14:23

lickem22 approved these changes Oct 26, 2022


sidravi1 merged commit baad018 into main Oct 27, 2022

sidravi1 deleted the gensim_fork branch October 27, 2022 14:58
