Add docstrings for Wordrank #1378

parulsethi · 2017-06-01T08:23:59Z

Added a description of wordrank training.
Corrected the output_dir param (the previous value caused an error in .load_wordrank_model(), which loads files from output_dir).
Fixed the example in the wordrank file's docstring (fixes #1384, "TypeError: __init__() takes exactly 1 argument (4 given)" error when calling Wordrank).
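
For readers unfamiliar with this kind of TypeError, here is a minimal sketch of how it can arise. It assumes the old example passed three positional arguments straight to the constructor rather than to the train() classmethod. That reading is inferred from the error text, not confirmed in this thread. `KeyedVectorsStandIn` and `WordrankStandIn` are hypothetical stand-ins, not the gensim classes.

```python
class KeyedVectorsStandIn:
    """Hypothetical base class whose constructor accepts no arguments besides self."""

    def __init__(self):
        self.vocab = {}


class WordrankStandIn(KeyedVectorsStandIn):
    """Hypothetical wrapper; training is exposed as a classmethod, not via __init__."""

    @classmethod
    def train(cls, wr_path, corpus_file, out_name):
        # Build the instance with the no-argument constructor, then attach settings.
        model = cls()
        model.settings = (wr_path, corpus_file, out_name)
        return model


def call_constructor_directly():
    """Return the TypeError message raised by passing arguments to the constructor."""
    try:
        WordrankStandIn("path/to/wordrank", "corpus.txt", "out_dir")
    except TypeError as err:
        return str(err)
    return None


error_message = call_constructor_directly()
model = WordrankStandIn.train("path/to/wordrank", "corpus.txt", "out_dir")
```

The exact wording of the message depends on the Python version; the one quoted in #1384 is the Python 2 form.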

tmylk · 2017-06-01T10:44:15Z

The only failures are style ones: W291 (trailing whitespace).

menshikh-iv · 2017-06-03T15:59:32Z

gensim/models/wrappers/wordrank.py

@@ -47,8 +47,12 @@ class Wordrank(KeyedVectors):
    @classmethod
    def train(cls, wr_path, corpus_file, out_name, size=100, window=15, symmetric=1, min_count=5, max_vocab_size=0,
              sgd_num=100, lrate=0.001, period=10, iter=90, epsilon=0.75, dump_period=10, reg=0, alpha=100,
-              beta=99, loss='hinge', memory=4.0, cleanup_files=True, sorted_vocab=1, ensemble=0):
+              beta=99, loss='hinge', memory=4.0, cleanup_files=False, sorted_vocab=1, ensemble=0):


What is the reason for changing cleanup_files to False?

parulsethi · 2017-06-03

With cleanup_files=False, the (word/context) embedding files and the vocab file that wordrank generates during training, which are saved inside wordrank's directory, are not deleted. The train() method does load the final embedding file it needs before deleting everything generated during training, but the deletion could confuse users who expect to find those files once training has finished.
So keeping them by default seems the better choice, to avoid that confusion.
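
A rough sketch of the behaviour described above, for illustration only. It does not call wordrank or gensim: `train_sketch` and the three filenames are hypothetical placeholders, and the only point is the order of operations (load the needed file first, then optionally delete the generated directory).

```python
import os
import shutil
import tempfile


def train_sketch(work_dir, out_name, cleanup_files=False):
    """Simulate a training run that writes files, loads one, and may delete them."""
    out_dir = os.path.join(work_dir, out_name)
    os.makedirs(out_dir)

    # Placeholder outputs; the real filenames are listed in the train() docstring.
    for name in ("vocab.txt", "word_embedding.txt", "context_embedding.txt"):
        with open(os.path.join(out_dir, name), "w") as handle:
            handle.write("placeholder\n")

    # The embedding is loaded before any cleanup, so the returned data is
    # available either way.
    with open(os.path.join(out_dir, "word_embedding.txt")) as handle:
        loaded = handle.read()

    if cleanup_files:
        shutil.rmtree(out_dir)
    return loaded, out_dir


work_dir = tempfile.mkdtemp()
kept_data, kept_dir = train_sketch(work_dir, "kept", cleanup_files=False)
removed_data, removed_dir = train_sketch(work_dir, "removed", cleanup_files=True)
kept_exists = os.path.isdir(kept_dir)
removed_exists = os.path.isdir(removed_dir)
shutil.rmtree(work_dir)
```

Both calls return the loaded data, but only the cleanup_files=False run leaves the generated files on disk for the user to inspect.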

menshikh-iv · 2017-06-04

Please enumerate the output files (filename and what each file contains) in the docstring.

parulsethi · 2017-06-05

I've added the output filenames and a description of their contents to the out_name param description, because that is the directory which contains these files.

menshikh-iv · 2017-06-06T03:15:19Z

Thank you @parulsethi 👍

added docstring for train method

b0b66d2

parulsethi added 4 commits June 1, 2017 16:28

fix flake8 error

bffdee6

fix flake8 error

251b6e7

fix flake8 error

fa424e5

fix doc

e28e6bb

parulsethi mentioned this pull request Jun 2, 2017

"TypeError: __init__() takes exactly 1 argument (4 given)" error when calling Wordrank #1384

Closed

menshikh-iv reviewed Jun 3, 2017

added info for content generated by wordrank

0d3495c

menshikh-iv merged commit 0e6f1b2 into piskvorky:develop Jun 6, 2017

parulsethi deleted the wordrank_docs branch June 12, 2017 14:51

parulsethi mentioned this pull request Jun 12, 2017

Fix wordrank tests #1410

Merged
