
[WIP] Changes in sklearn wrappers for LDA and LSI models #1398

Merged

Conversation

chinmayapancholi13 (Contributor)

This PR makes the following changes in the sklearn wrappers for the LDA and LSI models (a sketch of the resulting pattern follows the list):

  • Doesn't train the model from within the wrappers' __init__ function (following the convention used by scikit-learn estimators)
  • Uses super() when calling the parent class's __init__ from the fit function
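
A minimal sketch of the resulting pattern, assuming a hypothetical SklLdaWrapper class with a reduced parameter set (illustrative, not the wrapper's actual code):

```python
from gensim import matutils, models
from sklearn.base import BaseEstimator, TransformerMixin


class SklLdaWrapper(models.LdaModel, BaseEstimator, TransformerMixin):
    """Illustrative sklearn-style wrapper around gensim's LdaModel."""

    def __init__(self, num_topics=100, id2word=None, passes=1):
        # Only store hyperparameters here; no training happens in
        # __init__, matching the scikit-learn estimator convention.
        self.num_topics = num_topics
        self.id2word = id2word
        self.passes = passes

    def fit(self, X, y=None):
        # Training is deferred to fit(): the corpus X is handed to the
        # parent gensim model's __init__ via super(), which trains it.
        super(SklLdaWrapper, self).__init__(
            corpus=X, num_topics=self.num_topics,
            id2word=self.id2word, passes=self.passes)
        return self

    def transform(self, docs):
        # Map each bag-of-words document to a dense topic-probability vector.
        return [matutils.sparse2full(self[doc], self.num_topics) for doc in docs]
```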

@@ -46,15 +46,6 @@ def __init__(
self.gamma_threshold = gamma_threshold
self.minimum_probability = minimum_probability
self.random_state = random_state
Contributor
Please remove the corpus parameter from the constructor (a corpus should be passed only to the fit* methods) in both models.
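
In other words, a corpus would be supplied only at fit time. A hypothetical usage sketch, reusing the illustrative SklLdaWrapper from above:

```python
from gensim.corpora import Dictionary

texts = [["human", "computer", "interaction"], ["graph", "trees"]]
dictionary = Dictionary(texts)
corpus = [dictionary.doc2bow(text) for text in texts]

# The constructor receives only hyperparameters...
model = SklLdaWrapper(num_topics=2, id2word=dictionary)
# ...and the corpus is supplied at fit time instead.
model.fit(corpus)
```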

@menshikh-iv (Contributor)

The same as #1405

@chinmayapancholi13 changed the title from "Changes in sklearn wrappers for LDA and LSI models" to "[WIP] Changes in sklearn wrappers for LDA and LSI models" on Jun 15, 2017
transformed_approx = matutils.sparse2full(transformed, 2)  # better approximation
-       expected = [0.13, 0.87]
+       expected = [0.87, 0.0]
passed = numpy.allclose(sorted(transformed_approx), sorted(expected), atol=1e-1)
self.assertTrue(passed)
Contributor
This is often broken on Travis; please check what the reason for this is.

Contributor Author
I have made changes to solve this problem and this should now be resolved. :)

@@ -191,6 +181,22 @@ def testSetGetParams(self):
for key in param_dict.keys():
self.assertEqual(model_params[key], param_dict[key])

def testPersistence(self):
Contributor
Same as LdaSeq

Contributor Author
Thanks. Done.

score = text_lda.score(corpus, data.target)
text_lsi = Pipeline((('features', model,), ('classifier', clf)))
text_lsi.fit(corpus, data.target)
score = text_lsi.score(corpus, data.target)
self.assertGreater(score, 0.50)
Contributor
same as LdaSeq

Contributor Author
Thanks. Done.
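
For context, the snippet above scores the wrappers inside a scikit-learn Pipeline. A minimal runnable sketch of that pattern, again using the illustrative SklLdaWrapper and a toy dataset (both are assumptions, not the test's actual fixtures):

```python
from gensim.corpora import Dictionary
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# Toy dataset: two classes of tiny documents.
texts = [["human", "computer", "interaction"],
         ["graph", "trees", "minors"],
         ["computer", "system", "interface"],
         ["graph", "minors", "survey"]]
targets = [0, 1, 0, 1]
dictionary = Dictionary(texts)
corpus = [dictionary.doc2bow(text) for text in texts]

# The wrapper acts as the feature-extraction step of the pipeline.
model = SklLdaWrapper(num_topics=2, id2word=dictionary)
clf = LogisticRegression()
text_lda = Pipeline([('features', model), ('classifier', clf)])
text_lda.fit(corpus, targets)
score = text_lda.score(corpus, targets)  # mean accuracy on the training docs
```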

@menshikh-iv merged commit b989be7 into piskvorky:develop on Jun 20, 2017