CBOW model equivalent to the supervised learning model of fastText #960
Comments
For completeness, linking to the FastText wrapper issue #847.
I am still developing the code for this PR in https://github.com/giacbrd/ShallowLearn; I hope to start working on the fork ASAP.
@giacbrd any progress on the PR? Cheers.
I am going to release a more stable model in my project before Christmas; then I can port it to Gensim. It should be "easy"!
I am finalizing the pull request. I am still thinking about a better interface design, but in general there is not much code.
From the Gensim integration point of view, an API extending the existing FastText wrapper API would be preferable. The FastText wrapper API is not yet released, though, so it can still be changed.
Here I am doing something slightly different from re-implementing fastText. I have essentially written a variant of the Word2Vec model, with the goal of learning the mapping sets_of_words -> labels (i.e., text classification), where:
The wrapper, for now, covers only word-embedding applications, but yes, it could be extended with the "supervised learning" component of fastText.
See pull request #1153.
fastText is an "evolution" of word2vec: it contains new models for word embeddings, as well as models for learning the association document -> label, i.e. classification.
The latter can be implemented by reusing the Word2Vec class (CBOW only), defining an input layer of words and an output layer of labels, each with its own vocabulary. The concept of a training window can be dropped. For the output computation it is possible to implement the full softmax function, together with the already present "approximations" (negative sampling and the Huffman tree, i.e. hierarchical softmax).
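To make the idea concrete, here is a minimal sketch of such a CBOW-style classifier (hypothetical code, not the actual gensim or ShallowLearn implementation): a document is represented as the average of its word vectors, and a full softmax over label vectors predicts the class. All class and variable names below are illustrative.

```python
import numpy as np

class TinyLabeledCBOW:
    """Toy CBOW classifier: averaged word vectors -> softmax over labels."""

    def __init__(self, vocab, labels, dim=20, seed=0):
        rng = np.random.default_rng(seed)
        self.word_idx = {w: i for i, w in enumerate(vocab)}
        self.label_idx = {l: i for i, l in enumerate(labels)}
        self.labels = list(labels)
        # Input layer: one vector per word; output layer: one vector per label.
        self.W_in = rng.normal(0, 0.1, (len(vocab), dim))
        self.W_out = rng.normal(0, 0.1, (len(labels), dim))

    def _hidden(self, words):
        # CBOW without a window: average over ALL words in the document.
        idx = [self.word_idx[w] for w in words if w in self.word_idx]
        return self.W_in[idx].mean(axis=0)

    def train(self, docs, lr=0.1, epochs=50):
        for _ in range(epochs):
            for words, label in docs:
                h = self._hidden(words)
                scores = self.W_out @ h
                p = np.exp(scores - scores.max())
                p /= p.sum()                      # softmax over labels
                p[self.label_idx[label]] -= 1.0   # cross-entropy gradient
                grad_h = self.W_out.T @ p
                self.W_out -= lr * np.outer(p, h)
                idx = [self.word_idx[w] for w in words if w in self.word_idx]
                self.W_in[idx] -= lr * grad_h / len(idx)

    def predict(self, words):
        scores = self.W_out @ self._hidden(words)
        return self.labels[int(np.argmax(scores))]
```

The full softmax shown here is what the approximations replace: with many labels, negative sampling or hierarchical softmax avoids computing scores against every label vector.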
A LabeledWord2Vec class is already implemented in https://github.com/giacbrd/ShallowLearn/blob/master/shallowlearn/word2vec.py; it should be ported and improved (it is missing negative sampling and some method implementations).
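For the piece noted as missing, a hedged sketch of what a negative-sampling update over the label output layer could look like: instead of a full softmax, the true label is contrasted with k randomly drawn negative labels. The function name and signature are illustrative, not taken from the code above.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def neg_sampling_step(h, W_out, pos, k, lr, rng):
    """One negative-sampling update of the label vectors W_out.

    h: averaged word vectors for a document; pos: index of the true label.
    Returns the gradient w.r.t. h, to be propagated back to the word vectors.
    """
    # Draw k candidate negatives uniformly; drop any accidental hit on pos.
    negatives = [i for i in rng.choice(len(W_out), size=k) if i != pos]
    grad_h = np.zeros_like(h)
    for idx, target in [(pos, 1.0)] + [(n, 0.0) for n in negatives]:
        g = sigmoid(W_out[idx] @ h) - target   # logistic-loss derivative
        grad_h += g * W_out[idx]
        W_out[idx] -= lr * g * h
    return grad_h
```

In the real models the negatives are typically drawn from a frequency-based distribution rather than uniformly; uniform sampling is used here only to keep the sketch short.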