-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Word2Vec.build_vocab(…, update=True) gives "ValueError: all the input array dimensions except for the concatenation axis must match exactly" #1162
Comments
Are you by chance passing the same There is somewhat of an expectation that the corpus for an update would contain new words, or else you wouldn't need to be calling Separately, those cases where there are no words – perhaps because the new material is very small or very redundant with previous training – are also cases where subsequent training may not be a net benefit to the model. (This incremental-vocabulary-expansion option is best considered an experimental feature to should be evaluated carefully after each use – not a true 'online' training option where incremental new examples always or even usually lead to net improvements.) So I'd be wary of any process where an update-vocab is being casually called with small/redundant new batches. |
Thank you for your response, I did not realize I was using this improperly. |
It is not an issue but a better error message is needed. |
The
build_vocab
function does not work with the ability to update online.i.e.:
works fine, but
does not.
It provides the following error:
The text was updated successfully, but these errors were encountered: