You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Added models which cover several different languages: one for combined Germanic and Romance languages, one for the Slavic languages available in UDCoref #1406
XCL (Classical Armenian) models with word vectors from Caval
bugfixes
update tqdm usage to remove some duplicate code: #14133de69ca
long list of incorrectly tokenized Spanish words added directly to the combined Spanish training data to improve their tokenization: #1410
Occasionally train the tokenizer with the sentence final punctuation of a batch removed. This helps the tokenizer avoid learning to tokenize the last character regardless of whether or not it is punctuation. This was also related to the Spanish tokenization issue 56350a0
actually include the visualization: #1421 thank you @bollwyvl