add sense2vec support too and integrate with POS-config #26

davidberenstein1957 · 2022-12-04T06:53:03Z

No description provided.

prakhar251998 · 2022-12-06T05:26:02Z

Hi @davidberenstein1957, saw the capabailities of sense2vec library and how it will work much better than some of the pretrained glove word2vec models.
My question was is there are a way we can add support for more state of the art word vector embeddings like sentence transformers,BERT etc.?

davidberenstein1957 · 2022-12-06T06:10:41Z

@prakhar251998 thanks for the suggestion, but sadly this wouldn´t be possible. The concise-concepts library works based on a find_most_similar search within pre-defined embeddings based on tokens present in the embedding model. For word2vec-like models, these tokens are pre-defined/indexed and have a stand-alone semantical meaning like apple being used in a similar context as pear. For transformer-based models, the index is mostly limited to a sub-word/character level and therefore doesn´t allow for a find_most_similar operation.

I you would like to use these kinds of embeddings, you could potentially create a semantic-search knowledge base with KNN/ANN and embeddings based on the descriptions of the potential entities, but maybe this costs too much effort.

davidberenstein1957 added the enhancement New feature or request label Dec 6, 2022

davidberenstein1957 added a commit that referenced this issue Jan 12, 2023

#26 added sense2vec support

84398ce

davidberenstein1957 closed this as completed Jan 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add sense2vec support too and integrate with POS-config #26

add sense2vec support too and integrate with POS-config #26

davidberenstein1957 commented Dec 4, 2022

prakhar251998 commented Dec 6, 2022

davidberenstein1957 commented Dec 6, 2022

add sense2vec support too and integrate with POS-config #26

add sense2vec support too and integrate with POS-config #26

Comments

davidberenstein1957 commented Dec 4, 2022

prakhar251998 commented Dec 6, 2022

davidberenstein1957 commented Dec 6, 2022