Skip to content

用于词向量的相似性任务、类比任务的数据。包含了中文新闻语料分类的实验数据。(Data for similarity tasks and analogy tasks of word vectors. The experimental data of Chinese news corpus classification are included.)

Notifications You must be signed in to change notification settings

CallMeJiaGu/WordSimilarityAnalogyData

Repository files navigation

WordSimilarityAnalogyData 用于验证词向量效果好坏的数据集。

词的相似性任务-Word Similarity

常用的英文数据集:WordSim-353 、MEN、SCWS

WordSim-353: http://alfonseca.org/eng/research/wordsim353.html、http://www.cs.technion.ac.il/~gabr/resources/data/wordsim353/

MEN: https://staff.fnwi.uva.nl/e.bruni/MEN

SCWS:http://ai.stanford.edu/~ehhuang/

常用的中文数据集:wordsim-240、wordsim-297

在该仓库能找到(wordsim-240、wordsim-297)

词的类比任务-Word Analogy

常用的中文数据集:Chen 2015年构造的评测文件

在本仓库能找到。(Chen 2015年构造的评测文件)

About

用于词向量的相似性任务、类比任务的数据。包含了中文新闻语料分类的实验数据。(Data for similarity tasks and analogy tasks of word vectors. The experimental data of Chinese news corpus classification are included.)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published