Skip to content

Commit

Permalink
Merge pull request #16 from akb89/develop
Browse files Browse the repository at this point in the history
updated wget urls
  • Loading branch information
akb89 authored Nov 2, 2020
2 parents cb1d4be + 59f379f commit d2393e3
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 4 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -47,18 +47,18 @@ python3 setup.py install
## Pre-requisites
To run Nonce2Vec, you need two gensim Word2Vec models (a skipgram model and a cbow model to compute informativeness-metrics). You can download the skipgram model from:
```bash
wget http://129.194.21.122/~kabbach/gensim.w2v.skipgram.model.7z
wget backup.3azouz.net/gensim.w2v.skipgram.model.7z
```
and the cbow model from:
```sh
wget http://129.194.21.122/~kabbach/gensim.w2v.cbow.model.7z
wget backup.3azouz.net/gensim.w2v.cbow.model.7z
```
or generate both yourself following the instructions below.

### Generating a Word2Vec model from a Wikipedia dump
You can download our English Wikipedia dump of January 2019 here:
```bash
wget http://129.194.21.122/~kabbach/enwiki.20190120.7z
wget backup.3azouz.net/enwiki.20190120.7z
```
If you want to generate a completely new (tokenized-one-sentence-per-line) dump
of Wikipedia, for English or any other language, check out [WiToKit](https://github.com/akb89/witokit).
Expand Down
2 changes: 1 addition & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@
author_email='akb@3azouz.net',
long_description=long_description,
long_description_content_type='text/markdown',
version='2.0.1',
version='2.0.2',
url='https://github.com/minimalparts/nonce2vec',
download_url='https://github.com/minimalparts/nonce2vec/#files',
license='MIT',
Expand Down

0 comments on commit d2393e3

Please sign in to comment.