Skip to content

Commit

Permalink
Added da_lexeme_prob.json
Browse files Browse the repository at this point in the history
  • Loading branch information
KennethEnevoldsen committed Dec 14, 2024
1 parent 1d90ebc commit 504dc3a
Show file tree
Hide file tree
Showing 2 changed files with 7 additions and 0 deletions.
1 change: 1 addition & 0 deletions spacy_lookups_data/data/da_lexeme_prob.json

Large diffs are not rendered by default.

6 changes: 6 additions & 0 deletions spacy_lookups_data/data/da_source.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
da_lexeme_prob are derived from
https://huggingface.co/datasets/chcaa/dagw-word-frequencies

using revision e692bc45447f841258e30363a4b371ef1e4908f7.

It is derived using the smoothed log probabilites only contains the first 1 000 000 lexemes.

0 comments on commit 504dc3a

Please sign in to comment.