ngram
Here are 146 public repositories matching this topic...
Touch typing trainer using N-grams as data source, with options to customize the auto-generated lessons and specify the minimum typing performance needed. There are sound/color effects as well.
Updated Aug 12, 2024 - JavaScript
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e., patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate a…
Updated Dec 17, 2024 - C++
Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms
Updated Mar 14, 2024 - C++
Python implementation of an N-gram language model with Laplace smoothing and sentence generation.
Updated Feb 9, 2018 - Python
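For readers unfamiliar with the technique named in the entry above, here is a minimal sketch of a bigram model with Laplace (add-one) smoothing. It is not taken from the listed repository; the function names `train_bigram_counts` and `laplace_prob`, the `<s>`/`</s>` padding tokens, and the toy corpus are all assumptions made for illustration.

```python
from collections import Counter

def train_bigram_counts(sentences):
    """Count unigrams and bigrams over tokenized sentences.

    Each sentence is padded with the hypothetical markers <s> and </s>.
    """
    unigrams, bigrams = Counter(), Counter()
    for tokens in sentences:
        padded = ["<s>"] + list(tokens) + ["</s>"]
        unigrams.update(padded)
        bigrams.update(zip(padded, padded[1:]))
    return unigrams, bigrams

def laplace_prob(prev, word, unigrams, bigrams):
    """P(word | prev) with add-one smoothing: (c(prev, word) + 1) / (c(prev) + V)."""
    vocab_size = len(unigrams)  # V: number of distinct tokens, including padding
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size)

# Toy corpus, invented for this example.
corpus = [["the", "cat", "sat"], ["the", "dog", "sat"]]
unigrams, bigrams = train_bigram_counts(corpus)

seen = laplace_prob("the", "cat", unigrams, bigrams)    # bigram occurs once
unseen = laplace_prob("the", "sat", unigrams, bigrams)  # bigram never occurs
```

Adding one to every bigram count means an unseen pair such as ("the", "sat") still receives a small nonzero probability, at the cost of taking some probability mass away from observed pairs.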
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Updated Apr 25, 2022 - Scala
Top-k Approximate String Matching.
Updated Dec 16, 2021 - Go
Cleaning and quality evaluation of Chinese corpora for large-model pre-training.
Updated Jul 25, 2024 - Java
Chinese word segmentation implemented with traditional methods (N-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.), and pre-training methods (BERT, etc.).
Updated Jun 15, 2022 - Python
Create n-grams of wordlists based on words, characters, or charsets to use in offline password attacks and data analysis
Updated Jun 27, 2024 - Python
multiprocess unsupervised chinese_detect_words ngram_combination
Updated Jan 2, 2019 - Python
Ngrams with Basic Smoothings
Updated May 15, 2024 - Python
fast and comprehensive k-mer counting package
Updated Sep 27, 2021 - C++