linguistic-corpora

Here are 38 public repositories matching this topic...

wzkariampuzha / EvolutionaryLinguistics

Investigations into evolutionary linguistics using the Google Ngrams corpus. Sub-projects include "Birth and Death of English Lexemes in Closed Lexical Classes" and "Lexicon Size".

evolution linguistics linguistic-corpora google-ngram

Updated Sep 14, 2023
Jupyter Notebook

AxelleDomingues / Memoire-2

python language bigdata linguistics typology proportion normalization linguistic-corpora linguistics-databases italian-language french-language googlecolaboratory student-managed

Updated Jun 23, 2023
Jupyter Notebook

ipante / visualisation-diatopique

Cartographic visualization workshop held as part of the Summer School "Phonologie de corpus", UNIL (22-26.07.2019)

visualization d3 data sli linguistic-corpora unil

Updated Jul 25, 2019
JavaScript

saevers / WG-Texts

A script for processing linguistic data with interlinear glosses from a PDF

linguistics linguistic-features linguistic-corpora linguistic-analysis

Updated Apr 24, 2020
R

rahonalab / cl-cagliari2017

PDF, TeX, and data files for the corpus linguistics lessons delivered in Cagliari, December 2017

latex-document linguistics morphological-analysis linguistic-corpora beamer-presentation

Updated Dec 5, 2017
TeX

lisni946 / Word2Vec_koine_greek

Word2Vec Model for Koine Greek Categorisation

ancient-greek word2vec-model linguistic-corpora

Updated Apr 14, 2021
Jupyter Notebook

miweru / vrt_generator

Python class for creating vrt-annotated corpora

wrapper linguistics corpora vrt linguistic-corpora

Updated Dec 19, 2019
Python

miweru / vrt_spacy

nlp wrapper linguistics spacy corpora vrt linguistic-corpora

Updated Dec 19, 2019
Python

sorinmarti / textanalyzer

Java software for analyzing text files.

linguistic-corpora linguistics-field dialects linguistic-analysis

Updated Feb 12, 2018
Java

avery-radmacher / Wemyss

A web scraper for the student newspaper of Covenant College.

newspaper webscraping linguistic-corpora

Updated Oct 15, 2020
Ruby

habecker / Orthographie-Archiv

Custom search engine for a small corpus

flask vuejs duden linguistic-corpora

Updated Jan 5, 2021
Vue

npedrazzini / averageReducedFrequency

R script to calculate the Average Reduced Frequency (ARF) of all words in a corpus

r frequency-analysis linguistic-corpora keyword-analysis

Updated May 14, 2020
R

emeinhardt / switchboard-lm

Notebooks for processing various versions of the Switchboard corpus.

linguistics linguistic-corpora

Updated Nov 22, 2019
Jupyter Notebook

clemsciences / old_swedish_texts

bible-translations linguistic-corpora philology old-swedish

Updated Jul 8, 2018
HTML

emeinhardt / fisher-lm

Notebook that converts the Fisher corpus to a relational format and processes it for a language model.

linguistics linguistic-corpora

Updated Sep 5, 2019
Jupyter Notebook

emeinhardt / fisher-lm-srilm

A repository describing the construction of a unigram language model from the Fisher corpus

linguistics linguistic-corpora

Updated Oct 14, 2019
Jupyter Notebook

Frobeniusnorm / AcademicTextEstimator

linguistics linguistic-corpora pragmatics linguistic-analysis genre-classification

Updated Jun 28, 2022
Scala

levindoneto / Programmierkurs

Assignments for the programming course (Programmierkurs), Universität Stuttgart, winter semester

python programming linguistic-corpora assignments

Updated Jan 24, 2018
Python

LAAC-LSCP / datasets

DataLad superdataset including all the datasets currently managed by the LAAC/LSCP team

linguistic-corpora linguistics-databases linguistic-dataset

Updated Feb 18, 2021
Python

thjbdvlt / corpus-recemment

Corpus of unannotated, tokenized, and lemmatized French sentences (22M).

french linguistic-corpora french-nlp

Updated Sep 18, 2024
