Stars

NLP

21 repositories

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the u…

Jupyter Notebook 563 133 Updated Jan 1, 2025

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,337 896 Updated Jan 17, 2025

Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning

Python 307 30 Updated Oct 18, 2024

A comprehensive reference for all topics related to Natural Language Processing

Python 2,019 283 Updated Oct 6, 2024

Augmenty is an augmentation library based on spaCy for augmenting texts.

Python 151 11 Updated May 24, 2024

A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Python 1,212 147 Updated Jan 16, 2024

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 6,323 777 Updated Jan 17, 2025

SpikeX - SpaCy Pipes for Knowledge Extraction

Python 397 28 Updated Jul 30, 2021

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

Python 2,162 333 Updated Jul 16, 2024

State-of-the-Art Text Embeddings

Python 15,779 2,527 Updated Jan 17, 2025

GSDMM: Short text clustering

Python 355 94 Updated Dec 28, 2022

Top2Vec learns jointly embedded topic, document and word vectors.

Python 2,973 374 Updated Nov 14, 2024

DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks

Python 1,264 135 Updated Mar 2, 2023

🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Python 1,749 248 Updated Dec 20, 2023

An open-source NLP research library, built on PyTorch.

Python 11,782 2,250 Updated Nov 22, 2022

Natural Language Processing Best Practices & Examples

Python 6,387 917 Updated Aug 30, 2022

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 137,715 27,590 Updated Jan 17, 2025

A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Python 587 51 Updated Feb 7, 2024

Minimal keyword extraction with BERT

Python 3,660 357 Updated Jul 16, 2024

Jupyter Notebook 19 5 Updated Sep 30, 2024

NLP, before and after spaCy

Python 2,214 249 Updated Sep 22, 2023