Skip to content
#

text-data

Here are 20 public repositories matching this topic...

This repository implements a pipeline to store various data of files from a large unstructured dataset. These fields are used for topic modeling (wordclouds, based on low-dimensional versions of embedding vectors, Named Entity Clustering and document-topic incidences). The information is aggregated and visualised using FCA.

  • Updated Jul 28, 2025
  • Python

Improve this page

Add a description, image, and links to the text-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-data topic, visit your repo's landing page and select "manage topics."

Learn more