Skip to content

Toolkit for processing data in the danish foundation models project.

License

Notifications You must be signed in to change notification settings

danish-foundation-models/dfm-processing

Repository files navigation

DFM Processing

This repo contains code for processing data within the Danish Foundation Models project.

CLI

The CLI is divided into separate sections:

  1. Document Processing (document)
    • process-directory
    • process-web-crawl
  2. Data Cleaning (cleaning)
    • TBD

About

Toolkit for processing data in the danish foundation models project.

Resources

License

Stars

Watchers

Forks

Languages