DFM Processing This repo contains code for processing data within the Danish Foundation Models project. CLI The CLI is divided into separate sections: Document Processing (document) process-directory process-web-crawl Data Cleaning (cleaning) TBD