Extract ko-wiki history data from XML file. Only support Korean Language.
-
Should setup
WikiExtractor
-
run
wiki extractor
python -m wikiextractir.WikiExtractor 'your wiki dump file' -o 'output path'
- run
processing.py
python processing.py --path 'output path' --write-path 'result output path'