A simple tool for downloading the OSCAR corpus.
Install it from PyPI:
$ pip install oscar-corpus-downloader
Submit an OSCAR access request following the procedure described on the project page.
Once you have received your credentials, export them as environment variables and use the command-line interface to download a part of the OSCAR corpus.
$ export OSCAR_USERNAME=username
$ export OSCAR_PASSWORD=password
$ oscar download --help
Usage: oscar download [OPTIONS]

Options:
  -u, --url TEXT         OSCAR corpus url  [required]
  -o, --output-dir TEXT  Output directory  [required]
  --resume               Resume download
  --help                 Show this message and exit.
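Because the credentials are passed through the OSCAR_USERNAME and OSCAR_PASSWORD environment variables, it can be convenient to check that both are set before starting a long download. The following is a minimal sketch, not part of the tool itself: the function name check_oscar_credentials is hypothetical, and it only tests that the variables are non-empty, not that the server accepts them.

```shell
# Hypothetical helper (not shipped with oscar-corpus-downloader):
# fail early if either credential variable is unset or empty.
check_oscar_credentials() {
    for name in OSCAR_USERNAME OSCAR_PASSWORD; do
        # Indirect lookup of the variable called $name (POSIX sh compatible).
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "missing environment variable: $name" >&2
            return 1
        fi
    done
    return 0
}
```

It could be chained in front of the download command, for example `check_oscar_credentials && oscar download ...`.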
$ oscar download \
    --url https://oscar-prive.huma-num.fr/2301/fr_meta \
    -o ./oscar-fr
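Corpus parts can be large, so an interrupted transfer is likely at some point. The sketch below wraps the command above in a retry loop and adds the --resume flag from the second attempt onward. It rests on assumptions the help text does not confirm: that `oscar download` exits with a non-zero status on failure, and that --resume picks up from a partially filled output directory. The names download_with_retry and RETRY_DELAY, and the default of 5 attempts, are hypothetical.

```shell
# Hypothetical wrapper: retry a download, resuming after the first attempt.
# Usage: download_with_retry URL OUTPUT_DIR [MAX_ATTEMPTS]
download_with_retry() {
    url=$1
    output_dir=$2
    max_attempts=${3:-5}        # illustrative default, not from the tool
    attempt=1
    while [ "$attempt" -le "$max_attempts" ]; do
        if [ "$attempt" -eq 1 ]; then
            # First attempt: plain download.
            oscar download --url "$url" -o "$output_dir" && return 0
        else
            # Later attempts: ask the tool to resume (assumed behavior).
            oscar download --url "$url" -o "$output_dir" --resume && return 0
        fi
        echo "attempt $attempt of $max_attempts failed" >&2
        attempt=$((attempt + 1))
        # Pause between attempts; RETRY_DELAY is in seconds (default 10).
        sleep "${RETRY_DELAY:-10}"
    done
    return 1
}
```

For the French part shown above, the call would be `download_with_retry https://oscar-prive.huma-num.fr/2301/fr_meta ./oscar-fr`.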