Installation

python3 setup.py install

Local installation

For a local installation for the current user, make sure the directory ~/.local/lib/python3.x exists (where x is the minor version number of your Python install) and run:

python3 setup.py install --prefix=~/.local

Commands

The package provides commands for manipulating FLOPO data. The usual pipeline consists of calls like this:

mkdir -p data/work/webanno
flopo-convert \
	-f csv -t webanno-tsv \
	-i data/final/kiky.conll.csv -o data/work/webanno/
	-a NamedEntity:data/final/kiky.ner.csv \
	   Quote:data/final/kiky.quotes.csv \
	   Metaphor:data/final/kiky.metaphors.csv \
	   Hedging:data/final/kiky.hedging.csv
flopo-package \
	-I data/work/webanno/ \
	-t data/raw/webanno-project-template.json \
	-o data/final/kiky.zip \
	-n 'Case KIKY'

The commands are described below. Furthermore, each command has a --help option, which will print detailed usage information.

`flopo-convert`

Convert between different file formats used in FLOPO.

Arguments

-f, --from -- input format (currently conll, csv or webanno-tsv),
-t, --to -- output format (currently webanno-tsv or prolog), use flopo-export to convert WebAnno files back to CSV,
-i, --input-path -- path to the input file or directory,
-o, --output-path -- path to the output file or directory,
-a, --annotations -- a list of annotations to add, each having the format: LAYER:FILE, where LAYER is the name of the layer (for example 'Hedging') and FILE is a CSV file.
-n, --max-docs-pef-file -- split the output file into parts containing max. N documents (CSV output format only)
-r, --recursive -- if reading input from a directory, search also subdirectories
-r, --recursive -- if reading input from a directory, search also subdirectories

The -i and -o arguments can be either a file or directory, depending on the format. The right course of action is determined automatically.

Examples

flopo-convert -f csv -t webanno-tsv -i kiky.conll.csv -o webanno/

Convert a whole corpus from CSV format (CoNLL columns) to WebAnno-TSV files and save them in the folder webanno.

flopo-convert -f webanno-tsv -t prolog -i 99860144 -o 99860144.pl

Convert a single TSV file to Prolog.

`flopo-export`

Export the annotations from WebAnno files as text or CSV.

Arguments

-a, --annotation -- the annotation layer to export.
-i, --input-file -- a single WebAnno TSV file.
-I, --input-dir -- alternatively, you may supply a directory containing WebAnno TSV files.
-o, --output-file -- output file; if none or - given, standard output is used.
-d, --delimiter -- the field delimiter for the output format (default: comma). If you want to do some further processing (e.g. with cut or awk), it is useful to set it to Tab.
--doc-id -- the document ID (optional; default: filename without .tsv suffix)

Examples

Print out the annotations from the layer Metaphor to a CSV file:

flopo-export -a Metaphor -I webanno/ -o metaphors.csv

`flopo-finer`

Tag named entities using FINER.

Arguments

-i, --input-file -- CSV file containing a corpus to annotate,
-o, --output-file -- CSV file to save FINER annotations,
--remote -- use a remote FINER instance; its URL is currently hardcoded to https://finer-flopo.rahtiapp.fi.

Examples

flopo-finer -i kiky.conll.csv -o kiky.ner.csv

Annotate the corpus using local FINER.

flopo-finer --remote -i kiky.conll.csv -o kiky.ner.csv

The same using remote FINER.

`flopo-eval`

Compare annotations to a gold standard.

Arguments

-i, --input-file -- CSV file containing the annotations to evaluate,
-g, --gs-file -- CSV file containing the gold standard annotation,
-c, --corpus-file -- corpus file (CoNLL format),
-r, --results-format -- results format: short - print only evaluation measures, long - print results for each sentence, csv - output a CSV suitable for more detailed evaluation.

Examples

flopo-eval \
	-c kiky.conll.csv  -i kiky.conll.quotes.csv \
	-g quotes.aharju.csv -r csv

`flopo-package`

Package a corpus of WebAnno files as a project (zip file) ready to import to WebAnno.

Arguments

-I, --input-dir -- the directory containing WebAnno-TSV files,
-t, --template-file -- a JSON file containing a template of the project metadata (in a format expected by WebAnno),
-n, --name -- the project name,
-o, --output-file -- the resulting zip file (default: NAME.zip).

Examples

flopo-package \
	-I data/work/webanno/ \
	-t data/raw/webanno-project-template.json \
	-o kiky.zip -n 'Case KIKY'

`flopo-csv-merge-articles`

This takes as input CoNLL-CSV files in which articles are divided into stuctural blocks encoded in articleIds. For example instead of the article with ID 2000004489163, we have 2000004489163_title, 2000004489163_ingress and 2000004489163_body as separate documents, with paragraph and sentence IDs starting from 1 in each.

The script merges the parts into a single document and fixes the paragraph and sentence IDs. It makes the following assumptions:

the part identifier begins after the first underscore; the article IDs must not contain underscores,
the different parts of one article follow each other in the input CSV.

The information about parts is written into a separate CSV file with the structure: articleId,blockId,startParagraphId,endParagraphId.

Arguments

-i, --input-file -- the input CSV file
-o, --output-file -- the output CSV file containing the documents
-p, --parts-file -- the CSV file to write the information about parts to

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
src/flopo_formats		src/flopo_formats
tests		tests
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Local installation

Commands

`flopo-convert`

Arguments

Examples

`flopo-export`

Arguments

Examples

`flopo-finer`

Arguments

Examples

`flopo-eval`

Arguments

Examples

`flopo-package`

Arguments

Examples

`flopo-csv-merge-articles`

Arguments

About

Releases 1

Packages

Contributors 2

Languages

hsci-r/flopo-formats

Folders and files

Latest commit

History

Repository files navigation

Installation

Local installation

Commands

flopo-convert

Arguments

Examples

flopo-export

Arguments

Examples

flopo-finer

Arguments

Examples

flopo-eval

Arguments

Examples

flopo-package

Arguments

Examples

flopo-csv-merge-articles

Arguments

About

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

`flopo-convert`

`flopo-export`

`flopo-finer`

`flopo-eval`

`flopo-package`

`flopo-csv-merge-articles`

Packages