This XSL transformation is based on dariok's work for the Transkribus project. His original transformation scenario is available as the page2tei project.
Page2tei takes a PAGE XML file and transforms it into a conformant TEI XML file.
Customizations by the Time Us project include:
- transforming non-default tags into TEI-conformant tags
- dealing only with PAGE XML files (not METS)
- transforming additional data entered under the `temp` (temporary) namespace by our ExportFromTranskribus script
- creating a more complete teiHeader
- adding `rendition="multiline"` attributes to all semantic tags that extend over more than one line
- Download the latest version of the Saxon HE XSLT processor (e.g. `saxonHE9-9-0-1J`).
- Set up the following directory layout:
page2tei_TimeUs/
| - page2tei_TU.xsl
| - stringpack.xsl
| - saxon9he.jar
| - input/
| - output/
- Paste all PAGE files to transform into the `input` directory.
- Run the following command in a terminal from the `page2tei_TimeUs/` directory:
java -jar saxon9he.jar -s:input/ -o:output/ -xsl:page2tei_TU.xsl
Transformed files will be stored in the `output/` directory.
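
The steps above could be wrapped in a few shell helpers, as in the minimal sketch below. It is not part of the project. The function names (`prepare_dirs`, `check_files`, `run_transform`) are hypothetical, and the file names are taken from the directory layout listed above. The sketch assumes Java is on the `PATH` and passes `saxon9he.jar` to `java -jar`.

```shell
#!/bin/sh
# Sketch only: helper functions for the setup and run steps.
# File and directory names follow the layout listed in this README;
# the function names are hypothetical.

# Create the input/ and output/ directories under the given root.
prepare_dirs() {
    mkdir -p "$1/input" "$1/output"
}

# Check that both stylesheets and the Saxon jar exist under the root.
# Prints each missing file to stderr and returns 1 if any is missing.
check_files() {
    status=0
    for f in page2tei_TU.xsl stringpack.xsl saxon9he.jar; do
        if [ ! -f "$1/$f" ]; then
            echo "missing: $1/$f" >&2
            status=1
        fi
    done
    return "$status"
}

# Transform every file in input/ and write the results to output/.
# Requires Java on PATH; runs in a subshell so the caller's directory is unchanged.
run_transform() {
    ( cd "$1" && java -jar saxon9he.jar -s:input/ -o:output/ -xsl:page2tei_TU.xsl )
}
```

Called from inside `page2tei_TimeUs/`, `prepare_dirs . && check_files . && run_transform .` would run the three steps in order and stop at the first one that fails.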