Skip to content

TimeUs-ANR/page2tei_TimeUS

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Page2tei, Time Us custom

Original project

This XSL transformation is based on the work of dariok for the Transkribus Project. His original transformation scenario is to be found under the page2tei project name.

Page2tei

Page2tei takes a PAGE XML file and transforms it into a conform TEI XML file.

Customizations

Customizations by Time Us project include :

  • transforming non-default tags into TEI-conform tags
  • dealing only with PAGE XML files (not METS)
  • transforming additional data entered under the temp (temporary) namespace by our ExportFromTranskribus script
  • creating more complete teiHeader
  • creating rendition="multiline" attributes to all semantic tags extended over more than one line

Running Page2tei_TU

  1. Download latest version of Saxon HE parser (ex: saxonHE9-9-0-1J):
  1. Set following directory configuration:
page2tei_TimeUs/
 | - page2tei_TU.xsl
 | - stringpack.xsl
 | - saxon9he.jar
 | - input/
 | - oupput/
  1. Paste all PAGE files to transform in input directory

  2. Run following command in terminal:

java -jar -s:input/ -o:output/ -xsl:page2tei_TU.xsl

Transformed files will be stored in output/ directory.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • XSLT 100.0%