Library in C++ and a python wrapper for dealing with Page XML files
Check py-pagexml/README.rst and/or docker/Dockerfile_build, docker/Dockerfile_runtime.
- lib: Directory containing the C++ PageXML and TextFeatExtractor libraries.
- py-pagexml: Swig-based python wrapper for the PageXML library.
- py-textfeat: Swig-based python wrapper for the TextFeatExtractor library.
- https://omni-us.github.io/pagexml/py-pagexml: Online documentation for py-pagexml.
- https://omni-us.github.io/pagexml/py-textfeat: Online documentation for py-textfeat.