The Layout Parser Open Platform hosts a collection of layout detection models and document image analysis pipelines. We hope this platform can make it easier for practitioners to find and use the best layout detection models for their research and application. And it can provide a platform for the interchange of ideas for building complex DIA pipelines.
We welcome contributions to the layout parser platform -- feel free to submit your models or pipelines to the platform following the instructions below.
- Train your own layout detection models
- Submit a pull request in the main layout-parser repo:
- If your layout model is based on an existing framework like Detectron2 or PaddleDetection, you just need to provide the model weights and we’ll incorporate it in the library.
- If your layout model is built on a different framework, you might also need to implement the LayoutDetector Class yourself.
- Submit the model details to the Layout Parser Open Platform.
- You might need to provide
lp-model-config
in the model card data section. It will be assigned in the previous PR in the model adding request.
- You might need to provide
Please note: the models submitted to the Layout-Parser repo will be assigned with Apache 2.0 license in accordance with the layoutparser library. If you don't wish to submit your models under the license, you can also consider including the model in a DIA pipeline repo and have it submitted instead.
- Develop your own document image analysis pipeline in a separate Github repo.
- There are not any license requirements for the pipeline repos as they are individual projects from layout-parser.
- Submit the pipline details to the Layout Parser Open Platform.
- You can specify the
link
to the DIA pipeline repo in the model card data section.
- You can specify the