Add new models for NER information extraction #20

eriknovak · 2024-07-17T13:07:50Z

Connected to a problem?

The current NER information extraction focuses on using the GLiNER model, specifically urchade/gliner_multi_pii-v1. While this model does support some different languages, we would need models that would cover a more extensive list of languages. Furthermore, the NER model should support various domains as well.

Solution?

Find NER datasets or create synthetic datasets that support different languages and domains. For this, we could use the scripts provided by the GLiNER package and publish the trained models on the huggingface hub.

An additional bonus would be to evaluate these models in different languages and domains. However, this could be difficult due to the lack of open datasets for these use cases.

Alternatives?

No response

The text was updated successfully, but these errors were encountered:

eriknovak added the enhancement New feature or request label Jul 17, 2024

eriknovak self-assigned this Jul 17, 2024

eriknovak removed their assignment Oct 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new models for NER information extraction #20

Add new models for NER information extraction #20

eriknovak commented Jul 17, 2024

Add new models for NER information extraction #20

Add new models for NER information extraction #20

Comments

eriknovak commented Jul 17, 2024

Connected to a problem?

Solution?

Alternatives?