Skip to content

Improving my NER model's accuracy #13244

Discussion options

You must be logged in to vote

That is indeed a relatively low accuracy for an NER challenge, and the reason might be your data model. In general, we advice against having sublabels of common entity types, as it will probably be challenging for an NER model to see the distinctions between "ACQUIREE_COMPANY" and "ACQUIROR_COMPANY". On the other hand, simply identifying entities that are a "COMPANY" would be something that an NER is quite good at.

So, my personal approach to an NLP solution would probably look a bit more like this:

  1. Train an NER to recognize any company name in text (or use/evaluate an existing one like from one our pretrained pipelines)
  2. Identify sentences that talk about acquisition (e.g. with a textcat

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Answer selected by svlandeg
Comment options

You must be logged in to vote
1 reply
@svlandeg
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
training Training and updating models feat / ner Feature: Named Entity Recognizer perf / accuracy Performance: accuracy
2 participants