Wrong context preperation in SequenceTagger.predict() #3024

mohammad-al-zoubi · 2022-12-13T13:28:39Z

Hi everyone,

I finetuned a TransformerWordEmbeddings based on 'microsoft/mdeberta-v3-base' for a NER task and set use_context=True. I noticed some discrepancy between the test that happens automatically after the training (implemented by flair) and the tests that I do with SequenceTagger manually afterwards. After debugging I noticed that the sentences in a batch are being reordered in SequenceTagger.predict():

flair/flair/models/sequence_tagger_model.py

Line 465 in 5a13598

reordered_sentences = sorted(sentences, key=len, reverse=True)

So the context that is being expanded from this batch is wrong. While the context that is being prepared in the automatic test after training is correct (no sentence reordering happens and SequenceTagger._prepare_tensors(batch) is directly called).

Can you let me know if my observation is correct or if I missed something?

alanakbik · 2022-12-19T07:03:19Z

Hello @alzoubi36 thanks for noticing this - it is indeed a big problem that needs to be fixed. The sorting is done to speed-up inference, but when document context needs to be inferred on the fly, the new ordering messes up the contexts. The simplest solution would be to remove the sorting, but this would lead to slow-downs for models that don't require context. We have to think how to best address this.

helpmefindaname · 2022-12-19T12:37:58Z

I think a hotfix could be to first call the embedding and then run the predictions, e.g.

model.embeddings.embedd(sentences)
model.predict(sentences)

or to just set the context beforehands:

for first, second in zip(sentences, sentences[1:]):
    first._next_sentence = second
    second._first_sentence = first

but I agree that this is something to be fixed in the library.
I also noticed in the code for adding the context:
There is currently no way to specify that a sentence is the first one and has no left_context, as the check if context is set is self._previous_sentence is not None.

@alanakbik Maybe we can move that logic to the .predict method, such that the embeddings always expect the context to be set already?

alanakbik · 2022-12-19T14:27:46Z

@alanakbik Maybe we can move that logic to the .predict method, such that the embeddings always expect the context to be set already?

That would be a solution, however it would mean adding this logic to every predict method in Flair (the SequenceTagger uses a different one from DefaultClassifier) and also this introduces some overhead for models that don't require context (though probably setting the context is not so expensive).

helpmefindaname · 2023-02-20T22:57:08Z

This is fixed by the PR above

alanakbik added the fix-for-release-0.12 Must be fixed / completed for release 0.12 label Dec 19, 2022

helpmefindaname mentioned this issue Jan 23, 2023

Improved Tars Context #3063

Merged

helpmefindaname closed this as completed Feb 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wrong context preperation in SequenceTagger.predict() #3024

Wrong context preperation in SequenceTagger.predict() #3024

mohammad-al-zoubi commented Dec 13, 2022 •

edited

Loading

alanakbik commented Dec 19, 2022

helpmefindaname commented Dec 19, 2022

alanakbik commented Dec 19, 2022

helpmefindaname commented Feb 20, 2023

Wrong context preperation in SequenceTagger.predict() #3024

Wrong context preperation in SequenceTagger.predict() #3024

Comments

mohammad-al-zoubi commented Dec 13, 2022 • edited Loading

alanakbik commented Dec 19, 2022

helpmefindaname commented Dec 19, 2022

alanakbik commented Dec 19, 2022

helpmefindaname commented Feb 20, 2023

mohammad-al-zoubi commented Dec 13, 2022 •

edited

Loading