Does the parser special-treat relation subtypes? #83
Comments
UDPipe indeed simply selects
I have a related question. I know that the EDIT: To be more precise, I am concerned with all the stages of training, evaluation, and prediction, i.e. does the loss function ignore subtypes, does the evaluation process for choosing the best iteration ignore the subtypes, and does this apply in any way during the stage of prediction (this last question I think is answered by your last post here).
UDPipe 1 significantly pre-dates the There is however one place where the subtypes are ignored -- when running This is a design choice -- we try to reconstruct whatever the user has given us. If you are interested only in deprels without subtypes, you can remove them from the training data :-)
I see, in the meantime I have found this piece of code: github.com/ufal/udpipe/blob/master/src/model/evaluator.cpp lines 252-256, I take it that this is used only when running
Exactly. During training, https://github.com/ufal/udpipe/blob/master/src/parsito/parser/parser_nn_trainer.cpp#L476 is used, which compares the whole deprels.
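To make the distinction between comparing whole deprels and comparing only the universal part concrete, here is a minimal Python sketch of a labeled attachment score computed both ways. It is not UDPipe code: the helper names `universal` and `las`, the `ignore_subtypes` parameter, and the sample data are all hypothetical and exist only for illustration.

```python
def universal(deprel):
    """Strip the subtype: 'nsubj:pass' -> 'nsubj' (hypothetical helper)."""
    return deprel.split(":", 1)[0]


def las(gold, pred, ignore_subtypes=False):
    """Labeled attachment score over parallel lists of (head, deprel) pairs.

    With ignore_subtypes=False the whole deprel must match;
    with ignore_subtypes=True only the part before ':' is compared.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    if not gold:
        return 0.0
    correct = 0
    for (gold_head, gold_rel), (pred_head, pred_rel) in zip(gold, pred):
        if ignore_subtypes:
            gold_rel, pred_rel = universal(gold_rel), universal(pred_rel)
        if gold_head == pred_head and gold_rel == pred_rel:
            correct += 1
    return correct / len(gold)


# Illustrative data only: heads are all correct, two subtypes differ.
gold = [(2, "nsubj:pass"), (0, "root"), (2, "obl:agent")]
pred = [(2, "nsubj"), (0, "root"), (2, "obl:arg")]

strict_score = las(gold, pred)                        # whole deprels compared
relaxed_score = las(gold, pred, ignore_subtypes=True)  # subtypes stripped
```

In this toy example only one of three tokens matches under the strict comparison, while all three match once subtypes are stripped, so the two settings can give quite different scores for the same output.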
Imagine the parser is trying to decide between `rela:subtype1`, `rela:subtype2` and `relb`. Let them have probabilities 0.25, 0.2 and 0.3 respectively. Will UDPipe simply select `relb` or will it select `rela:subtype1` because universal `rela` is more probable (0.25+0.2)?