
Filter positive items for ranking evaluation #523

Merged - 11 commits merged into PreferredAI:master from the filter-positive-items-for-ranking-eval branch on Aug 13, 2023

Conversation

@tqtg (Member) commented on Jul 24, 2023

Description

Previously, we did not filter out items that appear as positives in the training set when evaluating most ranking metrics on the test/validation set. As a result, the measured performance of models that rank those training items near the top was unfairly penalized. In this PR, we first explicitly exclude those items when calling the Recommender.rank() function via item_indices, and we then modify all ranking metrics to follow the same logic. To summarize, positive items from training are no longer involved during evaluation.
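
A minimal sketch of the filtering idea (the helper below is hypothetical and not the actual Cornac implementation; the commented rank() call assumes the item_indices parameter described above):

```python
import numpy as np

# Hypothetical sketch: build the candidate set for a user by dropping
# the items that user already interacted with in the training set,
# then rank only the remaining candidates.
def candidate_items(train_csr, user_idx, num_items):
    train_positives = train_csr[user_idx].indices  # items seen in training
    return np.setdiff1d(np.arange(num_items), train_positives)

# item_indices = candidate_items(train_csr, user_idx, num_items)
# item_rank, item_scores = model.rank(user_idx, item_indices)
```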

How are the ranking metrics affected?

  • AUC: not affected - the previous logic already accounted for this.
  • Precision: affected - results are expected to increase.
  • Recall: affected - results are expected to increase.
  • NDCG: affected - results are expected to increase.
  • NCRR: affected - results are expected to increase.
  • MAP: affected - results are expected to increase.
  • MRR: affected - results are expected to increase.
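
A toy example (hypothetical numbers) of why the hit-based metrics are expected to increase once training positives are removed from the candidate list:

```python
import numpy as np

# The model ranks the user's training positives at the top because it
# has effectively memorized them; they crowd out the held-out items.
ranked_items    = np.array([3, 7, 1, 9, 4, 2])  # best to worst
train_positives = {3, 7}                        # seen during training
test_positives  = {1, 9}                        # held out for evaluation
K = 2

def precision_at_k(ranking, relevant, k):
    return sum(1 for i in ranking[:k] if i in relevant) / k

# Old behaviour: training positives occupy the top slots.
print(precision_at_k(ranked_items, test_positives, K))   # 0.0

# New behaviour: filter training positives before evaluating.
filtered = np.array([i for i in ranked_items if i not in train_positives])
print(precision_at_k(filtered, test_positives, K))       # 1.0
```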

How are the rating metrics affected?
They are not affected. This change only applies to ranking metrics.

Related Issues

#503

Checklist:

  • I have added tests.
  • I have updated the documentation accordingly.
  • I have updated README.md (if you are adding a new model).
  • I have updated examples/README.md (if you are adding a new example).
  • I have updated datasets/README.md (if you are adding a new dataset).

@tqtg requested review from @lthoang and @saghiles on July 24, 2023, 21:32
@tqtg (Member, Author) commented on Jul 24, 2023

Hi @amirj, we're making some changes to how the ranking metrics are computed, and these changes will break the logic of PropensityStratifiedEvaluation. Could you please have a look and see if we could mitigate this with some modification to the eval method?

@tqtg (Member, Author) commented on Jul 29, 2023

@saghiles @lthoang please spend some time on this PR and see if it makes sense to you. The best way is to write a few test cases. Let me know if you have any comments.

Review comment on cornac/metrics/ranking.py (outdated, resolved)
@saghiles (Member) left a comment


@tqtg, I left two comments; please have a look. The rest looks good to me.

@lthoang (Member) commented on Aug 11, 2023

@tqtg The two comments from @saghiles make sense to me. The rest LGTM.

@tqtg (Member, Author) commented on Aug 11, 2023

@saghiles @lthoang thanks to both of you for the comments, which are now addressed. If there are no further suggestions, I'll go ahead and merge this PR.

@tqtg force-pushed the filter-positive-items-for-ranking-eval branch from 6c5f332 to 3a1f0ce on August 11, 2023, 19:38
@tqtg merged commit b0d6fe8 into PreferredAI:master on Aug 13, 2023
@tqtg deleted the filter-positive-items-for-ranking-eval branch on August 13, 2023, 04:51
@tqtg added a commit that referenced this pull request on Oct 26, 2023