[WIP] Non-adaptative Agent Comparisions #276

TimotheeMathieu · 2023-01-24T11:58:44Z

Description

In this PR I introduce a new function compare_agents.

Given n_agents agents that have each been fitted n_fit times, we evaluate these agents and compare them using a multiple test in order to know which agent are statistically different and which are not.

Two methods are implemented: Tukey HSD (parametric, suppose that the evaluations are Gaussians) and Permutation test with StepDown method (non parametric, suppose only a finite second moment). The results are illustrated with a boxplot and a heatmap. In the case of Tukey HSD we also have access to some adapted p-values to quantify the certainty of the test.

Example :

EDIT: now with a simple text (dataframe) output:

       Agent1 vs Agent2  mean Agent1  mean Agent2   mean diff    std diff decisions     p-val significance
0  A2CAgent vs PPOAgent   213.600875   423.431500 -209.830625  144.600160    reject  0.002048           **
1  A2CAgent vs DQNAgent   213.600875   443.296625 -229.695750  152.368506    reject  0.000849          ***
2  PPOAgent vs DQNAgent   423.431500   443.296625  -19.865125  104.279024    accept  0.926234

Still TODO:

Be able to use pickle files instead of list of agent managers
Tests

…parisions

TimotheeMathieu added 9 commits January 18, 2023 10:39

fix bug dill and compress always

f840b34

change version

9d0e5a7

permutation tests

2908942

comparison tukey_hsd and permutation and plot more or less working

063c864

api and misc doc improvements

773bb56

Merge branch 'main' of https://github.com/rlberry-py/rlberry into com…

a69020e

…parisions

simpler returns

f83f7e7

remove test thing

dae75e1

add test

58c9767

TimotheeMathieu added the ready for review label Jul 12, 2023

TimotheeMathieu added 3 commits July 13, 2023 09:30

Merge branch 'main' of https://github.com/rlberry-py/rlberry into com…

de7627c

…parisions

doc, test, typos

1ea7cf6

revert mistaken changes to readme

d98d755

TimotheeMathieu requested review from riccardodv, AleShi94 and KohlerHECTOR July 13, 2023 14:38

add decision column, remove old compare_agent

29a3265

KohlerHECTOR mentioned this pull request Jul 13, 2023

User guide #325

Open

TimotheeMathieu added 4 commits July 13, 2023 18:29

correct test

8847caf

Merge branch 'main' into comparisions

c68bba9

fix bug when merge

2db13d0

fix agent manager

f10436d

TimotheeMathieu merged commit cc84a0f into rlberry-py:main Aug 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Non-adaptative Agent Comparisions #276

[WIP] Non-adaptative Agent Comparisions #276

TimotheeMathieu commented Jan 24, 2023 •

edited

Loading

[WIP] Non-adaptative Agent Comparisions #276

[WIP] Non-adaptative Agent Comparisions #276

Conversation

TimotheeMathieu commented Jan 24, 2023 • edited Loading

Description

TimotheeMathieu commented Jan 24, 2023 •

edited

Loading