Make it possible to track the preferences of the PPO in the app. #70

jbloomAus · 2023-05-13T00:57:23Z

https://docs.google.com/document/d/1N1lVOXS5bLKYiXfoEeQoxxtI_0EfROi-JXcs-eYTCSA/edit?usp=sharing

I think this could be very valuable form the perspective of measuring the agent-simulators proclivity for modelling different agents in it's training distribution.

jbloomAus · 2023-05-13T00:58:59Z

A better version of this might be write a script which takes the training data and tests the predictions of the RL policies vs the agent simulator. We can think closely investigate examples with significant divergence and investigate the underlying mechanisms.

jbloomAus moved this to Todo in Decision Transformer Interpretability May 13, 2023

jbloomAus added this to Decision Transformer Interpretability May 13, 2023

jbloomAus removed this from Decision Transformer Interpretability May 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make it possible to track the preferences of the PPO in the app. #70

Make it possible to track the preferences of the PPO in the app. #70

jbloomAus commented May 13, 2023

jbloomAus commented May 13, 2023

Make it possible to track the preferences of the PPO in the app. #70

Make it possible to track the preferences of the PPO in the app. #70

Comments

jbloomAus commented May 13, 2023

jbloomAus commented May 13, 2023