For Tree-Based-Exploration see: https://github.com/KohlerHECTOR/TREX-Tree-Reward-EXploration
Install scikit-learn and SB3
pip3 install -r requirements.txt
Launch MBPO for 100 iterations on InvertedPendulum with Decision Trees as Model estimators and SAC as policy optim. Results are saved in 'Experience_Results/pendul-tree-sac/':
python3 experience.py InvertedPendulum-v4 tree sac 100 pendul-tree-sac
Launch MBPO for 100 iterations on InvertedPendulum with 2x64 MLP as Model estimators and SAC as policy optim. Results are saved in 'Experience_Results/pendul-mlp-sac/':
python3 experience.py InvertedPendulum-v4 mlp sac 100 pendul-mlp-sac
Save Plots of comparisons 'Experience_Results/Comparison-date-time/':
python3 compare_experiences.py pendul-tree-sac pendul-mlp-sac
Save Plots of results in 'Experience_Results/pendul-tree-sac/':
python3 plot_experience.py pendul-tree-sac