You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I recently found an MOPO code implemented using pytorch (https://github.com/junming-yang/mopo-pytorch). I cannot find the difference between MOPO and SAC. The only difference is that there data are sampled from the rollout replay buffer generated by the learned dynamcis?
The text was updated successfully, but these errors were encountered:
I recently found an MOPO code implemented using pytorch (https://github.com/junming-yang/mopo-pytorch). I cannot find the difference between MOPO and SAC. The only difference is that there data are sampled from the rollout replay buffer generated by the learned dynamcis?
The text was updated successfully, but these errors were encountered: