Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why don't you just use qval? #2025

Closed
Triple-ROCK opened this issue Aug 13, 2020 · 2 comments
Closed

Why don't you just use qval? #2025

Triple-ROCK opened this issue Aug 13, 2020 · 2 comments
Labels

Comments

@Triple-ROCK
Copy link

reward_run = (xposafter - xposbefore)/self.dt

@Triple-ROCK
Copy link
Author

reward_run = (xposafter - xposbefore)/self.dt

I see some tutorial where they register customized env and use 'xvel = observations[:, 9]' instead of
reward_run = (xposafter - xposbefore)/self.dt

@jkterry1
Copy link
Collaborator

PR #2762 is about to be merged, introducing V4 MuJoCo environments using new bindings and a dramatically newer version of the engine. If this issue still persists with the V4 ones, please create a new issue for it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants