Why don't you just use qval? #2025

Triple-ROCK · 2020-08-13T03:43:19Z

Line 16 in 6df1b99

reward_run = (xposafter - xposbefore)/self.dt

Triple-ROCK · 2020-08-13T03:45:23Z

gym/gym/envs/mujoco/half_cheetah.py

Line 16 in 6df1b99

reward_run = (xposafter - xposbefore)/self.dt

I've seen some tutorials that register a customized env and use 'xvel = observations[:, 9]' instead of
reward_run = (xposafter - xposbefore)/self.dt
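
To make the comparison concrete, here is a minimal sketch of how the two quantities can differ. It does not use gym or MuJoCo: it integrates a toy 1-D body, and `simulate_step`, `sub_dt`, `frame_skip`, and the acceleration value are all hypothetical names and numbers chosen for illustration. The finite difference gives the average velocity over the whole env step, while a velocity read from the state after the step is the instantaneous value at the end of it. The two agree only when the velocity is constant during the step.

```python
def simulate_step(x, v, accel, sub_dt, frame_skip):
    """Advance a toy 1-D body by `frame_skip` substeps (semi-implicit Euler).

    This stands in for a physics engine stepping several times per env step;
    it is an illustrative assumption, not the gym or MuJoCo implementation.
    """
    for _ in range(frame_skip):
        v = v + accel * sub_dt  # update velocity first
        x = x + v * sub_dt      # then move with the updated velocity
    return x, v


# Hypothetical values chosen for illustration only.
sub_dt = 0.01
frame_skip = 5
dt = sub_dt * frame_skip  # duration of one env step

x_before, v_before = 0.0, 1.0
x_after, v_after = simulate_step(
    x_before, v_before, accel=2.0, sub_dt=sub_dt, frame_skip=frame_skip
)

# Finite difference: average velocity over the whole env step.
reward_run_fd = (x_after - x_before) / dt

# Velocity read from the state after the step: instantaneous value.
reward_run_inst = v_after

print(f"finite difference: {reward_run_fd:.4f}")
print(f"instantaneous:     {reward_run_inst:.4f}")
```

With a nonzero acceleration the two numbers differ; setting `accel=0.0` makes them coincide. Whether that difference matters for training is a separate question that this sketch does not answer.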

jkterry1 · 2022-05-23T18:54:35Z

PR #2762 is about to be merged, introducing V4 MuJoCo environments using new bindings and a dramatically newer version of the engine. If this issue still persists with the V4 ones, please create a new issue for it.

pzhokhov added the mujoco label Aug 28, 2020

jkterry1 closed this as completed May 23, 2022
