-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Uniform random agent not moving in AntGather environment #3
Comments
That's expected, it's performing random actions each time step, so on average it would likely stay in the same spot |
But even in case of the PointGatherEnv there is no movement in the red cube on executing random actions |
Change
to
|
Thanks, @sash-a Can you also provide PointMazeEnv? |
Also, some examples with a trained example would help in the provided Colab. Probably running a Stable baselines agent might be enough: |
PointMazeEnv may come in the future, but unfortunately I have pressing deadlines at the moment and it is not at the top of my list of envs to implement, AntPush and AntFall will likely come first. The point of these environments is that they generally require hierarchical reinforcement learning to solve, so stable baselines likely would not cut it. Regardless that is beyond the scope of this repository, at the moment it is simply for my own research as I could not find non-mujoco versions of these envs and anyone that wants to use these envs is welcome, but I don't really have enough time to create baselines. You are more than welcome to try running some standard RL algorithms on these envs and see if they work and make some contributions like a As a side not I did at one point try PPO on AntGather and it got a reward of 0, but that was a couple months ago and a very quick test. |
I tried the example code provided in the README. the agent is not making any move from the initial position it was dropped, except it jitters its body a little.
The text was updated successfully, but these errors were encountered: