Implemented methods to save and restore PyBullet states. #33

louixp · 2022-07-04T02:59:19Z

This PR addresses the feature discussed in #32.

qgallouedec

Great!
Could you also add some brief documentation by creating a new file in docs/usage/?
Something like "Save and restore states", with a short piece of code that explains how to use save_state and restore_state (in my opinion, explaining remove_state is not necessary).

import gym
import panda_gym

env = gym.make("PandaReach-v2")
obs = env.reset()

# [Interact]

valuable_state = env.save_state()

# [Try a sequence of actions]

env.restore_state(valuable_state) # Restore the valuable state

# [Try an alternative sequence of actions.]

env.close()

(No need to replace the bracketed comments with example code.)

louixp · 2022-07-04T20:27:15Z

I have added an example of a greedy random search to the documentation. PTAL
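
For readers without the docs at hand, here is a minimal sketch of what a greedy random search built on save_state and restore_state could look like. It is not the example from the PR. ToyEnv is a hypothetical 1-D stand-in for a panda-gym environment, and greedy_random_search, n_steps and n_candidates are names invented for this illustration. Only the method names save_state, restore_state and remove_state come from this thread.

```python
import random


class ToyEnv:
    """Hypothetical 1-D stand-in for an environment (not the panda-gym API)."""

    def __init__(self, goal: float = 5.0) -> None:
        self.goal = goal
        self.position = 0.0
        self._saved = {}  # state_id -> saved position
        self._next_id = 0

    def save_state(self) -> int:
        state_id = self._next_id
        self._next_id += 1
        self._saved[state_id] = self.position
        return state_id

    def restore_state(self, state_id: int) -> None:
        self.position = self._saved[state_id]

    def remove_state(self, state_id: int) -> None:
        self._saved.pop(state_id)

    def step(self, action: float) -> float:
        """Move by `action` and return the reward (negative distance to the goal)."""
        self.position += action
        return -abs(self.goal - self.position)


def greedy_random_search(env, n_steps=20, n_candidates=8, rng=None):
    """At each step, try random actions from a saved state and keep the best one."""
    rng = rng or random.Random(0)
    chosen = []
    for _ in range(n_steps):
        state_id = env.save_state()
        best_action, best_reward = None, float("-inf")
        for _ in range(n_candidates):
            action = rng.uniform(-1.0, 1.0)
            reward = env.step(action)
            if reward > best_reward:
                best_action, best_reward = action, reward
            env.restore_state(state_id)  # undo the trial before the next one
        env.remove_state(state_id)  # the snapshot is no longer needed
        env.step(best_action)  # commit the best candidate
        chosen.append(best_action)
    return chosen


env = ToyEnv(goal=5.0)
actions = greedy_random_search(env)
print(len(actions), env.position)
```

Removing each snapshot once it has served its purpose keeps the number of stored states bounded, which is the reason a remove_state method is useful in a loop like this.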

qgallouedec · 2022-07-04T21:02:25Z

This looks great.

Now you need to update the index.rst to make this section of the documentation visible in the index.
You'll also need to add a unit test function. Add a new file in test called state_test.py. I think one test function for all three new methods should be enough.

If you want some help, feel free to ask.

louixp · 2022-07-04T21:58:04Z

Thanks! I have added the unit tests. However, I'm not super familiar with pytest and it's giving me ModuleNotFoundError for panda_gym in all test suites locally. Do you know what I could have done wrong?

qgallouedec · 2022-07-05T06:45:57Z

To use pytest, install it in your virtual env:

pip install pytest

Then just run

pytest
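
The thread does not say what caused the ModuleNotFoundError. One common cause, assumed here and not confirmed by the discussion, is that the package is not installed in the interpreter that runs pytest. A small check along these lines can tell the two situations apart. The helper name is_importable is made up for this sketch.

```python
import importlib.util
import sys


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module can be found by the current interpreter."""
    return importlib.util.find_spec(module_name) is not None


# Show which interpreter is in use and whether it can see the packages.
print(sys.executable)
for name in ("pytest", "panda_gym"):
    print(name, "importable:", is_importable(name))
```

If panda_gym is reported as not importable, installing the repository into the active environment (for example with pip install -e . from the repository root) is the usual remedy.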

louixp · 2022-07-05T06:54:56Z

Thanks! It seems like a bunch of tests are failing. I reproduced this with a fresh copy of the repo. Here is the error message:

=========================================================================================================================== short test summary info ============================================================================================================================
FAILED test/envs_test.py::test_reach - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: Box(-10...
FAILED test/envs_test.py::test_slide - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: Box(-10...
FAILED test/envs_test.py::test_push - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: Box(-10....
FAILED test/envs_test.py::test_pickandplace - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: ...
FAILED test/envs_test.py::test_stack - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (6,), float32), desired_goal: Box(-10.0, 10.0, (6,), float32), observation: Box(-10...
FAILED test/envs_test.py::test_flip - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (4,), float32), desired_goal: Box(-10.0, 10.0, (4,), float32), observation: Box(-10....
FAILED test/envs_test.py::test_dense_reach - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: B...
FAILED test/envs_test.py::test_dense_slide - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: B...
FAILED test/envs_test.py::test_dense_push - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: Bo...
FAILED test/envs_test.py::test_dense_pickandplace - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observa...
FAILED test/envs_test.py::test_dense_stack - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (6,), float32), desired_goal: Box(-10.0, 10.0, (6,), float32), observation: B...
FAILED test/envs_test.py::test_dense_flip - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (4,), float32), desired_goal: Box(-10.0, 10.0, (4,), float32), observation: Bo...
FAILED test/envs_test.py::test_reach_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: ...
FAILED test/envs_test.py::test_slide_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: ...
FAILED test/envs_test.py::test_push_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: B...
FAILED test/envs_test.py::test_pickandplace_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observ...
FAILED test/envs_test.py::test_stack_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (6,), float32), desired_goal: Box(-10.0, 10.0, (6,), float32), observation: ...
FAILED test/envs_test.py::test_flip_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (4,), float32), desired_goal: Box(-10.0, 10.0, (4,), float32), observation: B...
FAILED test/envs_test.py::test_dense_reach_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observa...
FAILED test/envs_test.py::test_dense_slide_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observa...
FAILED test/envs_test.py::test_dense_push_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observat...
FAILED test/envs_test.py::test_dense_pickandplace_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), ...
FAILED test/envs_test.py::test_dense_stack_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (6,), float32), desired_goal: Box(-10.0, 10.0, (6,), float32), observa...
FAILED test/envs_test.py::test_dense_flip_joints - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (4,), float32), desired_goal: Box(-10.0, 10.0, (4,), float32), observat...
FAILED test/seed_test.py::test_seed_reach - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: Bo...
FAILED test/seed_test.py::test_seed_push - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: Box...
FAILED test/seed_test.py::test_seed_slide - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observation: Bo...
FAILED test/seed_test.py::test_seed_pick_and_place - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (3,), float32), desired_goal: Box(-10.0, 10.0, (3,), float32), observ...
FAILED test/seed_test.py::test_seed_stack - AssertionError: The observation returned by the `reset()` method is not contained with the observation space (Dict(achieved_goal: Box(-10.0, 10.0, (6,), float32), desired_goal: Box(-10.0, 10.0, (6,), float32), observation: Bo...
================================================================================================================== 29 failed, 30 passed, 87 warnings in 5.90s ==================================================================================================================

qgallouedec · 2022-07-05T07:01:55Z

Yes, these errors come from the latest version of gym. I solved the problem yesterday on the master branch. I just included these changes to your branch. Pull the changes, force reinstall gym (pip install gym==0.23) and these problems should be solved.

louixp · 2022-07-05T07:03:39Z

Awesome thanks! I fixed some small errors in tests, but everything should be good now. Everything is green locally.

qgallouedec · 2022-07-05T07:13:34Z

I just thought of something:
Logically, restore_state should also restore the desired goal, right?

louixp · 2022-07-05T07:23:37Z

Isn't the desired goal an object whose state is captured by PyBullet?

qgallouedec · 2022-07-05T07:27:31Z

No, it is the opposite. A target position is sampled first, and then a fake object (for rendering only; the agent can't interact with it) is placed in the simulation.

qgallouedec · 2022-07-05T07:33:40Z

In my opinion, this should work:

import numpy as np
from panda_gym.envs import PandaReachEnv

env = PandaReachEnv()
env.reset()

state_id = env.save_state()

# Perform the action
action = env.action_space.sample()
next_obs1, reward, done, info = env.step(action)

# Restore and perform the same action
env.reset()
env.restore_state(state_id)
next_obs2, reward, done, info = env.step(action)

# The observations in both cases should be equal
assert np.all(next_obs1["achieved_goal"] == next_obs2["achieved_goal"])
assert np.all(next_obs1["observation"] == next_obs2["observation"])
assert np.all(next_obs1["desired_goal"] == next_obs2["desired_goal"])

louixp · 2022-07-05T07:36:07Z

I see what you mean. I didn't add an assertion for the desired goal since it cannot change during an episode, but I can add one.

louixp · 2022-07-05T07:46:59Z

Done!

qgallouedec · 2022-07-05T07:53:09Z

I think I explained it wrong:
Suppose I save the state while the environment has goal A. I then reset the environment, so a new goal B is sampled. When I restore the saved state, I would like the goal to be A again.

I think this can be done by storing the goals in a dictionary that associates state_id with the goal.

qgallouedec · 2022-07-05T07:58:08Z

maybe something like

def save_state(self) -> int:
    state_id = self.sim.save_state()
    self._saved_goal[state_id] = self.task.goal
    return state_id

def restore_state(self, state_id: int) -> None:
    self.sim.restore_state(state_id)
    self.task.goal = self._saved_goal[state_id]

def remove_state(self, state_id: int) -> None:
    self._saved_goal.pop(state_id)
    self.sim.remove_state(state_id)
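
To see the bookkeeping in isolation, here is a self-contained sketch of the same idea. FakeSim and FakeEnv are hypothetical stand-ins written for this illustration and are not panda-gym classes. The goal is kept directly on the environment rather than on a task object, and goals are "sampled" from a counter so the behaviour is deterministic.

```python
import itertools


class FakeSim:
    """Hypothetical stand-in for the physics simulation; it tracks one number."""

    def __init__(self) -> None:
        self.value = 0.0
        self._snapshots = {}  # state_id -> saved value
        self._ids = itertools.count()

    def save_state(self) -> int:
        state_id = next(self._ids)
        self._snapshots[state_id] = self.value
        return state_id

    def restore_state(self, state_id: int) -> None:
        self.value = self._snapshots[state_id]

    def remove_state(self, state_id: int) -> None:
        self._snapshots.pop(state_id)


class FakeEnv:
    """Hypothetical environment that stores the goal alongside each saved state."""

    def __init__(self) -> None:
        self.sim = FakeSim()
        self.goal = None
        self._saved_goal = {}  # state_id -> goal at the time of saving
        self._goal_counter = itertools.count()

    def reset(self) -> None:
        self.sim.value = 0.0
        # A new goal on every reset (deterministic stand-in for random sampling).
        self.goal = (float(next(self._goal_counter)), 0.0, 0.0)

    def save_state(self) -> int:
        state_id = self.sim.save_state()
        self._saved_goal[state_id] = self.goal
        return state_id

    def restore_state(self, state_id: int) -> None:
        self.sim.restore_state(state_id)
        self.goal = self._saved_goal[state_id]

    def remove_state(self, state_id: int) -> None:
        self._saved_goal.pop(state_id)
        self.sim.remove_state(state_id)


env = FakeEnv()
env.reset()
goal_a = env.goal
state_id = env.save_state()
env.reset()  # a different goal is now active
env.restore_state(state_id)
print(env.goal == goal_a)
```

The dictionary has to exist before the first save, so in this sketch it is created in __init__. If goals were mutable objects changed in place, storing a copy would be the safer choice. Here they are immutable tuples, so a plain reference is enough.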

louixp · 2022-07-05T08:11:45Z

I see! Just pushed the change.

qgallouedec · 2022-07-05T08:15:46Z

A useful trick to help you format your code: install black and isort (pip install black isort).
Then run

black <your-directory>
isort <your-directory>

Here:

black -l 127 panda_gym test
isort -l 127 panda_gym test

-l 127 sets the maximum line length to 127 characters.

louixp · 2022-07-05T08:25:00Z

Thanks!

qgallouedec · 2022-07-05T10:04:37Z

Thank you for contributing! Your changes have been included in version 2.0.4 :)

Implemented methods to save and restore PyBullet states.

056dd61

louixp mentioned this pull request Jul 4, 2022

Physics client closed during environment destruction #32

Closed

qgallouedec requested changes Jul 4, 2022

louixp added 2 commits July 4, 2022 13:17

Fixed typos.

f1a3bf8

Added docs for save_state() and remove_state().

933a0bb

louixp added 3 commits July 4, 2022 14:31

Make save and restore state docs visible in index.

ed4ea69

Added unit tests for save and restore states.

e514e76

Added unit test for remove state.

c5fdc0f

qgallouedec linked an issue Jul 5, 2022 that may be closed by this pull request

Physics client closed during environment destruction #32

Closed

Merge branch 'master' of https://github.com/qgallouedec/panda-gym int…

7c7fc1e

…o pr/33

Fixed save and restore test logic.

f57d53a

isort and black

2d4ace1

Test for desired goal consistency during state saving and restoring.

58bd0d1

Save and restore task goal.

2f62339

Run linting.

5c7a902

qgallouedec added 2 commits July 5, 2022 11:12

p to self.physics_client

59bd584

fix docstring style

c96e5a2

qgallouedec approved these changes Jul 5, 2022

Update version

c826f75

qgallouedec merged commit bdc9ae1 into qgallouedec:master Jul 5, 2022
