[Feature Request] Vectorized/multi-agent environments compatibility issues #777

matteobettini · 2022-12-31T15:10:49Z

Motivation

Vectorized environments run their simulations in batches, which lets them benefit from parallel computation on GPUs. These environments have their own batch sizes, whose dimensions can serve different purposes.

For example:

Brax observations have shape (n_vectorized_envs, obs_size)
Vmas observations have shape (n_vectorized_envs, n_agents, obs_size)

Currently, the torchrl environment infrastructure has some issues with environments that have non-empty batch sizes or that have a batch dimension for agents.

Ideally, we would like to use vectorized environments freely in torchrl and leverage its features, such as ParallelEnv and Collectors, on top of such environments. This would create tensordicts with many dimensions in the batch_size, for example:

tensordict.batch_size = (
    n_parallel_envs, # from ParallelEnv
    n_agents, # from env.batch_size
    n_vectorized_envs, # from env.batch_size
    *other_env_dimensions, # from env.batch_size
    n_rollout_samples # from env.rollout()
)
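As a rough illustration of how these dimensions would compose, the sketch below builds the batch size from its parts using plain tuples. The helper name `compose_batch_size` and the example sizes are hypothetical and not part of torchrl; the ordering simply follows the layout listed above.

```python
from typing import Sequence, Tuple


def compose_batch_size(
    n_parallel_envs: int,
    env_batch_size: Sequence[int],
    n_rollout_samples: int,
) -> Tuple[int, ...]:
    """Concatenate the ParallelEnv, env and rollout dimensions (hypothetical helper)."""
    return (n_parallel_envs, *env_batch_size, n_rollout_samples)


# Assumed example: a Vmas-like env with 3 agents and 32 vectorized simulations.
env_batch_size = (3, 32)  # (n_agents, n_vectorized_envs)
batch_size = compose_batch_size(4, env_batch_size, 100)
print(batch_size)  # (4, 3, 32, 100)

# An observation entry would then carry its feature dimension after the batch dims.
obs_size = 18
obs_shape = (*batch_size, obs_size)
print(obs_shape)  # (4, 3, 32, 100, 18)
```

An environment with an empty batch size reduces to the familiar `(n_parallel_envs, n_rollout_samples)` case.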

I created this issue to list and organize all the issues that need to be addressed in order to generalize to BaseEnvs with general batch sizes in torchrl:

Issues

Stacking tensordicts of heterogeneous shapes and NestedTensors compatibility (#766)(PR)

When some of the dimensions of the vectorized environment are heterogeneous (agents with different observation and action spaces that still share the other batch dimensions), we need to carry this heterogeneous data in a suitable data structure.

NestedTensors provide a natural candidate for this task. Here is a list of the operations that need to be supported by NestedTensors in order to enable this feature:

stacking along any dim
shape (not only size)
indexing along any dim that is compatible
stacking nested tensors together (currently we can't combine two nested tensors containing tensors of shape [[a, b], [a, c]] into a single one of shape [[[a, b], [a, c]], [[a, b], [a, c]]])
NestedTensors of NestedTensors
Nested tensor arithmetic and algebraic operations
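To make the "shape (not only size)" requirement more concrete, here is a minimal sketch of what a shape could look like for a stack of tensors with differing shapes. It works on shape tuples only and does not use NestedTensors; the helper `stacked_shape` and the convention of reporting a heterogeneous dimension as -1 are assumptions made for illustration.

```python
from typing import Sequence, Tuple


def stacked_shape(shapes: Sequence[Tuple[int, ...]]) -> Tuple[int, ...]:
    """Shape of a stack of tensors whose shapes may differ (hypothetical helper).

    Dimensions on which all tensors agree keep their size; dimensions that
    differ are reported as -1. The leading dimension is the stack size.
    """
    if not shapes:
        raise ValueError("need at least one shape")
    ndim = len(shapes[0])
    if any(len(s) != ndim for s in shapes):
        raise ValueError("all tensors must have the same number of dimensions")
    merged = []
    for sizes in zip(*shapes):
        merged.append(sizes[0] if len(set(sizes)) == 1 else -1)
    return (len(shapes), *merged)


# Two agents sharing the first dimension (a=5) but with different feature sizes.
a, b, c = 5, 3, 7
print(stacked_shape([(a, b), (a, c)]))  # (2, 5, -1)
print(stacked_shape([(a, b), (a, b)]))  # (2, 5, 3)
```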

Heterogeneous `CompositeSpec` (#766)(PR #829)

Bug on how `ParallelEnv` sets the `batch_size` (#773)(PR #774)

Bug on using `sorted()` on `CompositeSpec` keys (#775)(PR #787)

Handling of the done flag when it has arbitrary dimensions (#776)(PR #788)

The `_reset()` method needs to be able to know which dimensions and indexes to reset (#790)(PR #800)

Collectors crash with environments with non-empty batch_size (#807)(PR #828)

vmoens · 2022-12-31T15:28:08Z

For the last one (_reset should know the batch size) we could just pass an empty TensorDict instance. Wdyt?

matteobettini · 2022-12-31T15:33:22Z

> For the last one (_reset should know the batch size) we could just pass an empty TensorDict instance. Wdyt?

There might be use cases where only some of the dimensions of the vector have to be reset. For example, the done flag can state that only some simulations in the vector have to be reset.

This is why methods such as reset_at() exist in rllib VectorEnv (https://github.com/ray-project/ray/blob/master/rllib/env/vector_env.py#L104)

vmoens · 2022-12-31T15:35:09Z

We have that in ParallelEnv through a "reset_workers" key IIRC.
We could make a reset_at helper that writes the Boolean mask in the tensordict.

matteobettini · 2022-12-31T15:38:27Z

> We have that in ParallelEnv through a "reset_workers" key IIRC. We could make a reset_at helper that writes the Boolean mask in the tensordict.

Exactly, a key like that can be used in the reset() function of BaseEnv, and instead of being limited to the worker dimension it would span all the batch dims of the env.

If this key is not present, the default could be to reset all dims.
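A minimal sketch of the idea discussed above, using a plain dict and lists as stand-ins for a tensordict and a batched state. The key name `"reset_mask"`, the signatures of `reset_at` and `reset`, and the single batch dimension are all assumptions for illustration, not torchrl's API; a real mask would span every batch dimension of the env.

```python
from typing import Dict, List, Optional

RESET_KEY = "reset_mask"  # hypothetical key name, not a torchrl constant


def reset_at(data: Dict[str, list], indices: List[int], batch_size: int) -> Dict[str, list]:
    """Write a Boolean mask marking which batch entries should be reset."""
    mask = [False] * batch_size
    for i in indices:
        mask[i] = True
    data[RESET_KEY] = mask
    return data


def reset(state: List[float], data: Optional[Dict[str, list]] = None) -> List[float]:
    """Reset the masked entries to 0.0; reset everything if no mask is given."""
    mask = (data or {}).get(RESET_KEY, [True] * len(state))
    return [0.0 if m else s for s, m in zip(state, mask)]


state = [1.0, 2.0, 3.0, 4.0]
print(reset(state, reset_at({}, [1, 3], batch_size=4)))  # [1.0, 0.0, 3.0, 0.0]
print(reset(state))  # [0.0, 0.0, 0.0, 0.0]
```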

matteobettini added the enhancement New feature or request label Dec 31, 2022

matteobettini assigned vmoens Dec 31, 2022

matteobettini mentioned this issue Jan 3, 2023

[Feature] Vmas library wrapper #785

Merged

11 tasks

matteobettini changed the title ~~[Feature Request] Vectorized environments compatibility issues~~ [Feature Request] Vectorized/multi-agent environments compatibility issues Jan 10, 2023

matteobettini mentioned this issue Jan 19, 2023

[BUG] Memory leak? #845

Closed

matteobettini mentioned this issue Feb 4, 2023

[Feature Request] Doc revamp #883

Open

5 tasks

matteobettini closed this as completed Mar 29, 2023
