
[BUG] Collectors of batched environments return more frames than requested #846

Closed
matteobettini opened this issue Jan 19, 2023 · 2 comments
Labels
bug Something isn't working Good first issue A good way to start hacking torchrl!

Comments

matteobettini (Contributor) commented Jan 19, 2023

Describe the bug

The collectors currently force the actual number of frames collected per batch to be divisible by the number of batched environments (which can be collector workers or parallel workers; with #828 these could also be vectorized dimensions in the batch size).

As a result, a user who passes a desired frames_per_batch at collector creation actually gets more frames than requested, as the following example shows:

import torch.nn as nn
from tensordict.nn import TensorDictModule
from torchrl.collectors import SyncDataCollector
from torchrl.envs import ParallelEnv
from torchrl.envs.libs.gym import GymEnv

gym_env = lambda: GymEnv("Pendulum-v1", device="cpu")
gym_parallel_env = lambda: ParallelEnv(10, gym_env)

pendulum_policy = TensorDictModule(
    nn.Linear(3, 1), in_keys=["observation"], out_keys=["action"]
)

coll = SyncDataCollector(
    gym_parallel_env,
    pendulum_policy,
    total_frames=20000,
    max_frames_per_traj=5,
    frames_per_batch=145,  # not divisible by the 10 parallel envs
    split_trajs=False,
)

for data in coll:
    print("Ending", data)  # batch_size=torch.Size([10, 15]), i.e. 150 frames rather than the 145 requested
    break

This is caused, for example, by code like this:

self.frames_per_batch = -(-frames_per_batch // self.n_env)  # ceil division: the per-env frame count is rounded up

This behavior can be dangerous: users may believe they are training on x frames at each iteration when they are in fact training on x + y frames.
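
For the numbers in the example, this ceil division rounds the 145 requested frames up to the next multiple of 10; a quick standalone check:

frames_per_batch, n_env = 145, 10
# -(-a // b) is ceil(a / b) in integer arithmetic
per_env_frames = -(-frames_per_batch // n_env)  # 15
print(per_env_frames * n_env)  # 150 frames collected instead of the 145 requested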

Solutions

  1. Throw an error if frames_per_batch is not divisible by the number of batched envs (options 1 and 2 are sketched below).
  2. Throw a warning if frames_per_batch is not divisible by the number of batched envs.
  3. Find a way to return only the requested number of frames by discarding some of the collected data.
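
A minimal sketch of options 1 and 2, assuming a hypothetical helper inside the collector (the name, signature, and message are illustrative, not the actual torchrl API):

import warnings

def _validate_frames_per_batch(frames_per_batch: int, n_env: int, strict: bool = False) -> None:
    # Hypothetical check, run before the collector rounds the frame count up
    if frames_per_batch % n_env != 0:
        rounded = -(-frames_per_batch // n_env) * n_env
        msg = (
            f"frames_per_batch={frames_per_batch} is not divisible by the number "
            f"of batched environments ({n_env}); the collector would return "
            f"{rounded} frames instead."
        )
        if strict:
            raise ValueError(msg)  # option 1: hard error
        warnings.warn(msg)  # option 2: warn and keep the rounded-up value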
matteobettini added the bug label on Jan 19, 2023
vmoens (Contributor) commented Jan 19, 2023

Makes sense, I'll add that to my collector refactoring.

vmoens added the Good first issue label on Jan 19, 2023
matteobettini (Contributor, Author) commented:

#828 introduces warnings for this, but we would like to turn those into errors to resolve this issue.
