Describe the bug
The collectors currently force the number of frames actually collected per batch to be divisible by the number of batched environments. These can be collector workers or parallel workers and, in light of #828, possibly also vectorized dimensions in the batch size.
As a result, a user who passes a desired frames_per_batch at collector creation can actually receive more frames than requested, as you can see in the following example:
This behavior can be dangerous: users may believe they are training on x frames at each iteration when they are in fact training on x+y frames.
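The arithmetic behind the mismatch can be sketched in plain Python. This is only an illustration, not the collector's actual code: it assumes every batched environment contributes the same number of steps and that the per-environment step count is rounded up. `effective_frames_per_batch` is a hypothetical helper introduced here.

```python
def effective_frames_per_batch(requested: int, num_batched_envs: int) -> int:
    """Round `requested` up to the next multiple of `num_batched_envs`.

    Assumption: each batched env takes the same number of steps, so the
    batch size can only be a multiple of the number of batched envs.
    """
    # Integer ceiling division: steps each env must take to cover the request.
    steps_per_env = -(-requested // num_batched_envs)
    return steps_per_env * num_batched_envs


requested = 50   # x: what the user asked for
num_envs = 4     # number of batched environments
actual = effective_frames_per_batch(requested, num_envs)
extra = actual - requested  # y: frames the user did not ask for
print(f"requested={requested}, actual={actual}, extra={extra}")
```

Under these assumptions, a request that is already a multiple of the number of batched environments is returned unchanged, so the discrepancy only appears for non-divisible values.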
Solutions
- Throw an error if frames_per_batch is not divisible by the number of batched envs
- Throw a warning if frames_per_batch is not divisible by the number of batched envs
- Find a way to return only the requested number of frames by discarding some of the collected data
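The three options could look roughly like the sketch below. It is a standalone illustration under stated assumptions, not a patch against the collectors: `resolve_frames` and its `on_mismatch` argument are hypothetical names, and the function only computes how many frames would be collected and how many would be returned.

```python
import warnings


def resolve_frames(frames_per_batch: int, num_batched_envs: int,
                   on_mismatch: str = "warn") -> tuple[int, int]:
    """Return (frames_to_collect, frames_to_return) for a requested batch size.

    Hypothetical helper: `on_mismatch` selects one of the proposed behaviors
    ("error", "warn", or "truncate") when the request is not divisible.
    """
    remainder = frames_per_batch % num_batched_envs
    if remainder == 0:
        return frames_per_batch, frames_per_batch

    # Smallest multiple of num_batched_envs that covers the request.
    rounded = frames_per_batch + num_batched_envs - remainder
    msg = (f"frames_per_batch={frames_per_batch} is not divisible by "
           f"{num_batched_envs} batched envs; {rounded} frames would be collected.")

    if on_mismatch == "error":
        raise ValueError(msg)
    if on_mismatch == "warn":
        warnings.warn(msg)
        return rounded, rounded          # user gets more than requested
    if on_mismatch == "truncate":
        return rounded, frames_per_batch  # extra frames are discarded
    raise ValueError(f"unknown on_mismatch value: {on_mismatch!r}")


print(resolve_frames(10, 4, on_mismatch="truncate"))
```

The error option is the strictest and keeps the collected data unchanged, the warning preserves current behavior while making it visible, and truncation honors the request at the cost of throwing away collected frames.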