What's the difference between on_step and on_epoch of pl_module.log #8806
I'm using Horovod to train the model. I wonder if
Dear @marsggbo,
When using self.log(..., on_step=True), the metric is computed per step locally (on each process), because synchronisation adds a performance hit.
When using self.log(..., on_step=True, sync_dist=True), the metric is computed per step and reduced across GPUs.
When using self.log(..., on_epoch=True), the metric is aggregated over the batches of the epoch and across GPUs automatically.
Best,
T.C
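
A toy illustration of the three cases described in the reply, written in plain Python rather than with Lightning or Horovod. The loss values and the rank names are made up, and a mean reduction is assumed (the default `reduce_fx` of `self.log`). It only sketches which values get averaged together in each case, not how the library implements it.

```python
from statistics import mean

# Hypothetical per-batch losses for two workers (e.g. two GPUs), three batches each.
losses = {
    "rank0": [0.75, 0.5, 0.25],
    "rank1": [1.25, 1.0, 0.75],
}


def step_local(rank, step):
    """Like on_step=True: one worker's value for one batch, no communication."""
    return losses[rank][step]


def step_synced(step):
    """Like on_step=True, sync_dist=True: one batch's value averaged across workers."""
    return mean(values[step] for values in losses.values())


def epoch_local(rank):
    """Epoch value on a single worker: mean over that worker's own batches."""
    return mean(losses[rank])


def epoch_synced():
    """Like on_epoch=True as described above: reduced over batches and over workers."""
    return mean(epoch_local(rank) for rank in losses)


if __name__ == "__main__":
    print("step 0, rank0 only :", step_local("rank0", 0))
    print("step 0, all workers:", step_synced(0))
    print("epoch, rank1 only  :", epoch_local("rank1"))
    print("epoch, all workers :", epoch_synced())
```

With per-step local logging, each worker reports a different number for the same step; the synced variants trade one extra communication per logged value for a single number that all workers agree on.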