The original Glow code uses gradient checkpointing, a very efficient way of reducing peak memory consumption. The following single-line change adds gradient checkpointing in a way that reduces memory consumption from 11 GB to 2 GB. It allowed me to increase the batch size from 64 to 256 with no issue. I think 512 is possible, maybe even 1024 if we use float16 for some of the layers.
```python
def forward(self, x, ldj, reverse=False):
    x_change, x_id = x.chunk(2, dim=1)
    # st = self.nn(x_id)  # change this line to the one below
    st = torch.utils.checkpoint.checkpoint(self.nn, x_id)
    s, t = st[:, 0::2, ...], st[:, 1::2, ...]
    s = self.scale * torch.tanh(s)
```
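For reference, here is a minimal, self-contained sketch of the trade-off `torch.utils.checkpoint.checkpoint` makes: the wrapped module's intermediate activations are discarded after the forward pass and recomputed during backward, so outputs and gradients match the plain call while peak memory drops. The small `net` below is a stand-in for `self.nn`, purely for illustration:

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

# Hypothetical stand-in for the coupling layer's `self.nn`.
net = nn.Sequential(
    nn.Conv2d(2, 8, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.Conv2d(8, 4, kernel_size=3, padding=1),
)

x = torch.randn(16, 2, 8, 8, requires_grad=True)

# Plain forward: every intermediate activation is kept for backward.
y_plain = net(x)

# Checkpointed forward: activations inside `net` are dropped and
# recomputed on the backward pass, trading extra compute for memory.
y_ckpt = checkpoint(net, x, use_reentrant=False)

# Both paths produce the same output, and gradients still flow.
assert torch.allclose(y_plain, y_ckpt)
y_ckpt.sum().backward()
assert x.grad is not None
```

Note that checkpointing roughly doubles the forward compute for the wrapped segment, which is why the batch size can be raised so much once memory is no longer the bottleneck.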
Thanks for the suggestion. I added a reference to this issue in the README. If you'd like to add support for checkpointing as a command line argument, feel free to open a pull request and I'll happily review.
glow/models/glow/coupling.py, line 28 in 59ed99f