
[fix] torch.inference_mode in place of torch.no_grad #3188

Merged: 2 commits into aimhubio:main on Jul 14, 2024

Conversation

@pdumin (Contributor) commented on Jul 14, 2024

Changed the inference context to torch.inference_mode() in the test stage.

Set the device automatically with torch.cuda.is_available().
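
For reference, a minimal sketch of the idea (not the exact diff; names like model and test_loader are assumed from the example, not quoted from it):

import torch

# Pick the GPU when one is available, otherwise fall back to the CPU.
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = model.to(device)

# Test stage: inference_mode() disables autograd and tensor version tracking,
# making it a slightly cheaper (and stricter) replacement for no_grad().
model.eval()
correct = 0
total = 0
with torch.inference_mode():
    for images, labels in test_loader:
        images = images.to(device)
        labels = labels.to(device)
        outputs = model(images)
        _, predicted = torch.max(outputs, 1)
        total += labels.size(0)
        correct += (predicted == labels).sum().item()
print(f'Test accuracy: {100 * correct / total:.2f}%')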

@CLAassistant commented on Jul 14, 2024

CLA assistant check: all committers have signed the CLA.


@mihran113 (Contributor) left a comment


Hey @pdumin! Thanks a lot for opening the PR. Everything looks good 🎉 Proceeding to merge.

@mihran113 merged commit 756f41a into aimhubio:main on Jul 14, 2024
1 check passed
@pdumin (Contributor, Author) commented on Jul 15, 2024

@mihran113 Does it make sense to rewrite the training and validation loops in the PyTorch example? Currently, losses and metrics are accumulated somewhat illogically, only every 30 batch iterations. This could be fixed by tracking the metrics and the weight/gradient distributions after every batch, which seems more natural.

I mean replace this:

for i, (images, labels) in enumerate(train_loader):
        images = images.to(device)
        labels = labels.to(device)

        # Forward pass
        outputs = model(images)
        loss = criterion(outputs, labels)

        # Backward and optimize
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

        if i % 30 == 0:
            logging.info(
                'Epoch [{}/{}], Step [{}/{}], ' 'Loss: {:.4f}'.format(
                    epoch + 1, num_epochs, i + 1, total_step, loss.item()
                )
            )

            # aim - Track model loss function
            correct = 0
            total = 0
            _, predicted = torch.max(outputs.data, 1)
            total += labels.size(0)
            correct += (predicted == labels).sum().item()
            acc = 100 * correct / total

            # aim - Track metrics
            items = {'accuracy': acc, 'loss': loss}
            aim_run.track(items, epoch=epoch, context={'subset': 'train'})

            # aim - Track weights and gradients distributions
            track_params_dists(model, aim_run)
            track_gradients_dists(model, aim_run)

with this:

for i, (images, labels) in enumerate(train_loader):
        images = images.to(device)
        labels = labels.to(device)

        # Forward pass
        outputs = model(images)
        loss = criterion(outputs, labels)

        # Backward and optimize
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        # aim - Track model loss function
        correct = 0
        total = 0
        _, predicted = torch.max(outputs.data, 1)
        total += labels.size(0)
        correct += (predicted == labels).sum().item()
        acc = 100 * correct / total

        # aim - Track metrics
        items = {'accuracy': acc, 'loss': loss}
        aim_run.track(items, epoch=epoch, context={'subset': 'train'})

        # aim - Track weights and gradients distributions
        track_params_dists(model, aim_run)
        track_gradients_dists(model, aim_run)

There is also no validation step; I could add one as well.
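
A possible validation step, as a rough sketch only (it would run inside the epoch loop after the training loop; val_loader is hypothetical and not part of the current example, the other names reuse the example's conventions):

# Validation after each epoch, under the same inference context as the test stage.
model.eval()
val_loss = 0.0
correct = 0
total = 0
with torch.inference_mode():
    for images, labels in val_loader:  # val_loader is assumed, not in the existing example
        images = images.to(device)
        labels = labels.to(device)
        outputs = model(images)
        # Assuming criterion uses mean reduction: accumulate the per-sample loss sum
        # so the epoch average is exact regardless of the last batch size.
        val_loss += criterion(outputs, labels).item() * labels.size(0)
        _, predicted = torch.max(outputs, 1)
        total += labels.size(0)
        correct += (predicted == labels).sum().item()

# aim - Track validation metrics under a separate context
items = {'accuracy': 100 * correct / total, 'loss': val_loss / total}
aim_run.track(items, epoch=epoch, context={'subset': 'val'})
model.train()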
