
[quantization speedup] fix data device type bug #3856

Merged
merged 1 commit on Jun 22, 2021

Conversation

linbinskn
Contributor

This PR fixes the data's device type, which has to be "cpu". Otherwise, the tensor cannot be converted to NumPy directly and an error is raised.

@QuanluZhang QuanluZhang requested a review from J-shang June 21, 2021 10:37
@@ -326,6 +329,8 @@ def inference(self, test_data):
             Model input tensor
         """
         # convert pytorch tensor to numpy darray
+        if test_data.device != torch.device("cpu"):
+            test_data = test_data.to("cpu")
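For context, a minimal sketch of the failure this guard prevents, assuming a CUDA-capable PyTorch build; the tensor contents here are illustrative:

```python
# Minimal sketch of the bug this PR fixes: torch.Tensor.numpy()
# only works on CPU tensors. Assumes a CUDA-capable PyTorch build;
# the tensor shape and values are illustrative.
import torch

test_data = torch.randn(4, 8)
if torch.cuda.is_available():
    test_data = test_data.to("cuda")
    # Calling test_data.numpy() here would raise:
    # TypeError: can't convert cuda:0 device type tensor to numpy.
    # Use Tensor.cpu() to copy the tensor to host memory first.

# The guard added by this PR: move the tensor to the CPU, then convert.
if test_data.device != torch.device("cpu"):
    test_data = test_data.to("cpu")
arr = test_data.numpy()  # succeeds on a CPU tensor
```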
Contributor
It is a little strange that inference can only be executed on CPU... why?

Contributor Author
@linbinskn linbinskn Jun 21, 2021
Actually, inference is executed on the GPU after the data is copied from host memory to device memory; our tool performs this copy, and the host-to-device memory copy is unavoidable. We use PyCUDA's memcpy_htod_async API to do it. Requiring all input data to live on the host is simply the unifying convention we chose: we do the memory copy in our engine before calibration and inference. We could handle it differently, e.g. by finding another API that consumes torch's CUDA tensors directly, but that would be more complicated with no extra advantage.
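A minimal sketch of the host-to-device copy flow described above, assuming PyCUDA is installed; the buffer names, shapes, and dtype are illustrative, not taken from the engine code:

```python
# Hypothetical sketch of the host-to-device copy described above.
# Assumes PyCUDA; names like d_input and the input shape are
# illustrative, not from the actual engine code.
import numpy as np
import pycuda.autoinit  # noqa: F401  (creates a CUDA context)
import pycuda.driver as cuda
import torch

# Input arrives as a torch tensor; it must be on the CPU so it can
# be converted to a numpy array (the point of this PR's fix).
test_data = torch.randn(1, 3, 224, 224)
if test_data.device != torch.device("cpu"):
    test_data = test_data.to("cpu")
host_array = np.ascontiguousarray(test_data.numpy(), dtype=np.float32)

# Copy host -> device on a CUDA stream, as the engine does before
# calibration and inference. (For a truly asynchronous copy the host
# buffer should be page-locked, e.g. via cuda.pagelocked_empty.)
stream = cuda.Stream()
d_input = cuda.mem_alloc(host_array.nbytes)
cuda.memcpy_htod_async(d_input, host_array, stream)
stream.synchronize()  # wait for the copy to finish
```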

Contributor

ok, got it

@QuanluZhang QuanluZhang merged commit 27e123d into microsoft:master Jun 22, 2021