Consider adding a pure PyTorch function for segment_mm and gather_mm #56

Closed · tvercaut opened this issue Sep 26, 2024 · 1 comment


segment_mm and gather_mm are borderline in scope for this repo, but they would still be useful additions. DGL provides some nice functions, but installing it is non-trivial:
https://docs.dgl.ai/generated/dgl.ops.gather_mm.html
https://docs.dgl.ai/generated/dgl.ops.segment_mm.html
https://pyg-lib.readthedocs.io/en/latest/modules/ops.html#pyg_lib.ops.segment_matmul
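
For reference, and as I read the DGL documentation above, both operations boil down to per-row / per-segment matrix products. A naive pure-PyTorch reference (an illustrative sketch only, with hypothetical names naive_gather_mm / naive_segment_mm, not an efficient implementation) would be:

import torch


def naive_gather_mm(a, b, idx_b):
    # out[i] = a[i] @ b[idx_b[i]] with a: (N, D1), b: (R, D1, D2), idx_b: (N,)
    return torch.stack([a[i] @ b[idx_b[i]] for i in range(a.shape[0])])


def naive_segment_mm(a, b, seglen_a):
    # Split the rows of a into R segments of lengths seglen_a (a 1-D integer
    # tensor summing to N) and multiply segment k by b[k];
    # a: (N, D1), b: (R, D1, D2), output: (N, D2)
    segments = torch.split(a, seglen_a.tolist())
    return torch.cat([seg @ b[k] for k, seg in enumerate(segments)], dim=0)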

A workaround for gather_mm can be found in the link below and would be a good starting point for inclusion in torchsparsegradutils (it just needs a unit test):
pytorch/pytorch#136747

tvercaut (Member, Author) commented Sep 26, 2024

For the record, the initial workaround

import torch


def my_gather_mm(a, b, idx_b):
    # mimic https://docs.dgl.ai/generated/dgl.ops.gather_mm.html
    R, D1, D2 = b.shape
    N = idx_b.shape[0]

    # Sanity check sizes
    assert a.shape[0] == N and a.shape[1] == D1

    torchdevice = a.device
    src_idx = torch.arange(N, device=torchdevice)

    # Ideally the conversions below to nested tensors would be handled without for loops and without copy
    # (squeeze(-1) keeps the index 1-D even when a single row matches a given value of idx_b)
    nested_a = torch.nested.as_nested_tensor(
        [torch.index_select(a, dim=0, index=torch.nonzero(idx_b == i).squeeze(-1)) for i in range(R)])
    src_idx_reshuffled = torch.cat(
        [torch.index_select(src_idx, dim=0, index=torch.nonzero(idx_b == i).squeeze(-1)) for i in range(R)])
    nested_b = torch.nested.as_nested_tensor(
        [b[i] for i in range(R)])  # each b[i] is already a (D1, D2) matrix

    # The actual gather matmul computation
    nested_ab = torch.matmul(nested_a, nested_b)

    # Convert back to a regular tensor; again, ideally this would be handled natively with no copy
    ab_segmented = torch.cat(nested_ab.unbind(), dim=0)
    ab = torch.empty((N, D2), device=torchdevice, dtype=a.dtype)
    ab[src_idx_reshuffled] = ab_segmented
    return ab

can be simplified a bit

def my_gather_mm(a, b, idx_b):
    # mimic https://docs.dgl.ai/generated/dgl.ops.gather_mm.html
    R, D1, D2 = b.shape
    N = idx_b.shape[0]

    # Sanity check sizes
    assert a.shape[0] == N and a.shape[1] == D1

    torchdevice = a.device
    src_idx = torch.arange(N, device=torchdevice)

    # Ideally the conversions below to nested tensors would be handled without for loops and without copy
    nested_a = torch.nested.as_nested_tensor([a[idx_b == i, :] for i in range(R)])
    src_idx_reshuffled = torch.cat([src_idx[idx_b == i] for i in range(R)])
    nested_b = torch.nested.as_nested_tensor([b[i] for i in range(R)])

    # The actual gather matmul computation
    nested_ab = torch.matmul(nested_a, nested_b)

    # Convert back to a regular tensor; again, ideally this would be handled natively with no copy
    ab_segmented = torch.cat(nested_ab.unbind(), dim=0)
    ab = torch.empty((N, D2), device=torchdevice, dtype=a.dtype)
    ab[src_idx_reshuffled] = ab_segmented
    return ab
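
Since the main missing piece for inclusion is a unit test, here is a minimal sketch of one (assuming a pytest-style test with my_gather_mm importable from the surrounding module; the sizes and the brute-force reference loop are arbitrary illustrative choices):

import torch


def test_my_gather_mm():
    torch.manual_seed(0)
    N, R, D1, D2 = 10, 3, 4, 5
    a = torch.randn(N, D1)
    b = torch.randn(R, D1, D2)
    # idx_b = arange(N) % R guarantees every matrix in b is used at least once,
    # which avoids empty segments in the nested-tensor construction
    idx_b = torch.arange(N) % R

    # Brute-force reference: out[i] = a[i] @ b[idx_b[i]]
    expected = torch.stack([a[i] @ b[idx_b[i]] for i in range(N)])

    result = my_gather_mm(a, b, idx_b)
    assert result.shape == (N, D2)
    assert torch.allclose(result, expected, atol=1e-6)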

tvercaut added a commit that referenced this issue Sep 27, 2024
* Added simple workarounds for gather_mm and segment_mm. See #56

* bumping Python and PyTorch versions in CI

* enabling black on notebooks in CI

* updating GitHub Actions to avoid deprecation warnings
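
Regarding the segment_mm half of the request, a pure-PyTorch sketch in the same nested-tensor style as the gather_mm workaround above (an illustration of the idea only, under the hypothetical name my_segment_mm, not the code from the referenced commit; it assumes seglen_a is a 1-D integer tensor with positive entries summing to a.shape[0]) could look like:

import torch


def my_segment_mm(a, b, seglen_a):
    # mimic https://docs.dgl.ai/generated/dgl.ops.segment_mm.html
    # a: (N, D1) split into R row segments of lengths seglen_a; b: (R, D1, D2)
    R, D1, D2 = b.shape
    assert seglen_a.shape[0] == R and int(seglen_a.sum()) == a.shape[0]
    assert a.shape[1] == D1

    # Per-segment matmul via nested tensors (segments stay in their original order)
    nested_a = torch.nested.as_nested_tensor(list(torch.split(a, seglen_a.tolist())))
    nested_b = torch.nested.as_nested_tensor([b[i] for i in range(R)])
    nested_ab = torch.matmul(nested_a, nested_b)

    # Since the segments are contiguous and in order, concatenating restores the (N, D2) layout
    return torch.cat(nested_ab.unbind(), dim=0)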