Skip to content

CUTLASS 1.3.2

Compare
Choose a tag to compare
@kerrmudgeon kerrmudgeon released this 10 Jul 18:42
b5cab17

Performance enhancement for Volta Tensor Cores TN layout

  • Fixed performance defect with indirect access to pointer array for Volta TensorCores TN arrangement.