Skip to content

NVBit-1.7.2

Latest
Compare
Choose a tag to compare
@x-y-z x-y-z released this 13 Dec 15:35
ff94852
  1. [API change] nvbit_set_at_launch(CUcontext ctx, CUfunction func, uint64_t param_val, CUstream custream = nullptr,uint64_t launch_handle = 0) now accepts parameter value instead of a pointer to the parameter. The newly added custream and launch_handle are provided and used during nvbit_at_graph_node_launch() to help set the parameter for CUDA graph kernel node.
  2. Improved cubin compatibility
  3. Fixed SASS instruction parsing
  4. Improved CUDA graph support
  5. [experimental] Changed mem_trace to support CUDA graph.
  6. Fixed related function detection for the function pointer case.