Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve upon error reporting in common #389

Closed
wants to merge 2 commits into from

Conversation

janekb04
Copy link
Contributor

@janekb04 janekb04 commented Aug 18, 2023

Cherry picked from #377

  1. Makes error checking macros report the real source location of the error, rather than a rather unuseful logging.h.
  2. Fixes segfault when calling cuGetErrorString with wrong number of arguments.

Signed-off-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Jan Bielak <jbielak@nvidia.com>
@timmoon10
Copy link
Collaborator

These changes would fix #376. It's similar to #382, so I think we should incorporate these changes into that PR.

timmoon10 added a commit to timmoon10/TransformerEngine that referenced this pull request Aug 18, 2023
Incorporating changes from NVIDIA#389.

Co-authored-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Tim Moon <tmoon@nvidia.com>
@ksivaman
Copy link
Member

ksivaman commented Aug 19, 2023

Can we close this in favor of #382? @timmoon10 @janekb04

@janekb04
Copy link
Contributor Author

I think so. It seems #382 was rebased on this.

@janekb04 janekb04 closed this Aug 21, 2023
timmoon10 added a commit that referenced this pull request Oct 24, 2023
* Do not include logging macros in installed C headers

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Debug logging macros

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Debug C++ tests

Use Google style for header includes.

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Update CUDA driver macros

Incorporating changes from #389.

Co-authored-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Use core error checking macros in PyTorch extensions

Hack to get around macro redefinition warning.

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Fix missing arg when getting CUDA driver error string

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Reuse logging header in frameworks

Signed-off-by: Tim Moon <tmoon@nvidia.com>

---------

Signed-off-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
mingxu1067 pushed a commit to mingxu1067/TransformerEngine that referenced this pull request Nov 3, 2023
* Do not include logging macros in installed C headers

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Debug logging macros

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Debug C++ tests

Use Google style for header includes.

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Update CUDA driver macros

Incorporating changes from NVIDIA#389.

Co-authored-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Use core error checking macros in PyTorch extensions

Hack to get around macro redefinition warning.

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Fix missing arg when getting CUDA driver error string

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Reuse logging header in frameworks

Signed-off-by: Tim Moon <tmoon@nvidia.com>

---------

Signed-off-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Ming Huang <mingh@nvidia.com>
cyanguwa pushed a commit to cyanguwa/TransformerEngine that referenced this pull request Nov 13, 2023
* Do not include logging macros in installed C headers

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Debug logging macros

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Debug C++ tests

Use Google style for header includes.

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Update CUDA driver macros

Incorporating changes from NVIDIA#389.

Co-authored-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Use core error checking macros in PyTorch extensions

Hack to get around macro redefinition warning.

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Fix missing arg when getting CUDA driver error string

Signed-off-by: Tim Moon <tmoon@nvidia.com>

* Reuse logging header in frameworks

Signed-off-by: Tim Moon <tmoon@nvidia.com>

---------

Signed-off-by: Tim Moon <tmoon@nvidia.com>
Co-authored-by: Jan Bielak <jbielak@nvidia.com>
Signed-off-by: Charlene Yang <8636796+cyanguwa@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants