Fp16 nchw for cudnn-fp16 backend (support GTX 16xx GPUs) #849
Merged
Commits on Nov 11, 2018
-
Merge pull request #3 from LeelaChessZero/master
use bestmove_is_sent_ for Search::IsSearchActive() (LeelaChessZero#502)
Commit e3ad2c0
Commits on Nov 21, 2018
-
Commit b2e5114
Commits on Dec 16, 2018
-
Commit beed96e
Commits on Dec 21, 2018
-
Commit 80ac4a1
Commits on Jan 15, 2019
-
Commit 0f7bc50
Commits on Feb 13, 2019
-
Commit e4737e3
-
  - replace all cudaMemcpyAsync calls used for loading weights with cudaMemcpy, as the source (in CPU memory) could be deleted before the async version of the function actually does the copy.
  - minor naming/style changes.
  - add comment explaining what the policy map layer does and how the layout conversion from CHW to HWC works.
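The CHW to HWC layout conversion mentioned in this commit can be pictured with a small CPU-side sketch. This is not the code from the PR; the function name `ChwToHwc` and the flat `std::vector` representation are assumptions made for illustration only.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not from the PR): reorders one tensor from CHW
// (channel is the slowest dimension) to HWC (channel is the fastest).
std::vector<float> ChwToHwc(const std::vector<float>& chw, std::size_t C,
                            std::size_t H, std::size_t W) {
  assert(chw.size() == C * H * W);
  std::vector<float> hwc(chw.size());
  for (std::size_t c = 0; c < C; ++c) {
    for (std::size_t h = 0; h < H; ++h) {
      for (std::size_t w = 0; w < W; ++w) {
        // CHW index: all of channel 0 comes first, then channel 1, ...
        const std::size_t src = (c * H + h) * W + w;
        // HWC index: the C values of one square sit next to each other.
        const std::size_t dst = (h * W + w) * C + c;
        hwc[dst] = chw[src];
      }
    }
  }
  return hwc;
}
```

Only the index arithmetic matters here: the same element moves from offset `(c * H + h) * W + w` to offset `(h * W + w) * C + c`.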
Commit 49eb8e8
-
Commit acfd7c1
-
Commit 33f3d57
Commits on Feb 14, 2019
-
Commit 1976777
Commits on Feb 19, 2019
-
Commit 8f46984
Commits on May 4, 2019
-
Commit b8dd014
Commits on May 11, 2019
-
support cudnn-fp16 backend on GPUs without tensor cores
  - try NCHW layout and Winograd algorithm for convolutions (same as what we use for fp32).
  - it's slower than NHWC/fp16 on GPUs with tensor cores, but should give some speedup on GP100 and TU11x GPUs.
Commit 7211cda
-
Commit dd8c0ae
-
Commit a73cfe8
Commits on May 12, 2019
-
add check for cards with no tensor cores
  - GP100 (SM 6.0)
  - GTX 16xx GPUs (unfortunately they report the same SM 7.5 version, so a string compare is needed)
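A minimal sketch of such a check, assuming the device's compute capability and name are already available. `DeviceInfo` is a hypothetical stand-in for the `major`, `minor` and `name` fields of `cudaDeviceProp`, and the `"GTX 16"` substring is an assumed pattern rather than the string used in the PR.

```cpp
#include <string>

// Hypothetical stand-in for the cudaDeviceProp fields used here
// (major, minor, name); the real struct comes from the CUDA runtime.
struct DeviceInfo {
  int major;
  int minor;
  std::string name;
};

// Returns true for the two GPU families the commit lists as lacking
// tensor cores. Other GPUs are deliberately not classified here.
bool IsListedGpuWithoutTensorCores(const DeviceInfo& dev) {
  // GP100 reports compute capability 6.0.
  if (dev.major == 6 && dev.minor == 0) return true;
  // GTX 16xx reports 7.5, so the version number alone is not enough and
  // the device name is inspected. "GTX 16" is an assumed pattern.
  if (dev.major == 7 && dev.minor == 5 &&
      dev.name.find("GTX 16") != std::string::npos) {
    return true;
  }
  return false;
}
```

A name-based test is fragile (it depends on how the driver spells the product name), which is presumably why the commit message calls the string compare unfortunate.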
Commit 017c07c
-
Commit 445c7c6
-
add backend-opt to force nhwc on or off
default is auto-select (-1).
Commit d8049a6
-
Commit 9daed1a
-
Use bool option instead of int and use IsDefault mechanism to check if the option was forced or not.
Commit 124eef1
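The last two option-related commits describe a setting that is either forced on, forced off, or left to auto-select. A rough sketch of that resolution logic follows. `NhwcOption` and `UseNhwc` are hypothetical names, not the project's options API, and the auto-select rule (NHWC only when tensor cores are present) is an assumption drawn from the earlier commit message about NHWC/fp16 being faster on such GPUs.

```cpp
// Hypothetical option holder: "not set" means auto-select.
struct NhwcOption {
  bool is_set = false;  // true once the user passes the option explicitly
  bool value = false;   // only meaningful when is_set is true

  // Mirrors the idea of an IsDefault check: true if the user did not
  // force the option either way.
  bool IsDefault() const { return !is_set; }
};

// An explicit setting wins; otherwise the layout is picked from the
// hardware capability (assumed rule: NHWC only with tensor cores).
bool UseNhwc(const NhwcOption& opt, bool has_tensor_cores) {
  if (opt.IsDefault()) return has_tensor_cores;
  return opt.value;
}
```

Compared with an int option that uses -1 for auto-select, a bool plus an "is default" query keeps the forced values as plain true/false and avoids a magic number.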