[POC] Adds TH_TENSOR_APPLY2_PARALLEL #395
Open
+165
−2
Following the discussion in #323, I've tried to add a macro, `TH_TENSOR_APPLY2_PARALLEL`, which uses OpenMP if both tensors are contiguous. For the moment, I haven't set a threshold for deciding whether to use OpenMP or not.

As a proof of concept, I added it to the unary operations implemented by `LAB_IMPLEMENT_BASIC_FUNCTION` (like `abs`, `tan`, etc.).

Any thoughts?
cc @dominikgrewe
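
To make the idea concrete, here is a minimal, self-contained sketch of the dispatch described above: take an OpenMP parallel loop when both operands are contiguous, and fall back to a serial strided loop otherwise. It is not the code in this PR. The `Vec` struct, `vec_is_contiguous`, `APPLY2_PARALLEL_SKETCH`, `abs_op`, and `vec_abs` are all hypothetical names invented for illustration, and the sketch only handles 1-D data rather than real TH tensors.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical 1-D tensor view; an assumption, not the TH tensor struct. */
typedef struct {
    double *data;
    ptrdiff_t size;
    ptrdiff_t stride; /* in elements; 1 means contiguous */
} Vec;

static int vec_is_contiguous(const Vec *v) { return v->stride == 1; }

/*
 * Hypothetical stand-in for the proposed macro: dst[i] = FN(src[i]).
 * Contiguous operands take the OpenMP path; anything else runs serially
 * with explicit strides. Without OpenMP enabled at compile time the
 * pragma is ignored and both paths run serially.
 */
#define APPLY2_PARALLEL_SKETCH(DST, SRC, FN)                               \
    do {                                                                   \
        Vec *d_ = (DST);                                                   \
        const Vec *s_ = (SRC);                                             \
        ptrdiff_t n_ = d_->size;                                           \
        ptrdiff_t i_;                                                      \
        assert(d_->size == s_->size);                                      \
        if (vec_is_contiguous(d_) && vec_is_contiguous(s_)) {              \
            double *dp_ = d_->data;                                        \
            const double *sp_ = s_->data;                                  \
            _Pragma("omp parallel for")                                    \
            for (i_ = 0; i_ < n_; i_++)                                    \
                dp_[i_] = FN(sp_[i_]);                                     \
        } else {                                                           \
            for (i_ = 0; i_ < n_; i_++)                                    \
                d_->data[i_ * d_->stride] = FN(s_->data[i_ * s_->stride]); \
        }                                                                  \
    } while (0)

/* Illustrative unary op, defined locally so the sketch needs no libm. */
static double abs_op(double x) { return x < 0 ? -x : x; }

/* Example of a unary operation built on the macro. */
void vec_abs(Vec *dst, const Vec *src)
{
    APPLY2_PARALLEL_SKETCH(dst, src, abs_op);
}
```

If a size threshold is added later, one option would be an `if(...)` clause on the `parallel for` pragma, so that small inputs stay on a single thread without a separate code path.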