-
Notifications
You must be signed in to change notification settings - Fork 656
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove legacy sync path #19714
Remove legacy sync path #19714
Conversation
This removes all the needless blocking operations that were occurring on hip. Removal should boost speed and allow multi device parallelism.
3ccedad
to
c87a44c
Compare
struct ROCMOptions { | ||
std::string target = ""; | ||
std::string targetFeatures = ""; | ||
std::string bitcodeDirectory = getDefaultBitcodeDirectory(); | ||
int wavesPerEu = 0; | ||
std::string enableROCMUkernels = "none"; | ||
bool legacySync = true; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Digging through history:
- Removing the use of the legacy_sync hack from all but ROCM. #16493 removed legacy_sync from all but ROCM, since "The ROCM HAL does not support command buffers at all"
- At the time, we still had hal/rocm and experimental/hip
- [HIP] Adds graph command buffer & descriptor set and pipeline layout #15910 added graph command buffers to experimental/hip
- [hip] Move into hal/drivers and build by default #16706 promoted experimental/hip to hal/drivers/
- We later deleted hal/rocm
@benvanik @nithinsubbiah does that sound right?
So I think we're ready for this now? All tests seem to be passing.
So my main comment here is that the hip backend still "technically" supports inline execution. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
woohoo! have been trying to delete this for ages!
This removes all the needless blocking operations that were occurring on hip. Removal should boost speed and allow multi device parallelism.