RPP Tensor Transpose ND on HOST and HIP #362

Merged
merged 149 commits into from
Jun 28, 2024

r-abishek
Member

Adds support for Tensor Transpose ND on HOST and HIP
Adds relevant Unit/Perf/QA tests

r-abishek and others added 30 commits November 27, 2023 22:10
made changes in test suite to read fixed input and compare with golden output
made changes to update destination strides based on permute order
added golden input and outputs for 3D [2, 0, 1] transpose version

removed golden inputs and outputs for 6D data
Transpose ND HIP Kernel Optimizations
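
For readers following the commits above, here is a minimal reference sketch of how destination strides can be derived from a permute order and used for an ND transpose. It is not the RPP implementation: the function names `rowMajorStrides` and `transposeND` are hypothetical, the tensors are assumed to be dense and row-major, and the permutation is assumed to follow the NumPy-style convention where destination axis `d` takes source axis `perm[d]`.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Row-major strides (in elements) for a dense tensor of the given shape.
std::vector<size_t> rowMajorStrides(const std::vector<size_t> &shape)
{
    std::vector<size_t> strides(shape.size(), 1);
    for (size_t i = shape.size(); i-- > 1;)
        strides[i - 1] = strides[i] * shape[i];
    return strides;
}

// Hypothetical reference transpose (assumption: dst axis d = src axis perm[d]).
std::vector<float> transposeND(const std::vector<float> &src,
                               const std::vector<size_t> &srcShape,
                               const std::vector<size_t> &perm)
{
    const size_t nDim = srcShape.size();
    assert(perm.size() == nDim);

    // The destination shape is the source shape reordered by the permutation.
    std::vector<size_t> dstShape(nDim);
    for (size_t d = 0; d < nDim; d++)
    {
        assert(perm[d] < nDim);
        dstShape[d] = srcShape[perm[d]];
    }

    // Destination strides depend on the permute order through dstShape.
    std::vector<size_t> dstStrides = rowMajorStrides(dstShape);

    // Destination stride seen when stepping one element along each source axis.
    std::vector<size_t> dstStrideForSrcAxis(nDim);
    for (size_t d = 0; d < nDim; d++)
        dstStrideForSrcAxis[perm[d]] = dstStrides[d];

    std::vector<float> dst(src.size());
    std::vector<size_t> idx(nDim, 0); // multi-index into src
    for (size_t in = 0; in < src.size(); in++)
    {
        size_t dstOffset = 0;
        for (size_t a = 0; a < nDim; a++)
            dstOffset += idx[a] * dstStrideForSrcAxis[a];
        dst[dstOffset] = src[in];

        // Advance the source multi-index in row-major order.
        for (size_t a = nDim; a-- > 0;)
        {
            if (++idx[a] < srcShape[a])
                break;
            idx[a] = 0;
        }
    }
    return dst;
}
```

Under these assumptions, a source of shape `{2, 3, 4}` with permute order `{2, 0, 1}` gives a destination of shape `{4, 2, 3}` with strides `{6, 3, 1}`.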
@kiritigowda kiritigowda requested a review from rrawther May 24, 2024 06:30
@kiritigowda kiritigowda self-assigned this May 24, 2024
@kiritigowda kiritigowda added the enhancement New feature or request label May 24, 2024
@kiritigowda kiritigowda changed the base branch from develop to master May 24, 2024 06:40
@kiritigowda kiritigowda requested a review from a team as a code owner May 24, 2024 06:40
@kiritigowda kiritigowda changed the base branch from master to develop May 24, 2024 06:45
@kiritigowda kiritigowda marked this pull request as draft May 24, 2024 08:04
@r-abishek r-abishek marked this pull request as ready for review May 29, 2024 02:42
rpp_simd_store(rpp_store4_f32_to_f32, dstPtrRow[6], &pDst[6]);
rpp_simd_store(rpp_store4_f32_to_f32, dstPtrRow[7], &pDst[7]);

srcPtrRow[0] += vectorIncrement;
Contributor

Can you combine these 8 increments into a single AVX instruction?
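
One possible reading of this suggestion, sketched under assumptions rather than taken from the PR: eight 64-bit pointers do not fit in one 256-bit AVX register, so the sketch tracks eight 32-bit element offsets instead and advances all of them with a single `_mm256_add_epi32` when AVX2 is enabled. The helper name `advanceRowOffsets` is hypothetical, and a scalar fallback keeps the code portable.

```cpp
#include <cassert>
#include <cstdint>
#if defined(__AVX2__)
#include <immintrin.h>
#endif

// Hypothetical helper: advance eight 32-bit row offsets by the same step.
// With AVX2 enabled this is one 256-bit integer add; otherwise a scalar loop.
inline void advanceRowOffsets(int32_t offsets[8], int32_t step)
{
    assert(offsets != nullptr);
#if defined(__AVX2__)
    __m256i v = _mm256_loadu_si256(reinterpret_cast<const __m256i *>(offsets));
    v = _mm256_add_epi32(v, _mm256_set1_epi32(step));
    _mm256_storeu_si256(reinterpret_cast<__m256i *>(offsets), v);
#else
    for (int i = 0; i < 8; i++)
        offsets[i] += step;
#endif
}
```

Each row would then be addressed as a base pointer plus `offsets[i]`. Whether this beats eight scalar pointer increments would need to be measured, since the offsets have to be read back from the vector register to form addresses.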

Contributor

@rrawther rrawther left a comment

Added some comments.

@rrawther rrawther requested review from rrawther and AryanSalmanpour and removed request for a team June 13, 2024 00:55
@kiritigowda kiritigowda merged commit 3077458 into ROCm:develop Jun 28, 2024
1 check was pending