Introduce tensor support #70

r-abishek · 2021-08-31T15:12:39Z

Adds all RPP framework headers and changes needed for tensor support.
Adds all unit test and performance test support for brightness tensor.

… into ar/tensor_support

… methods

AryanSalmanpour · 2021-09-07T17:05:36Z

src/include/hip/rpp_hip_common.hpp

+    d_half8 dst_h8;
+
+    dst_h8.x.x = __float22half2_rn(*(float2 *)&(dst_f8->x));
+    dst_h8.x.y = __float22half2_rn(*((float2 *)&(dst_f8->x) + 1));


please try to avoid this +1 in your code. Use the x, y, z, and w components of the dst_f8->x.

AryanSalmanpour · 2021-09-07T17:08:45Z

src/include/hip/rpp_hip_common.hpp

+
+__device__ __forceinline__ void rpp_hip_load24_pkd3_and_unpack_to_float24_pln3(float *srcPtr, uint srcIdx, d_float24 *src_f24)
+{
+    src_f24->x.x.x = srcPtr[srcIdx];


this function needs to be rewritten. please don't access the srcPtr memory like the way you've written here by adding some offset. read a float4 each time and then use the x, y, z, w components to fill the src_f24.

AryanSalmanpour · 2021-09-07T17:11:26Z

src/include/hip/rpp_hip_common.hpp

+
+__device__ __forceinline__ void rpp_hip_load24_pkd3_and_unpack_to_float24_pln3(half *srcPtr, uint srcIdx, d_float24 *src_f24)
+{
+    src_f24->x.x.x = __half2float(srcPtr[srcIdx]);


use the same comments mentioned above for rpp_hip_load24_pkd3_and_unpack_to_float24_pln3 function to rewrite this function as well.

AryanSalmanpour · 2021-09-07T17:15:40Z

src/include/hip/rpp_hip_common.hpp

+    srcPtrG = srcPtrR + increment;
+    srcPtrB = srcPtrG + increment;
+
+    src_f24->x.x.x = *srcPtrR;


use vector float4 for reading srcPtr memory and use its components when filling the src_f24.

rrawther · 2021-09-07T21:36:02Z

include/rppdefs.h

+typedef enum
+{
+    rppStatusSuccess        = 0,
+    rppStatusBadParm        = 1,


it is better to give -ve values for error codes

rrawther

please redo the macros and fix codacy warnings

rrawther

@paveltc : Please run all the unit-tests before merge

r-abishek · 2021-09-17T00:00:59Z

@asalmanp The I8 images now match exactly with the F16/F32/U8 visually. Please let me know if you have any other thoughts too. We could now proceed for the final testing and merge @paveltc

AryanSalmanpour · 2021-09-17T01:55:19Z

src/modules/hip/kernel/brightness.hpp

+
+__device__ void brightness_hip_compute(signed char *srcPtr, d_float8 *src_f8, d_float8 *dst_f8, float4 *alpha_f4, float4 *beta_f4)
+{
+    dst_f8->x = rpp_hip_pixel_check((src_f8->x + 128) * *alpha_f4 + *beta_f4) - 128;


src_f8->x is a float4 so if your intention is to add each of the x/y/z/w with 128 then you need to use (float4)128 here.

similarly, you would need to use (float4)128 to subtract in the end of this line.

Thats true, but I believe it was doing an auto typecast to float4. In any case, I have added a manual typecast now to be on the safer side.

AryanSalmanpour · 2021-09-17T02:03:48Z

src/modules/hip/kernel/roi_conversion.hpp

+{
+    int id_x = (hipBlockIdx_x * hipBlockDim_x + hipThreadIdx_x) * 4;
+
+    roiTensorPtrSrc[id_x + 2] -= (roiTensorPtrSrc[id_x] - 1);


use int2 to read from the roiTensorPtrSrc instead of using +1

Thanks! I have changed it now. Also, here roiTensorPtrSrc has 0/1/2/3, so I have used int4.

AryanSalmanpour · 2021-09-17T02:06:15Z

src/modules/rppi_validate.hpp

+    return layoutParams;
+}
+
+// inline void copy_roiTensor(RpptROIPtr roiTensorPtrSrc, rpp::Handle& handle)


remove commented out codes if it is not needed.

Done. Cleaned up this rppi_validate file for just some formatting changes, tabs to spaces, and removal of unwanted commented code.

AryanSalmanpour · 2021-09-17T02:16:35Z

src/include/hip/rpp_hip_common.hpp

+
+__device__ __forceinline__ float rpp_hip_unpack0(int src)
+{
+    return (float)(signed char)(src & 0xFF);


why the casting to (signed char) is needed here?

Actually I found this to be the only way to do the unpack correctly. It takes src, gets the first byte by masking with 0xFF, then we explicitly tell the compiler to interpret the MSB as a signed bit, then convert to float. If I directly convert to float, the most significant bit was interpreted not as a sign, and as part of an 8 bit number.

AryanSalmanpour · 2021-09-17T02:40:16Z

@r-abishek please take a look at my latest comments and make the necessary changes.

rrawther · 2021-09-17T18:13:03Z

@paveltc please run rali and rpp unit-tests on this PR so we can merge it

paveltc · 2021-09-22T00:15:26Z

@rrawther @asalmanp This PR passes the unit tests.

r-abishek · 2021-09-23T20:34:59Z

@rrawther @asalmanp This PR passes the unit tests.

@asalmanp @kiritigowda Could we merge the PR#70 too? The unit tests have been verified.

rrawther and others added 30 commits June 18, 2021 17:33

add definitions for rpp tensor api

6f8af70

Merge branch 'rr/rpp_tensor_support' of https://github.com/rrawther/rpp…

5171fc8

… into ar/tensor_support

Initial commit

57ba698

Initial commit - pln1/pln3 tensor testsuite

1ade698

Mods for tensor test suite

77a2f37

Mods for brightness tensor host

0721b38

arrangementParams to layoutParams

0edd317

Rename to tensor_augmentations

abece7f

Fix tensor host test suites

e3d517f

Modify host tensor support for brightness

dbca6fd

Initial commit for tensor hip test suite

4dbfe11

Multiple of 8 stride option

d5b75eb

Add initial tensor support for hip

cd5a2da

Tensor test suite support for hip pln

9a51968

Fixes for GPU tensor support

200c43c

Add host ROI null check

7cb5421

Initial commit for perf tests

e80ff0f

Perf tests for RPP tensor support

f15ab98

Add gpu support for ltrb to xywh, remove roiType, fix pln3 brightness…

46deede

… methods

Remove method1 for pln3 gpu, keep method2

ef9b87a

Fix hip tensor unittests

be2db8a

Add support for fused layout conversion on host

8bd97ea

Add tensor unittest suite support for layout toggle

3f4cf82

Add tensor perf tests for host - initial commit

1733d04

Add tensor host test suite for perf tests

442fa21

Add support for NHWC-NCHW toggle in HIP

b6bf47b

Add test suite support for layout toggle

69b5b37

Reset hip unittests script

2ebbed3

Unroll pln3 kernel

b87cf38

Add initial multi-bitDepth host support, remove templates

a186a60

AryanSalmanpour requested changes Sep 7, 2021

View reviewed changes

rrawther reviewed Sep 7, 2021

View reviewed changes

rrawther requested changes Sep 7, 2021

View reviewed changes

r-abishek added 6 commits September 8, 2021 18:33

Change host to hip in folder name and help

f1a4794

Change error enums to negative

15f315d

Avoid pointer or index increment by collating loads

1285059

Use variadic funcitons and pack templating to handle loads/stores

cf6686a

Fix i8 blank image issue in hip

5a76332

Combine loads in f16/f32 and organize rpp_hip_common file

cea25d2

rrawther approved these changes Sep 14, 2021

View reviewed changes

r-abishek added 3 commits September 16, 2021 19:34

Fix I8 store issue - trials

870df2f

Fix I8 store issue

efd2338

Merge branch 'ar/tensor_support_i8' into ar/tensor_support

eb9b22c

AryanSalmanpour requested changes Sep 17, 2021

View reviewed changes

AryanSalmanpour reviewed Sep 17, 2021

View reviewed changes

Add manual typecast to float4

c258931

r-abishek added 2 commits September 16, 2021 22:42

Use int4 to read roiTensorPtrSrc

2297628

rppi_validate cleanup

4a4d428

AryanSalmanpour approved these changes Sep 17, 2021

View reviewed changes

Test suite build fix

7c73a6a

kiritigowda merged commit ac1907f into ROCm:master Sep 23, 2021

AryanSalmanpour mentioned this pull request Sep 27, 2021

Jenkins - MIVisionX HIP Backend on Ubuntu20 & CentOS 8 ROCm/MIVisionX#629

Merged

r-abishek deleted the ar/tensor_support branch October 12, 2021 01:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce tensor support #70

Introduce tensor support #70

r-abishek commented Aug 31, 2021

AryanSalmanpour Sep 7, 2021

r-abishek Sep 13, 2021

AryanSalmanpour Sep 7, 2021

r-abishek Sep 13, 2021

AryanSalmanpour Sep 7, 2021

r-abishek Sep 13, 2021

AryanSalmanpour Sep 7, 2021

r-abishek Sep 13, 2021

rrawther Sep 7, 2021

r-abishek Sep 13, 2021

rrawther left a comment

rrawther left a comment

r-abishek commented Sep 17, 2021

AryanSalmanpour Sep 17, 2021

r-abishek Sep 17, 2021

AryanSalmanpour Sep 17, 2021

r-abishek Sep 17, 2021

AryanSalmanpour Sep 17, 2021

r-abishek Sep 17, 2021

AryanSalmanpour Sep 17, 2021

r-abishek Sep 17, 2021

AryanSalmanpour commented Sep 17, 2021

rrawther commented Sep 17, 2021

paveltc commented Sep 22, 2021

r-abishek commented Sep 23, 2021

Introduce tensor support #70

Introduce tensor support #70

Conversation

r-abishek commented Aug 31, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rrawther left a comment

Choose a reason for hiding this comment

rrawther left a comment

Choose a reason for hiding this comment

r-abishek commented Sep 17, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

AryanSalmanpour commented Sep 17, 2021

rrawther commented Sep 17, 2021

paveltc commented Sep 22, 2021

r-abishek commented Sep 23, 2021