Add large tensor support binary arithmetic #15785

ChaiBapchya · 2019-08-07T22:58:18Z

Description

Added 11 binary arithmetic operators - add, sub, rsub, neg, mul, div, rdiv, mod, rmod, imod, pow

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Code is well-documented:
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Design choice

Choice of operator to call

I had a choice between three ways of calling each operator:
__op__ vs op_symbol vs mx.nd.op

In the simple case all three are equivalent: x.__add__(y) <=> x+y <=> mx.nd.add(x, y)

However, the function names and argument order do not line up as neatly in cases like these:

mod : x.__mod__(y) <=> x%y <=> mx.nd.modulo(x, y)
rmod: x.__rmod__(y) <=> y%x <=> mx.nd.modulo(y, x)

I chose to stick with __op__ so that every operator is called in a consistent way.
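The equivalence above, including the operand swap for the reflected operators, can be illustrated with plain Python integers standing in for NDArrays. This is only a sketch: the helper name `dunder_vs_symbol` is hypothetical, and the tests in this PR call these methods on MXNet arrays rather than on ints.

```python
def dunder_vs_symbol(x, y):
    """Pair each dunder call with the infix expression it corresponds to.

    Plain ints stand in for NDArrays here (an assumption for illustration).
    """
    return {
        "add":  (x.__add__(y),  x + y),
        "sub":  (x.__sub__(y),  x - y),
        "rsub": (x.__rsub__(y), y - x),   # reflected: operands swap
        "mul":  (x.__mul__(y),  x * y),
        "mod":  (x.__mod__(y),  x % y),
        "rmod": (x.__rmod__(y), y % x),   # reflected: operands swap
        "pow":  (x.__pow__(y),  x ** y),
    }


pairs = dunder_vs_symbol(7, 3)
for name, (via_dunder, via_symbol) in pairs.items():
    # Each dunder call should agree with its infix form.
    assert via_dunder == via_symbol, name
```

Calling the dunder method directly keeps one naming scheme across all operators, so the reflected variants need no special-casing of argument order.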

Choice 2

I chose to have a separate function for each operator because
a. Separate operators are easier to debug and test
b. There is no exact 1-to-1 correspondence between them
For example, the divide ops are different:
__div__ in MXNet vs __truediv__
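The divide mismatch comes from Python itself: under Python 3 the `/` operator dispatches to `__truediv__`, while `__div__` is the Python 2 hook. A minimal sketch, using a hypothetical `DivProbe` class (not part of MXNet) that records which hook gets called:

```python
class DivProbe:
    """Hypothetical stand-in that records which division hook Python calls."""

    def __init__(self, value):
        self.value = value

    def __div__(self, other):
        # Python 2 hook for `/`; Python 3 never calls it implicitly.
        return "__div__", self.value / other

    def __truediv__(self, other):
        # Python 3 hook for `/`.
        return "__truediv__", self.value / other


hook, result = DivProbe(6) / 3
explicit_hook, _ = DivProbe(6).__div__(3)  # only reachable by an explicit call
```

A test that calls `__div__` explicitly therefore exercises a different entry point than one that writes `x / y`, which is one reason a single generic test for all operators does not fit cleanly.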

create_2d_tensor vs nd.ones

Chose nd.ones for performance reasons.
After monitoring multiple runs of test_large_array, I found that running the entire file would crash with an out-of-memory error,
even on a 480 GB machine (p3.16xl) dedicated to this one task.

Reworked the code to ensure that
a. variables are reused
b. the in-house MXNet function (mx.nd.ones) is used over the previous method (create_2d_tensor uses a combination of functions from numpy and mxnet)
c. arange is dropped, since it is not really needed to test whether the function works for a large tensor
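A back-of-the-envelope estimate shows why reusing variables matters. The shape below and the float32 element size are assumptions made for illustration (the PR text does not state them), and `tensor_footprint` is a hypothetical helper, not an MXNet function.

```python
INT32_MAX = 2**31 - 1  # largest element count addressable with 32-bit indices


def tensor_footprint(shape, itemsize=4):
    """Return (element count, bytes) for a dense tensor.

    itemsize=4 assumes float32 elements.
    """
    count = 1
    for dim in shape:
        count *= dim
    return count, count * itemsize


# ASSUMED shape, chosen only so the element count exceeds the int32 range.
ASSUMED_SHAPE = (100_000_000, 50)

count, nbytes = tensor_footprint(ASSUMED_SHAPE)
needs_int64_indexing = count > INT32_MAX

# A binary op keeps two inputs and one output alive at the same time.
live_gib = 3 * nbytes / 2**30
```

Under these assumptions each array takes roughly 20 GB, so every extra array a test keeps alive costs another 20 GB. Allocating fresh inputs per operator instead of reusing them adds up quickly across a whole test file.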

apeforest

LGTM. Thanks for the quick action.

tests/nightly/test_large_array.py

* test rdiv
* floating_point exception handle
* add 10 other ops
* added rpow and made numpy consistent
* attempt to solve memory issue
* linting fix
* Trigger notification
* lint

ChaiBapchya added 3 commits August 7, 2019 10:37

test rdiv

c267493

floating_point exception handle

f36d641

add 10 other ops

c070b8d

apeforest approved these changes Aug 7, 2019

apeforest reviewed Aug 7, 2019

tests/nightly/test_large_array.py (outdated)

apeforest reviewed Aug 7, 2019

tests/nightly/test_large_array.py (outdated)

added rpow and made numpy consistent

7f1c2b5

ChaiBapchya force-pushed the lts_binary_arithmetic branch from 0d00648 to a151181 Compare August 8, 2019 04:43

attempt to solve memory issue

a151181

apeforest reviewed Aug 8, 2019

tests/nightly/test_large_array.py

apeforest reviewed Aug 8, 2019

tests/nightly/test_large_array.py

ChaiBapchya added 4 commits August 8, 2019 18:12

linting fix

c612176

Trigger notification

09aa924

lint

d26801e

Merge branch 'master' into lts_binary_arithmetic

eb8d2ab

ChaiBapchya mentioned this pull request Aug 13, 2019

[CI] unix cpu validation Timeout #15880

Open

apeforest merged commit 11ce2a2 into apache:master Aug 13, 2019

access2rohit approved these changes Aug 13, 2019

ChaiBapchya deleted the lts_binary_arithmetic branch August 14, 2019 16:20
