Xeon optimizations #2914

sanchit-misra · 2021-05-15T07:15:21Z

Description

This PR contains the single socket optimizations of SpMM for Xeon as mentioned in our DistGNN paper: https://arxiv.org/abs/2104.06700

Checklist

Please feel free to remove inapplicable items for your PR.

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
To the my best knowledge, examples get faster or equal performance and accuracy is not affected.

Changes

Provided Xeon optimized implementations of SpMMSumCsr() and SpMMCmpCsr(). We have observed up to 4.4x speedup on the SpMM kernel without change in accuracy.

Suggestion for best performance

The optimizations can achieve better performance for dense full graphs (like Reddit) if the neighbors for each node in CSR matrix (indices) are sorted.

…d/max reduction operators in spmm

…/dgl into xeon-optimizations

dgl-bot · 2021-05-15T07:15:52Z

To trigger regression tests:

@dgl-bot run [instance-type] [which tests] [compare-with-branch];
For example: @dgl-bot run g4dn.4xlarge all dmlc/master or @dgl-bot run c5.9xlarge kernel,api dmlc/master

updated env.sh

…f distgnn.

jermainewang · 2021-08-25T08:23:59Z

Look like the PR has conflicts with the previous efforts of adding libxsmm into DGL. I will close this one. Please rebase and create a new PR. THanks.

sanchit-misra and others added 18 commits April 21, 2021 04:39

graphsage single socket optimizations

4534367

Added version of SAGEConv that uses optimized MLP cell

52ef1a1

Pulled down update to submodule libxsmm

30c609c

Added optimized implementations for unary/binary operators and sum/ad…

5d1c45a

…d/max reduction operators in spmm

Merge branch 'xeon-optimizations' of https://github.com/sanchit-misra…

5e59d51

…/dgl into xeon-optimizations

enabled USE_LIBXSMM macro

9d80eb0

Merge branch 'xeon-optimizations' of https://github.com/sanchit-misra…

7446cae

…/dgl into xeon-optimizations

Pulled down update to libxsmm

c14a572

Removed sageconv_opt from this branch

3d9f63a

Added libxsmm support for cmp redop

77d1e72

updated libxsmm for cmp redop

061efc1

Removed manual vectorization

b97244c

Merge branch 'dmlc:master' into xeon-optimizations

a111e4b

Code cleanup

aa0a303

Merge branch 'dmlc:master' into xeon-optimizations

4935d63

Merge branch 'dmlc:master' into xeon-optimizations

b8d115d

Minor update

8d5eb88

Merge branch 'dmlc:master' into xeon-optimizations

93984b7

yuk12 added 11 commits June 2, 2021 21:10

added distgnn

54b5587

update to distgnn

9ad9f01

added graphsage applition to ogbn-products and proteins datasets

3f1f061

updated readme

f776c13

modificaiton in sageconv.py

c4d98dd

adding all installations script and env setting script

70f1235

adding all installations script and env setting script

472196d

updated env.sh

updated env.sh

b4da7b0

added Torch_DIR for torch_ccl in setup_env.sh

8b27954

added distgnn readme, contains instructions on installation and use o…

0347a50

…f distgnn.

dependency install script

5978070

yuk12 added 6 commits August 4, 2021 21:52

update to install script

f9ad8d4

added dist exeuction script for distgnn dist runs

bc263ea

fixed partition script

b8aa727

added dgl commit setup_env.sh

c8f7303

updated dist scripts

7558ffa

updated setup script

cfb73e2

jermainewang closed this Aug 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xeon optimizations #2914

Xeon optimizations #2914

sanchit-misra commented May 15, 2021

dgl-bot commented May 15, 2021

jermainewang commented Aug 25, 2021

Xeon optimizations #2914

Xeon optimizations #2914

Conversation

sanchit-misra commented May 15, 2021

Description

Checklist

Changes

Suggestion for best performance

dgl-bot commented May 15, 2021

jermainewang commented Aug 25, 2021