
make HostDeviceVector single gpu only #4773

Merged: 29 commits merged into dmlc:master from single-gpu-hdv on Aug 25, 2019

Conversation

rongou (Contributor) commented Aug 14, 2019

Closes #4531 ([RFC] Remove support for single process multi-GPU)
Part of the 1.0.0 roadmap (#4680)

Finally got the CI to pass. I can do more cleanups, but thought you guys should take a look first.

@RAMitchell @trivialfis @sriramch

@rongou rongou changed the title from "[WIP] make HostDeviceVector single gpu only" to "make HostDeviceVector single gpu only" on Aug 21, 2019
trivialfis (Member):

@rongou I haven't looked into this in detail yet, but it seems GPUDistribution can be removed too?

@trivialfis trivialfis self-requested a review August 21, 2019 06:31
rongou (Contributor, Author) commented Aug 21, 2019

@trivialfis Yes, there is a whole bunch of stuff that can be removed or cleaned up. I didn't want to make this bigger than it already is, but I'll do some of the more obvious ones.

trivialfis (Member):

@rongou Nice. Please ping me when it's ready.

rongou (Contributor, Author) commented Aug 22, 2019 via email

@@ -352,40 +310,45 @@ class GPUPredictor : public xgboost::Predictor {
InitModel(model, tree_begin, tree_end);

size_t batch_offset = 0;
auto* preds = out_preds;
std::unique_ptr<HostDeviceVector<bst_float>> batch_preds{nullptr};
Contributor:

Instead of the dual copy:

  • copying the input prediction vector into a temporary one based on the batch size, and
  • copying the output prediction vector from the kernel back into out_preds,

is it not possible to:

  • slice out_preds based on the batch size; basically, get a device span from out_preds and take a subspan based on the batch size and batch offset, and
  • pass the subspan to predict internal as a pass-through to the kernel? (A sketch of this idea follows below.)
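
A minimal sketch of the proposed subspan approach. The accessor names DeviceSpan() and subspan() are assumptions based on xgboost's HostDeviceVector and common::Span, and PredictAllBatches with its commented-out PredictInternal call is hypothetical, not the PR's actual code:

#include <algorithm>  // std::min
#include <cstddef>    // std::size_t

#include "xgboost/host_device_vector.h"  // HostDeviceVector, bst_float
#include "xgboost/span.h"                // common::Span

namespace xgboost {

// Hypothetical driver: run prediction batch by batch, writing through
// device-side views of out_preds instead of a temporary buffer.
void PredictAllBatches(HostDeviceVector<bst_float>* out_preds,
                       std::size_t batch_size) {
  // One device-side view of the output buffer; no temporary
  // HostDeviceVector and no copy back into out_preds are needed.
  common::Span<bst_float> d_preds = out_preds->DeviceSpan();  // assumed accessor
  for (std::size_t offset = 0; offset < d_preds.size(); offset += batch_size) {
    std::size_t n = std::min<std::size_t>(batch_size, d_preds.size() - offset);
    auto batch = d_preds.subspan(offset, n);  // a view, not a copy
    // PredictInternal(batch, ...);  // pass the subspan straight to the kernel
  }
}

}  // namespace xgboost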

rongou (Contributor, Author) commented Aug 24, 2019

That'll assume the whole prediction vector will be on the device, right? I think the current approach doesn't put the whole vector on the device in external-memory mode, although I haven't 100% verified it.

Contributor:

Even if the device copy is prevented here, it'll be copied right back (soon after this) in the objective function while computing the gradients, or later when the prediction cache is updated during training.
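
For context on this point: HostDeviceVector transfers its data lazily on first access from the other side, so skipping the device copy in the predictor only postpones it. A hedged illustration, where ComputeGradients is a stand-in for an objective function and ConstDeviceSpan() an assumed accessor name, not xgboost's actual signature:

#include "xgboost/host_device_vector.h"  // HostDeviceVector, bst_float

namespace xgboost {

// Stand-in for an objective's gradient computation.
void ComputeGradients(const HostDeviceVector<bst_float>& preds) {
  // If the predictions currently live on the host, this first
  // device-side access copies the whole vector to the GPU anyway.
  auto d_preds = preds.ConstDeviceSpan();  // assumed accessor
  // ... launch the gradient kernel over d_preds ...
}

}  // namespace xgboost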

rongou (Contributor, Author):

Sounds reasonable. I'll send a follow-up PR.

rongou (Contributor, Author) commented Aug 24, 2019

@RAMitchell @trivialfis @sriramch I think this is as far as this PR will go. Please take another look.

I still want to remove the sharding in updater_gpu_hist.cu, but it seems a bit more involved, so I'll probably leave it to a follow-up PR.

trivialfis (Member) left a comment

Awesome! This will allow us to focus more on the algorithms instead of data distribution.

@RAMitchell RAMitchell merged commit 38ab79f into dmlc:master Aug 25, 2019
@rongou rongou deleted the single-gpu-hdv branch September 19, 2019 23:09
@lock lock bot locked as resolved and limited conversation to collaborators Dec 19, 2019