This repository has been archived by the owner on Mar 21, 2024. It is now read-only.

Remove policies for arch < sm35. #213

Merged 1 commit on Oct 30, 2020


@alliepiper (Collaborator) commented Oct 14, 2020

Goal is to clean up code and reduce build time:

Before:

| Target | Build Time (Wall) | Build Time (CPU seconds) |
|---|---|---|
| thrust.cpp.cuda.cpp17.all | 18:36.33 | 12414 |
| cub.cpp17.all | 22:58.27 | 4872 |

After:

| Target | Build Time (Wall) | Build Time (CPU seconds) |
|---|---|---|
| thrust.cpp.cuda.cpp17.all | 18:18.72 | 12174 |
| cub.cpp17.all | 13:53.14 | 3943 |

Thrust isn't significantly affected, as it spends most of its build time on non-CUB algorithms (the set operation and minmax tests in particular). We can probably bring that down by removing old SM policies in a follow-up patch, though Thrust currently appears to go down only to sm30, so there are fewer policies to take out.

CUB's wall-clock compile time is reduced by nearly 40%. Comparing CPU time to wall time (the build used 12 cores) shows there is still a lot of room for improvement: one or two tests take a very long time to compile, so the build cannot keep all cores busy. We can improve this by splitting up those tests so they can be compiled in parallel.
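As a rough check of the figures above, the sketch below recomputes the wall-time reduction and the average number of busy cores (CPU seconds divided by wall seconds) for each target. It assumes the wall times are in `MM:SS.ss` form; the function and variable names are illustrative only and are not part of Thrust or CUB.

```python
def wall_seconds(stamp: str) -> float:
    """Convert an 'MM:SS.ss' wall-clock string to seconds (format is assumed)."""
    minutes, seconds = stamp.split(":")
    return int(minutes) * 60 + float(seconds)


# (wall time, CPU seconds) copied from the Before/After tables.
BUILDS = {
    "thrust.cpp.cuda.cpp17.all": {
        "before": ("18:36.33", 12414),
        "after": ("18:18.72", 12174),
    },
    "cub.cpp17.all": {
        "before": ("22:58.27", 4872),
        "after": ("13:53.14", 3943),
    },
}


def wall_reduction_pct(target: str) -> float:
    """Percentage drop in wall time from 'before' to 'after'."""
    before = wall_seconds(BUILDS[target]["before"][0])
    after = wall_seconds(BUILDS[target]["after"][0])
    return 100.0 * (before - after) / before


def busy_cores(target: str, phase: str) -> float:
    """Average number of cores kept busy: CPU seconds / wall seconds."""
    wall, cpu = BUILDS[target][phase]
    return cpu / wall_seconds(wall)


if __name__ == "__main__":
    for name in BUILDS:
        print(
            f"{name}: wall time down {wall_reduction_pct(name):.1f}%, "
            f"busy cores {busy_cores(name, 'before'):.1f} -> "
            f"{busy_cores(name, 'after'):.1f} (of 12)"
        )
```

A busy-core figure well below 12 for a target suggests that a few long-running compilations dominate its wall time, which is the case splitting up the slow tests is meant to address.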

@alliepiper (Collaborator, Author) commented:

DVS CL 29199322

@alliepiper added the `testing: internal ci in progress` label (Currently testing on internal NVIDIA CI (DVS)) on Oct 14, 2020
@alliepiper assigned alliepiper and unassigned brycelelbach and griwes on Oct 14, 2020
@alliepiper removed the `testing: internal ci in progress` label (Currently testing on internal NVIDIA CI (DVS)) on Oct 19, 2020
@alliepiper force-pushed the enh/remove_old_sm_archs branch from 8b3b747 to fc38233 on October 29, 2020 17:34
@alliepiper (Collaborator, Author) commented:

Starting DVS on CL 29260540.

🤞

@alliepiper added the `testing: internal ci in progress` label (Currently testing on internal NVIDIA CI (DVS)) on Oct 29, 2020
@alliepiper (Collaborator, Author) commented:

No timeouts 🎉

@alliepiper merged commit 200cf19 into NVIDIA:main on Oct 30, 2020
Labels: `testing: gpuCI passed` (Passed gpuCI testing), `testing: internal ci in progress` (Currently testing on internal NVIDIA CI (DVS))
3 participants