Remove unneeded PTX code generation from libfaiss builds.
The CMake CUDA Architecture value of `60` means to generate
both PTX and SASS for that arch. We only need SASS for the
architectures we support, and one PTX version for future
hardware versions.

So now we build only SASS for each supported arch (e.g. `60-real`) and
keep PTX for 80 as the baseline for newer archs like 90.

By removing this unneeded PTX code we can reduce
the libfaiss.a binary to 305MB from the current 484MB.
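
To make the difference between the two architecture lists concrete, here is a minimal Python sketch of how a CMake-style list could be expanded into `nvcc` `--generate-code` flags. It is an illustration only: the helper names `gencode_flags` and `ptx_archs` are hypothetical, and the flag layout is an assumption that approximates what CMake emits for the NVIDIA compiler, not a copy of its implementation.

```python
# Sketch (assumption): approximate how CMake turns CUDA architecture entries
# into nvcc --generate-code flags. Helper names are hypothetical.

OLD_ARCHS = "60;61;70;72;75;80;86"
NEW_ARCHS = "60-real;61-real;62-real;70-real;72-real;75-real;80;86-real"


def gencode_flags(cuda_archs: str) -> list[str]:
    """Expand a semicolon-separated architecture list into nvcc flags."""
    flags = []
    for entry in cuda_archs.split(";"):
        if not entry:
            continue  # tolerate stray separators
        arch, _, kind = entry.partition("-")
        if kind == "real":
            targets = [f"sm_{arch}"]                      # SASS only
        elif kind == "virtual":
            targets = [f"compute_{arch}"]                 # PTX only
        else:
            targets = [f"compute_{arch}", f"sm_{arch}"]   # PTX and SASS
        flags.append(
            f"--generate-code=arch=compute_{arch},code=[{','.join(targets)}]"
        )
    return flags


def ptx_archs(cuda_archs: str) -> list[str]:
    """Return the architectures for which PTX would be embedded."""
    result = []
    for entry in cuda_archs.split(";"):
        if not entry:
            continue
        arch, _, kind = entry.partition("-")
        if kind != "real":  # a bare number or "-virtual" keeps PTX
            result.append(arch)
    return result


if __name__ == "__main__":
    print("PTX before:", ptx_archs(OLD_ARCHS))
    print("PTX after: ", ptx_archs(NEW_ARCHS))
    for flag in gencode_flags(NEW_ARCHS):
        print(flag)
```

Under these assumptions, the old list embeds PTX for all seven architectures, while the new list embeds it only for 80, which is where the size reduction comes from.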
robertmaynard committed Oct 3, 2023
1 parent 834c543 commit 238d610
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions .circleci/config.yml
@@ -350,7 +350,7 @@ workflows:
exec: linux-x86_64-gpu
label: main
cuda: "11.4"
- cuda_archs: "60;61;70;72;75;80;86"
+ cuda_archs: "60-real;61-real;62-real;70-real;72-real;75-real;80;86-real"
compiler_version: "11.2"
filters:
tags:
@@ -363,7 +363,7 @@ workflows:
label: main
raft: "ON"
cuda: "11.4"
- cuda_archs: "60;61;70;72;75;80;86"
+ cuda_archs: "60-real;61-real;62-real;70-real;72-real;75-real;80;86-real"
compiler_version: "11.2"
filters:
tags:
@@ -415,15 +415,15 @@ workflows:
name: Linux x86_64 GPU nightlies (CUDA 11.4)
exec: linux-x86_64-gpu
cuda: "11.4"
- cuda_archs: "60;61;70;72;75;80;86"
+ cuda_archs: "60-real;61-real;62-real;70-real;72-real;75-real;80;86-real"
compiler_version: "11.2"
label: nightly
- build_conda:
name: Linux x86_64 GPU w/ RAFT nightlies (CUDA 11.4)
exec: linux-x86_64-gpu
raft: "ON"
cuda: "11.4"
- cuda_archs: "60;61;70;72;75;80;86"
+ cuda_archs: "60-real;61-real;62-real;70-real;72-real;75-real;80;86-real"
compiler_version: "11.2"
label: nightly
- build_conda: