Fails: compile_nvidia: note: building ggml-cuda with nvcc -arch=native #562

skome · 2024-09-09T08:56:32Z

skome
Sep 9, 2024

linux, nvidia, CUDA toolkit installed (and works)
llamafile -t 12 -ngl 35 (etc) yields:
compile_nvidia: note: building ggml-cuda with nvcc -arch=native...
llamafile_log_command: /usr/bin/nvcc -arch=native --shared --forward-unknown-to-host-compiler -use_fast_math --compiler-options "-fPIC -O3 -march=native -mtune=native" -DNDEBUG -DGGML_BUILD=1 -DGGML_SHARED=1 -DGGML_CUDA_MMV_Y=1 -DGGML_MULTIPLATFORM -DGGML_CUDA_DMMV_X=32 -DK_QUANTS_PER_ITERATION=2 -DGGML_CUDA_PEER_MAX_BATCH_SIZE=128 -DGGML_USE_CUBLAS -o /home/sam/.llamafile/ggml-cuda.so.8c95tc /home/sam/.llamafile/ggml-cuda.cu -lcublas -lcuda
nvcc fatal : Value 'native' is not defined for option 'gpu-architecture'
Compile: warning: /usr/bin/nvcc returned nonzero exit status
get_nvcc_arch_flag: note: building nvidia compute capability detector
^^^^^^^^ that last line sounds promising, meanwhile I would like to know how to change "native" to "compute_86"
Do I need to compile from source?

ETA: recompiled, same errors. Re-built NVIDIA/CUDA drivers (from their website) and everything works.

invisiblepancake · 2024-09-24T06:17:50Z

invisiblepancake
Sep 24, 2024

Im on unbutu debian. Make some remarkable by my know what =)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fails: compile_nvidia: note: building ggml-cuda with nvcc -arch=native #562

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

Select a reply

Fails: compile_nvidia: note: building ggml-cuda with nvcc -arch=native #562

skome Sep 9, 2024

Replies: 1 comment

invisiblepancake Sep 24, 2024

skome
Sep 9, 2024

invisiblepancake
Sep 24, 2024