faster transformer compile error with docker #51

ganguagua · 2019-07-18T12:16:47Z

image: nvidia/cuda 10.0-cudnn7-devel-ubuntu16.04 docker image
cmake -DSM=70 -DCMAKE_BUILD_TYPE=Release -DBUILD_TF=ON -DTF_PATH=/usr/lib/python2.7/site-packages/tensorflow .. output:
-- The CXX compiler identification is GNU 5.4.0
-- The CUDA compiler identification is NVIDIA 10.0.130
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Check for working CUDA compiler: /usr/local/cuda/bin/nvcc
-- Check for working CUDA compiler: /usr/local/cuda/bin/nvcc -- works
-- Detecting CUDA compiler ABI info
-- Detecting CUDA compiler ABI info - done
-- Looking for C++ include pthread.h
-- Looking for C++ include pthread.h - found
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Failed
-- Looking for pthread_create in pthreads
-- Looking for pthread_create in pthreads - not found
-- Looking for pthread_create in pthread
-- Looking for pthread_create in pthread - found
-- Found Threads: TRUE
-- Found CUDA: /usr/local/cuda (found suitable version "10.0", minimum required is "10.0")
-- Found CUDA: /usr/local/cuda (found version "10.0")
-- Assign GPU architecture (sm=70)
-- Configuring done
-- Generating done
-- Build files have been written to: /root/DeepLearningExamples/FasterTransformer/build

make output:
CMakeFiles/gemm_fp32.dir/gemm_fp32.cu.o: In function __sti____cudaRegisterAll()': tmpxft_0000054d_00000000-5_gemm_fp32.cudafe1.cpp:(.text.startup+0x15): undefined reference to __cudaRegisterLinkedBinary_44_tmpxft_0000054d_00000000_6_gemm_fp32_cpp1_ii_5cd8620e'
collect2: error: ld returned 1 exit status
tools/gemm_test/CMakeFiles/gemm_fp32.dir/build.make:83: recipe for target 'bin/gemm_fp32' failed
make[2]: *** [bin/gemm_fp32] Error 1
CMakeFiles/Makefile2:148: recipe for target 'tools/gemm_test/CMakeFiles/gemm_fp32.dir/all' failed
make[1]: *** [tools/gemm_test/CMakeFiles/gemm_fp32.dir/all] Error 2
Makefile:83: recipe for target 'all' failed
make: *** [all] Error 2

The text was updated successfully, but these errors were encountered:

IrishCoffee · 2019-07-19T09:12:55Z

It seems that there is something wrong with your environment. Could you please check your driver first?

ganguagua · 2019-07-19T09:41:32Z

It seems that there is something wrong with your environment. Could you please check your driver first?

my environment:
cmake version 3.15.0
gcc (GCC) 4.8.5 20150623 (Red Hat 4.8.5-36)
cuda 10.0
cudnn 7
Tensorflow-gpu 1.13.1

jackkosaian · 2019-07-19T22:56:14Z

I also receive this error on V100. My driver version is 418.56.

jackkosaian · 2019-07-19T23:16:14Z

I was able to resolve this by adding the following lines to the end of various CMakeLists:

tools/gemm_test/CMakeLists.txt

set_target_properties(gemm_fp32 PROPERTIES CUDA_RESOLVE_DEVICE_SYMBOLS ON)
set_target_properties(gemm_fp16 PROPERTIES CUDA_RESOLVE_DEVICE_SYMBOLS ON)

fastertransformer/cuda/CMakeLists.txt

set_target_properties(fastertransformer PROPERTIES CUDA_RESOLVE_DEVICE_SYMBOLS ON)

fastertransformer/tf_op/CMakeLists.txt

set_target_properties(tf_fastertransformer PROPERTIES CUDA_RESOLVE_DEVICE_SYMBOLS ON)

I found this solution here.

I'm not sure whether this is the "correct" solution to the problem, but I'm able to compile and run ./build/bin/gemm_fp16 with these changes.

ganguagua · 2019-07-20T08:52:00Z

I was able to resolve this by adding the following lines to the end of various CMakeLists:
tools/gemm_test/CMakeLists.txt
set_target_properties(gemm_fp32 PROPERTIES CUDA_RESOLVE_DEVICE_SYMBOLS ON)
set_target_properties(gemm_fp16 PROPERTIES CUDA_RESOLVE_DEVICE_SYMBOLS ON)

fastertransformer/cuda/CMakeLists.txt
set_target_properties(fastertransformer PROPERTIES CUDA_RESOLVE_DEVICE_SYMBOLS ON)

fastertransformer/tf_op/CMakeLists.txt
set_target_properties(tf_fastertransformer PROPERTIES CUDA_RESOLVE_DEVICE_SYMBOLS ON)

I found this solution here.
I'm not sure whether this is the "correct" solution to the problem, but I'm able to compile and run ./build/bin/gemm_fp16 with these changes.

It works, thanks very much!

nvpstr closed this as completed Jul 30, 2019

byshiue transferred this issue from NVIDIA/DeepLearningExamples Apr 5, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

faster transformer compile error with docker #51

faster transformer compile error with docker #51

ganguagua commented Jul 18, 2019

IrishCoffee commented Jul 19, 2019

ganguagua commented Jul 19, 2019

jackkosaian commented Jul 19, 2019

jackkosaian commented Jul 19, 2019

ganguagua commented Jul 20, 2019

faster transformer compile error with docker #51

faster transformer compile error with docker #51

Comments

ganguagua commented Jul 18, 2019

IrishCoffee commented Jul 19, 2019

ganguagua commented Jul 19, 2019

jackkosaian commented Jul 19, 2019

jackkosaian commented Jul 19, 2019

ganguagua commented Jul 20, 2019