-
Notifications
You must be signed in to change notification settings - Fork 891
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
faster transformer compile error with docker #51
Comments
It seems that there is something wrong with your environment. Could you please check your driver first? |
my environment: |
I also receive this error on V100. My driver version is 418.56. |
I was able to resolve this by adding the following lines to the end of various CMakeLists: tools/gemm_test/CMakeLists.txt
fastertransformer/cuda/CMakeLists.txt
fastertransformer/tf_op/CMakeLists.txt
I found this solution here. I'm not sure whether this is the "correct" solution to the problem, but I'm able to compile and run |
It works, thanks very much! |
image: nvidia/cuda 10.0-cudnn7-devel-ubuntu16.04 docker image
cmake -DSM=70 -DCMAKE_BUILD_TYPE=Release -DBUILD_TF=ON -DTF_PATH=/usr/lib/python2.7/site-packages/tensorflow .. output:
-- The CXX compiler identification is GNU 5.4.0
-- The CUDA compiler identification is NVIDIA 10.0.130
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Check for working CUDA compiler: /usr/local/cuda/bin/nvcc
-- Check for working CUDA compiler: /usr/local/cuda/bin/nvcc -- works
-- Detecting CUDA compiler ABI info
-- Detecting CUDA compiler ABI info - done
-- Looking for C++ include pthread.h
-- Looking for C++ include pthread.h - found
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Failed
-- Looking for pthread_create in pthreads
-- Looking for pthread_create in pthreads - not found
-- Looking for pthread_create in pthread
-- Looking for pthread_create in pthread - found
-- Found Threads: TRUE
-- Found CUDA: /usr/local/cuda (found suitable version "10.0", minimum required is "10.0")
-- Found CUDA: /usr/local/cuda (found version "10.0")
-- Assign GPU architecture (sm=70)
-- Configuring done
-- Generating done
-- Build files have been written to: /root/DeepLearningExamples/FasterTransformer/build
make output:
CMakeFiles/gemm_fp32.dir/gemm_fp32.cu.o: In function
__sti____cudaRegisterAll()': tmpxft_0000054d_00000000-5_gemm_fp32.cudafe1.cpp:(.text.startup+0x15): undefined reference to
__cudaRegisterLinkedBinary_44_tmpxft_0000054d_00000000_6_gemm_fp32_cpp1_ii_5cd8620e'collect2: error: ld returned 1 exit status
tools/gemm_test/CMakeFiles/gemm_fp32.dir/build.make:83: recipe for target 'bin/gemm_fp32' failed
make[2]: *** [bin/gemm_fp32] Error 1
CMakeFiles/Makefile2:148: recipe for target 'tools/gemm_test/CMakeFiles/gemm_fp32.dir/all' failed
make[1]: *** [tools/gemm_test/CMakeFiles/gemm_fp32.dir/all] Error 2
Makefile:83: recipe for target 'all' failed
make: *** [all] Error 2
The text was updated successfully, but these errors were encountered: