Fails to build against CUDA9 #492
Comments
I had the same problem when building against CUDA 10.1. I was able to compile using your patch, but could it cause other problems at runtime? That is what I am really worried about.
ptxas warning : Value of threads per SM for entry Z24THNN_CudaHalfLSTMForwardI6__halfmLin1EEv10TensorInfoIT_T0_ES4_S4_S4_S4_S4_S4_S3_S3 is out of range. .minnctapersm will be ignored
/tmp/luarocks_cunn-scm-1-5240/cunn/lib/THCUNN/LookupTable.cu(49): warning: function "__any"
[ 28%] Building NVCC (Device) object lib/THCUNN/CMakeFiles/THCUNN.dir/THCUNN_generated_MSECriterion.cu.o
lib/THCUNN/CMakeFiles/THCUNN.dir/build.make:175: recipe for target 'lib/THCUNN/CMakeFiles/THCUNN.dir/THCUNN_generated_LookupTable.cu.o' failed
See torch/issues/492 for reference
It looks like CUDA9 deprecates __shfl and __any. I was able to compile using the following quick-and-dirty patch:
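The patch itself is not preserved in this copy of the issue. As a rough sketch of the kind of change involved (not the author's patch, and not code from cunn), CUDA 9 introduced `__shfl_sync` and `__any_sync`, which take an explicit lane mask, so one common approach is a pair of compatibility macros guarded by `CUDART_VERSION`. The macro names `WARP_SHFL`, `WARP_ANY`, and `FULL_MASK` and the `demo` kernel below are hypothetical, chosen only for illustration.

```cuda
#include <cstdio>
#include <cuda_runtime.h>  // defines CUDART_VERSION

// Hypothetical compatibility macros; the names are illustrative only.
#if defined(CUDART_VERSION) && CUDART_VERSION >= 9000
// CUDA 9+: the *_sync variants take a mask of participating lanes.
#define FULL_MASK 0xffffffffu
#define WARP_SHFL(var, lane) __shfl_sync(FULL_MASK, (var), (lane))
#define WARP_ANY(pred)       __any_sync(FULL_MASK, (pred))
#else
// Older toolkits: fall back to the legacy intrinsics.
#define WARP_SHFL(var, lane) __shfl((var), (lane))
#define WARP_ANY(pred)       __any((pred))
#endif

// Toy kernel: one full warp, no divergence before the warp intrinsics.
__global__ void demo(int *out) {
    int lane = threadIdx.x % 32;
    int from_lane5 = WARP_SHFL(lane * 2, 5);  // every lane reads lane 5's value
    int any_hit = WARP_ANY(lane == 7) != 0;   // true if any lane's predicate holds
    out[threadIdx.x] = from_lane5 + any_hit;
}

int main() {
    int h_out[32];
    int *d_out = nullptr;
    cudaMalloc(&d_out, sizeof(h_out));
    demo<<<1, 32>>>(d_out);
    cudaMemcpy(h_out, d_out, sizeof(h_out), cudaMemcpyDeviceToHost);
    printf("lane 0 result: %d\n", h_out[0]);
    cudaFree(d_out);
    return 0;
}
```

On the runtime question: a full mask is only appropriate when all 32 lanes of the warp reach the call together. If a kernel calls these intrinsics from divergent code, the mask should name just the lanes that actually participate, so each call site in THCUNN would need to be checked rather than assumed safe.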