We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I'm running Windows 10, CUDA 11.6 and Microsoft Visual Studio 2022
After successfully setting up the conda environment and installing the requirements I ran:
python setup.py install
It is failing to compile custom quantization kernel
"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.6\bin\nvcc" -c llmtune/engine/quant/cuda/quant_cuda_kernel.cu -o build\temp.win-amd64-cpython-39\Release\llmtune/engine/quant/cuda/quant_cuda_kernel.obj -IF:\2022anaconda\envs\llmtune2\lib\site-packages\torch\include -IF:\2022anaconda\envs\llmtune2\lib\site-packages\torch\include\torch\csrc\api\include -IF:\2022anaconda\envs\llmtune2\lib\site-packages\torch\include\TH -IF:\2022anaconda\envs\llmtune2\lib\site-packages\torch\include\THC "-IC:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.6\include" -IF:\2022anaconda\envs\llmtune2\include -IF:\2022anaconda\envs\llmtune2\Include "-IC:\Program Files\Microsoft Visual Studio\2022\Community\VC\Tools\MSVC\14.33.31629\include" "-IC:\Program Files\Microsoft Visual Studio\2022\Community\VC\Auxiliary\VS\include" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.22000.0\ucrt" "-IC:\Program Files (x86)\Windows Kits\10\\include\10.0.22000.0\\um" "-IC:\Program Files (x86)\Windows Kits\10\\include\10.0.22000.0\\shared" "-IC:\Program Files (x86)\Windows Kits\10\\include\10.0.22000.0\\winrt" "-IC:\Program Files (x86)\Windows Kits\10\\include\10.0.22000.0\\cppwinrt" -Xcudafe --diag_suppress=dll_interface_conflict_dllexport_assumed -Xcudafe --diag_suppress=dll_interface_conflict_none_assumed -Xcudafe --diag_suppress=field_without_dll_interface -Xcudafe --diag_suppress=base_class_has_different_dll_interface -Xcompiler /EHsc -Xcompiler /wd4190 -Xcompiler /wd4018 -Xcompiler /wd4275 -Xcompiler /wd4267 -Xcompiler /wd4244 -Xcompiler /wd4251 -Xcompiler /wd4819 -Xcompiler /MD -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr -DTORCH_API_INCLUDE_EXTENSION_H -DTORCH_EXTENSION_NAME=quant_cuda -D_GLIBCXX_USE_CXX11_ABI=0 -gencode=arch=compute_86,code=compute_86 -gencode=arch=compute_86,code=sm_86 --use-local-env quant_cuda_kernel.cu F:/2022anaconda/envs/llmtune2/lib/site-packages/torch/include\c10/macros/Macros.h(138): warning C4067: unexpected tokens following preprocessor directive - expected a newline F:\2022anaconda\envs\llmtune2\lib\site-packages\torch\include\pybind11\cast.h(624): error: too few arguments for template template parameter "Tuple" detected during instantiation of class "pybind11::detail::tuple_caster<Tuple, Ts...> [with Tuple=std::pair, Ts=<T1, T2>]" (721): here F:\2022anaconda\envs\llmtune2\lib\site-packages\torch\include\pybind11\cast.h(717): error: too few arguments for template template parameter "Tuple" detected during instantiation of class "pybind11::detail::tuple_caster<Tuple, Ts...> [with Tuple=std::pair, Ts=<T1, T2>]" (721): here
Any thoughts on how to resolve?
The text was updated successfully, but these errors were encountered:
No branches or pull requests
I'm running Windows 10, CUDA 11.6 and Microsoft Visual Studio 2022
After successfully setting up the conda environment and installing the requirements I ran:
python setup.py install
It is failing to compile custom quantization kernel
Any thoughts on how to resolve?
The text was updated successfully, but these errors were encountered: