GPU Operator with RHEL8/SElinux: Driver Container failed to deploy if SELinux Enforcing mode is activated. Error message: modprobe: ERROR: could not insert 'nvidia': Permission denied #553
Comments
We are also facing the same issue: with SELinux in Enforcing mode, the gpu-operator driver-daemon pod fails because it cannot install the nvidia module and gets a "Permission denied" error.
The Kubernetes cluster was created with kubeadm (1.27.0). Setup details follow:
Steps to reproduce the issue:
Used helm to install gpu-operator
custom-values passed to gpu-operator
GPU Operator driver pod log after successful installation (with SELinux disabled)
GPU operator driver pod failed with error (SELinux enforcing)
GPU host audit logs (grep "avc" /var/log/audit/audit*) (SELinux enforcing)
GPU host : cat /etc/containerd/config.toml
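The audit-log check listed above greps for every AVC record, which can be noisy on a busy host. A minimal sketch of a narrower filter follows, assuming the denial behind the `modprobe` failure is recorded with the `module_load` permission (this thread does not confirm that). The helper name `module_load_denials` is hypothetical.

```shell
# Hypothetical helper: list AVC denials for kernel module loading in an audit log.
# $1: path to an audit log file (for example /var/log/audit/audit.log)
module_load_denials() {
  # Keep only AVC records, then only denials, then only the module_load permission.
  grep 'avc:' "$1" | grep 'denied' | grep 'module_load'
}
```

Usage on the GPU host would look like `module_load_denials /var/log/audit/audit.log`; the `scontext` and `tcontext` fields of any matching record show which process type was denied access to which file type.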
After changing the label of
@shivamerla - We tried the solution on our RHEL8.8 setup by referring to your private test image. You have added We did some patching to understand the issue in our setup. We could see the context type of compiled modules still showing
I think these are copied from source Then, we tried with
So can we provide the fix with
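
For readers who want to check the context type that this comment refers to, here is a minimal sketch that pulls the type field out of a full SELinux context string (`user:role:type:level`). The helper name `selinux_type_of` and the example context are illustrative assumptions, not values taken from this thread.

```shell
# Hypothetical helper: print the type field of an SELinux context string.
# $1: full context, e.g. as printed by `ls -Z` or `stat -c '%C'`
selinux_type_of() {
  # Fields are colon-separated as user:role:type:level; the type is the third.
  printf '%s\n' "$1" | cut -d: -f3
}

# Example with an illustrative context string (not from the thread):
selinux_type_of 'system_u:object_r:modules_object_t:s0'
```

On a host with GNU coreutils, the context of a compiled module could be passed in with `selinux_type_of "$(stat -c '%C' /path/to/module.ko)"`, where the path is a placeholder.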
@anoopsinghnegi I have updated the image; please re-pull and verify with
@shivamerla, it's working: the driver loaded successfully with SELinux enforcing using image
@shivamerla - any update on this issue? Even the latest version of gpu-operator, v23.9.1, is failing with SELinux enforcing.
Steps to reproduce the issue:
1/ Set the RHEL 8 server in SELinux ENFORCING mode:
2/ Install GPU Operator
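Before running the steps above, it can help to confirm which SELinux mode the host is actually in. The following is a minimal sketch that reads the selinuxfs `enforce` file. The helper name `selinux_mode` is hypothetical, and the default path assumes selinuxfs is mounted at `/sys/fs/selinux`.

```shell
# Hypothetical helper: report the SELinux mode from the selinuxfs "enforce" file.
# $1 (optional): path to the enforce file; defaults to the usual selinuxfs location.
selinux_mode() {
  enforce_file="${1:-/sys/fs/selinux/enforce}"
  if [ ! -r "$enforce_file" ]; then
    echo "Disabled"        # file missing: selinuxfs not mounted, SELinux off or unavailable
    return 0
  fi
  case "$(cat "$enforce_file")" in
    1) echo "Enforcing" ;;
    0) echo "Permissive" ;;
    *) echo "Unknown" ;;
  esac
}
```

On a RHEL host, `getenforce` reports the same information, and `sudo setenforce 1` switches a permissive system to enforcing until the next reboot.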
Result:
Driver container logs: