-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[dashboard] dashboard error because of gpustat version 1.1 #34196
Comments
let me try repro when I have bandwidth |
Same here, this error breaks our whole simulation engine. Setting |
Can you verify what version of nvml you are using (either |
Actuall, from reading the linked issue, |
gpustat author here. As noted in wookayin/gpustat#153 this is highly likely to be due to conflicting dependencies on pynvml (nvidia-ml-py3 is a problem, should've been never installed). It'd be appreciated if you can provide the output of the following
to confirm this is a broken pynvml issue. Or please share the file You can also check |
|
The pynvml.py file is only 113 Bytes which is weird. Can you paste the content of the file? |
|
According to @wookayin analysis(wookayin/gpustat#153), the wrong pynvml should be used in my environment. If most people don't have this issue, I think this issue can be closed. I will continue to see which package introduced this wrong pynvml. thanks. |
Just to make it explicit here: there are two packages that provide
Thanks for the report. |
What happened + What you expected to happen
2023-04-09 08:36:28,479 ERROR services.py:1195 -- Failed to start the dashboard: Failed to start the dashboard, return code 1
The last 10 lines of /tmp/ray/session_2023-04-09_08-36-26_413307_64/logs/dashboard.log:
File "/usr/local/lib/python3.8/dist-packages/ray/dashboard/modules/reporter/reporter_agent.py", line 52, in
import gpustat.core as gpustat
File "/usr/local/lib/python3.8/dist-packages/gpustat/init.py", line 16, in
from .core import GPUStat, GPUStatCollection
File "/usr/local/lib/python3.8/dist-packages/gpustat/core.py", line 24, in
from gpustat.nvml import pynvml as N
File "/usr/local/lib/python3.8/dist-packages/gpustat/nvml.py", line 57, in
_original_nvmlGetFunctionPointer = pynvml._nvmlGetFunctionPointer
AttributeError: module 'pynvml' has no attribute '_nvmlGetFunctionPointer'
There is no error when using the gpustat 1.0.0
Versions / Dependencies
Ray 2.2.0
Reproduction script
ray start --node-ip-address=${head_address} --head --dashboard-host='0.0.0.0' --dashboard-port=8265
Issue Severity
None
The text was updated successfully, but these errors were encountered: