-
Notifications
You must be signed in to change notification settings - Fork 166
Issues: NVIDIA/dcgm-exporter
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
After the dcp indicator is enabled, dcgm-exporter reports an error
question
Further information is requested
#431
opened Dec 10, 2024 by
15234660879
Exporter does not provide any of the DCGM_FI_DEV_*_UTIL metrics
question
Further information is requested
#430
opened Dec 4, 2024 by
kt-pham
Memory usage increased 2.25x after upgrading from 3.3.6-3.4.2 to 3.3.9-3.6.1
#425
opened Nov 22, 2024 by
age9990
Support collecting pod labels
enhancement
New feature or request
#423
opened Nov 20, 2024 by
mtparet
dcgm-exporter counter value goes down
bug
Something isn't working
#417
opened Nov 14, 2024 by
luccabb
Not collecting GPU metrics; Error getting devices count: Cannot perform the requested operation because NVML doesn't exist on this system
question
Further information is requested
#416
opened Nov 13, 2024 by
saichanumolu9
Checksum mismatch for github.com/emicklei/go-restful/[email protected]
bug
Something isn't working
#415
opened Nov 7, 2024 by
WilliamVenner
Segfaults with dcgm-exporter 3.3.0 and higher
bug
Something isn't working
#412
opened Oct 30, 2024 by
andrewjamesbrown
Segmentation fault when running with the default configuration for the GPU Operator on kind
bug
Something isn't working
#409
opened Oct 29, 2024 by
klueska
failed to transform metrics for transform 'podMapper'; err: failure getting pod resources;
bug
Something isn't working
#408
opened Oct 29, 2024 by
jicki
I want to see how many GPU cores have been allocated to each container through metrics.
enhancement
New feature or request
#399
opened Oct 12, 2024 by
changhyuni
can not collect gpu utilization metric when mig enable for some pods
bug
Something isn't working
#397
opened Oct 8, 2024 by
melikeiremguler
dcgm-exporter daemonset Startup error Failed to pass the health check
question
Further information is requested
#393
opened Sep 26, 2024 by
guoliangmiao
In the case of gpu pass-through, does dcgm-exporter on the physical host support capturing gpu metrics of kvm virtual machines?
question
Further information is requested
#392
opened Sep 21, 2024 by
lddlww
DCGM Exporter in EKS p4d.24xlarge instance type controller error
bug
Something isn't working
#387
opened Sep 5, 2024 by
camilopaezrios
DCGM Exporter in EKS p4d.24xlarge instance type controller error
#386
opened Sep 5, 2024 by
camilopaezrios
DCGM-exporter pods stuck in Running State, Not getting Ready without GPU allocation.
question
Further information is requested
#385
opened Sep 3, 2024 by
rohitreddy1698
Add a health status metric for every gpu card
question
Further information is requested
#384
opened Aug 30, 2024 by
lx1036
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.