Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make alert ipmi_dcmi_power_cosumption_watts more standard. #97

Closed
jneo8 opened this issue Nov 14, 2023 · 1 comment · Fixed by canonical/prometheus-hardware-exporter#45
Labels
bug Something isn't working
Milestone

Comments

@jneo8
Copy link
Contributor

jneo8 commented Nov 14, 2023

No description provided.

@peppepetra
Copy link

peppepetra commented Nov 14, 2023

The current threshold for the alert rule is 200W.

This is generating false negatives. In one environment all machines are alerting and currently although the cloud is not experiencing any issue, the power consumption is the following:

ipmi_dcmi_power_cosumption_watts{instance="nova-compute/0", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/0"} | 285
ipmi_dcmi_power_cosumption_watts{instance="nova-compute/1", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/1"} | 229
ipmi_dcmi_power_cosumption_watts{instance="nova-compute/2", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/2"} | 229
ipmi_dcmi_power_cosumption_watts{instance="nova-compute/3", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/3"} | 238
ipmi_dcmi_power_cosumption_watts{instance="nova-compute/4", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/4"} | 282
ipmi_dcmi_power_cosumption_watts{instance="nova-compute/5", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/5"} | 230
ipmi_dcmi_power_cosumption_watts{instance="nova-compute/6", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/6"} | 218
ipmi_dcmi_power_cosumption_watts{instance="nova-compute/7", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/7"} | 246
ipmi_dcmi_power_cosumption_watts{instance="nova-compute/8", job="hardware-observer_0_default", juju_application="nova-compute", juju_model="openstack", juju_unit="nova-compute/8"} | 254

max over time is 346W since we started using hw-observer

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
3 participants