General collector fail alerts too frequent #147

Closed
honghan-wong opened this issue Jan 8, 2024 · 0 comments · Fixed by #148
@honghan-wong
Contributor

When ipmiselcollector times out, the alerts trigger immediately.
However, ipmi-sel sometimes hits a brief, transient timeout.

I propose using for: 1m instead of for: 0m for the ipmi-sel alerts, so that a short ipmi-sel timeout does not trigger them.

The error is as follows:

Jan 08 00:09:39 MACHINE01 python3[12345]: 2024-01-08 00:09:39 ERROR Command 'ipmi-sel --output-event-state --interpret-oem-data --entity-sensor-names' timed out after 30 seconds
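
To illustrate why a non-zero for duration suppresses a one-off timeout, here is a minimal sketch of the "condition must hold continuously for N seconds" idea. It is a simplified model, not the actual rule evaluator: the function name `firing_times`, the 30-second evaluation interval, and the sample data are all assumptions made up for this example.

```python
from typing import Iterable, List, Tuple


def firing_times(samples: Iterable[Tuple[int, bool]], for_seconds: int) -> List[int]:
    """Return the evaluation timestamps at which the alert would be firing.

    samples: (timestamp_seconds, condition_is_true) pairs in time order,
             one per rule evaluation (hypothetical data).
    for_seconds: how long the condition must hold continuously before firing.
    """
    firing = []
    active_since = None  # timestamp at which the condition first became true
    for ts, failing in samples:
        if not failing:
            active_since = None  # condition cleared: reset the pending state
            continue
        if active_since is None:
            active_since = ts
        if ts - active_since >= for_seconds:
            firing.append(ts)
    return firing


# A single transient collector timeout at t=30 (assumed 30-second evaluations).
transient = [(0, False), (30, True), (60, False), (90, False)]
# A persistent collector failure starting at t=30.
persistent = [(0, False), (30, True), (60, True), (90, True), (120, True)]

print(firing_times(transient, 0))    # for: 0m fires on the single failed evaluation
print(firing_times(transient, 60))   # for: 1m never fires for the transient case
print(firing_times(persistent, 60))  # for: 1m still fires once the failure persists
```

In this model the trade-off of the proposal is visible: a persistent failure is reported about one minute later than before, in exchange for ignoring a failure that clears by the next evaluation.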
honghan-wong added a commit to honghan-wong/hardware-observer-operator that referenced this issue Jan 8, 2024
honghan-wong mentioned this issue Jan 8, 2024
Pjack pushed a commit that referenced this issue Jan 15, 2024
* Fix: #147

* update to 1m eval time

* fix unit test eval time

---------

Co-authored-by: honghan <[email protected]>
Pjack added this to the 23.10.3 milestone Mar 25, 2024