-
Notifications
You must be signed in to change notification settings - Fork 102
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bind_boot_time_seconds appears to be shifting strangely #81
Comments
The exporter is only taking what bind reports in the Without examining the raw metric data, it's hard to say what's going on. |
That's what I'm saying: somehow it's mangling the raw metric. I am positive of this. I checked the raw. The raw in the XML is correct and more importantly, does not change. Yet the export is changing the value from the XML in strange ways. And this is reflected in the raw data from the exporter. |
The only thing we could do here is to build a version with logging of the raw XML data to see what's returned. Without some concrete proof that the exporter is doing something, there's nothing we can do. |
I definitely agree; this is going to need some XML dumping. However, I don't see any way to do that and frankly I have zero experience working in Go (so frankly, I suspect my attempt would mangle output.) Also probably safe to go ahead and drop the |
That reminds me, we should add support for the new json format. |
This is intriguing. Whilst working on a very trivial patch to eliminate |
The issue appears to be the use of
max(node_time_seconds{instance=~"$node:.*"})
which appears to not work in Prometheus 2.19, and a possible change in how Bind 9.16 reports uptime. The result is that changing the queries totime() - max(bind_boot_time_seconds{instance=~"$node:.*"})
produces sensible seeming results, but these are actually still off by an order of magnitude.i.e. a Bind 9.16 reported boot time of 2020-07-14T21:10:48.999Z, with a current time of 2020-07-14T22:11:56.299Z will report incorrectly with the above, claiming 5.8 hours. I thought I had found the order of magnitude error, but then I noticed that something is still wrong because it wasn't updating correctly. That's when I noticed that
bind_boot_time_seconds
was moving. It went from1594766106
to1594740334
, which is definitely not correct.The actual Bind statistics do not reflect a change in the corresponding XML or JSON.
The text was updated successfully, but these errors were encountered: