-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Etcd snapshot controller spews logs while waiting for nodes to annotate #8892
Comments
A good example of this is available in the logs from rancher/rke2#5048 It continues on like this for a while until things settle down.
|
Hi @brandond, is this issue and the corresponding backports ready to be moved to Test? I see the PR linked has been merged |
Yes, sorry - I apparently missed assigning it to myself so I didn't see it in the project board. |
I'm wondering if whatever rancher manager is doing in their CI is causing this because I cannot reproduce it whether through rancher or standalone, even when having the etcd-snapshots tuned to take one every minute. I have tried all versions available. I will confirm no regressions to this logging and then close this issue. Using |
@rancher-max you'll see this spew when using embedded etcd, but no snapshots exist - either locally, or in the configured S3 bucket. |
Validated on v1.28.4-rc1+k3s1Environment DetailsInfrastructure
Node(s) CPU architecture, OS, and Version:
Cluster Configuration:
Config.yaml:
Testing Steps
Replication Results:
Validation Results:
Based on the above, I'm going to close this out. I think the log spew must appear under different circumstances, and I'm not noticing any base functionality changes here so should be okay either way. |
Early in cluster startup, before any etcd nodes have reconciled their snapshots and set the annotations, the full sync loop will continually return errNotReconciled and be immediately reenqueued:
k3s/pkg/etcd/snapshot_controller.go
Lines 201 to 203 in e53c189
k3s/pkg/etcd/snapshot_controller.go
Lines 62 to 67 in e53c189
Unfortunately this creates a LOT of log spew, especially on RKE2 where startup takes a little longer to settle out.
We should move the log line further down the function, or reduce its verbosity.
The text was updated successfully, but these errors were encountered: