[bug]: WTCL: received error code CodePermanentFailure in StateUpdateReply for seqnum=xxx #7949
Comments
Hey,
Thanks @MButcho - and do you have access to the watchtower's logs too?
No, I don't, but I may be able to contact the person providing the watchtower. Let me ping him over Telegram.
Well, the only line on my side (watchtower server) containing the session string you mentioned is written every 5 minutes, and that's it:
2023-09-06 18:29:14.305 [INF] WTWR: Accepted incoming peer 02a0a6a24489427882a03c077d556d5981c59478715a4fb4dce6f0f75336d67d59@x.x.x.x:49682
I can't really imagine the cause. Probably an error has invalidated some data. The info on my side is very sparse; perhaps a debug log would help more.
lncli version 0.16.2-beta commit=v0.16.2-beta
And this is the log with command
2023-09-06 19:39:16.654 [INF] WTWR: Accepted incoming peer 02a0a6a24489427882a03c077d556d5981c59478715a4fb4dce6f0f75336d67d59@x.x.x.x:38540
After a server restart today, another seqnum started to appear:
Thanks for the extra logs @MButcho 🙏
The first thing I think we should do is prioritize getting this PR in, as it will allow the client to just mark the problem session(s) as borked so that it can continue its operations.
From a quick look at the code, the server will return
I think the first scenario is very unlikely. So if it is the second scenario, then there is likely a race happening somewhere, similar to #6397. I will dig into this a bit. But in the meantime, I think we should get #7545 in so that things can continue when investigations like this need to happen.
@ellemouton Should I stop lnd, remove wtclient.db and start lnd?
Hi, I have read that some people do this. Afterwards you have to reconnect to the watchtower.
Yes @MButcho - I think that is a good idea for now (and yes, like @fbrz76 says, you will need to reconnect to the tower if you do that). It will allow you to get unstuck and continue to back up updates. And your previous backups will remain on the tower 👍 Then, once we have this fix in, there will be a better way to move past something like this without having to delete your db.
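Since deleting the db is a one-way step, it may be safer to move it aside instead, so the old file can be restored if the fresh client misbehaves. Below is a minimal sketch of that idea, not an lnd tool or the maintainers' procedure: the helper name `set_aside_db` is hypothetical, the location of wtclient.db depends on your lnd data directory and is left as a parameter, and it assumes lnd is already stopped.

```python
import shutil
import time
from pathlib import Path


def set_aside_db(db_path, suffix=None):
    """Rename a database file to a timestamped backup and return the new path.

    Hypothetical helper: run it only while lnd is stopped.
    """
    db_path = Path(db_path)
    if not db_path.is_file():
        raise FileNotFoundError(db_path)

    # Default suffix is the current local time, e.g. 20230914-222648.
    suffix = suffix or time.strftime("%Y%m%d-%H%M%S")
    backup = db_path.with_name(f"{db_path.name}.{suffix}.bak")

    # Refuse to overwrite an earlier backup with the same name.
    if backup.exists():
        raise FileExistsError(backup)

    shutil.move(str(db_path), str(backup))
    return backup
```

After calling it with the path to wtclient.db, start lnd and add the tower again; to roll back, stop lnd and move the backup file to its original name.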
Stopped lnd, removed wtclient.db, started lnd and added a new tower:
lncli wtclient towers
lncli wtclient stats
2023-09-14 22:26:48.687 [INF] WTCL: (legacy) Client stats: tasks(received=0 accepted=0 ineligible=0) sessions(acquired=0 exhausted=0)
Let's see if it starts backing up again.
It is not even trying to connect to the tower now, and there is no connection from the tower either. Very strange: the tower is present, but I get the following:
Later
I have restored the old wtclient.db and it is backing up again. I'll keep it with the error for now. Better an error in the log than no backup ;)
lncli wtclient stats
You should defs connect to a tower though.
This just means that the tower you've connected to is not responding. So perhaps you need to give it another address or something?
Maybe you missed this part; I added it in a later comment:
Update: there was a DNS issue with the tower. I connected with a new wtclient.db and all seems fine.
Hi all, once #8239 makes it into a release (likely 0.18.0), if you experience something like this issue, you should be able to manually terminate the problem session using the new command. Please comment here if the issue persists (once 0.18.0 has been released and the new command has been tried).
Background
WTCL is working fine but I receive this error every 5 mins on the same seqnum:
[ERR] WTCL: (anchor) SessionQueue(xxx) unable to send state update: received error code CodePermanentFailure in StateUpdateReply for seqnum=570
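To confirm that the error really is pinned to one session and one sequence number, the failures in the log can be tallied. This is a minimal sketch that assumes the line format of the single error quoted above; other lnd versions may word it differently, and the function name `count_permanent_failures` is illustrative, not part of lnd.

```python
import re
from collections import Counter

# Pattern modelled on the one error line quoted in this report (an assumption
# about the format, not a documented lnd log schema).
FAILURE_RE = re.compile(
    r"SessionQueue\((?P<session>[^)]*)\) unable to send state update: "
    r"received error code CodePermanentFailure in StateUpdateReply "
    r"for seqnum=(?P<seqnum>\d+)"
)


def count_permanent_failures(lines):
    """Count CodePermanentFailure errors per (session, seqnum) pair."""
    counts = Counter()
    for line in lines:
        match = FAILURE_RE.search(line)
        if match:
            key = (match.group("session"), int(match.group("seqnum")))
            counts[key] += 1
    return counts
```

Passing it the lines of the lnd log (for example an open file object) yields one counter entry per stuck session and seqnum; a single entry with a steadily growing count matches the behaviour described here.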
Your environment
Steps to reproduce
It happens continually
Expected behaviour
The error should not be in the log
Actual behaviour
Error in log