Overloads with idling threads #76
-
Hi, I don't see anything wrong with your htop. The threads are using anywhere from 0.7% to 2.0% CPU, which is good and the way it should be. The 106% CPU usage you see is actually a combined value: what Tor uses on one CPU plus what the workers use across multiple threads and CPUs. The Tor process itself is not using 106% of one CPU. As for compare.sh, please check /var/tmp for a file named file2. See if it has entries and how old the file is; you may have a problem downloading the most recent file. Please post the result of your compare.sh here. What's your advertised bandwidth? Do you mind telling me your server's nick so I can take a look at your bandwidth history graph?
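If it helps, here's a minimal sketch of that check (assuming GNU coreutils and the /var/tmp/file2 path mentioned above):

```bash
#!/usr/bin/env bash
# Sanity-check the file that compare.sh downloads.
FILE=/var/tmp/file2

if [ ! -f "$FILE" ]; then
    echo "file2 is missing - the download is probably failing"
    exit 1
fi

echo "Entries:  $(wc -l < "$FILE")"
echo "Modified: $(stat -c %y "$FILE")"   # GNU stat

# Warn if the file is older than a day, i.e. the download has gone stale.
if [ -n "$(find "$FILE" -mtime +0)" ]; then
    echo "Warning: file2 is more than 24 hours old"
fi
```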
-
The threads are doing alright; that's the usual load per worker. Tor looks at the number of workers, their current workload, and how long it will take them to finish the job. If the time for a worker to become available for a new job is longer than a certain threshold, Tor starts dropping NTor onion skins. If you see one or two worker threads show 0% from time to time, then you know your NumCPUs is just at the right spot. 24 should be enough for your kind of bandwidth, but I think Tor gets into trouble when you reach your burst, and with the DDoS attacks I'm assuming you reach the burst often. Figure out what kind of bandwidth you're comfortable with, keep your burst close to your maximum advertised bandwidth, and then find the sweet spot for NumCPUs.

It also seems that you restart too often. When you get an overload status, keep Tor running; the overload message will go away after two or three heartbeats. As time goes by, the ban list gets populated, you establish enough connections with other relays, and your system runs smoother and smoother. Each time you change NumCPUs, though, you'll have to restart Tor; a -HUP will not do.

As for conntrack.sh, you didn't post the result. Does it show 0 relays, or does it show low numbers? Please post the results if you can. You also didn't tell me how old file2 was.
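As a rough illustration only, the relevant torrc lines might look like this; the numbers here are placeholders, not recommendations:

```
# torrc - illustrative values only
NumCPUs 24                      # changing this requires a full restart; -HUP is not enough
RelayBandwidthRate  20 MBytes
RelayBandwidthBurst 22 MBytes   # keep the burst close to your advertised bandwidth
```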
-
So conntrack.sh is working. However, your iptables rules are not. None of those IP addresses with more than 2 connections would have been there (except for your IP and the snowflake servers) had the rules been applied, and the number of IPs with two connections should have been almost equal to the number of dual-OR relays (maybe 5 more). Are you running the newest version of the rules? Did you run conntrack.sh right after you ran the multi.sh script? You should download the newest version and make sure it runs with no errors. Make sure you have populated ipv4.txt and ipv6.txt with the correct IPs in the correct format. After you run it, confirm that the rules have been applied by typing iptables -S -t mangle, and make sure the ipsets are there too by typing ipset list.

If you like, you can copy and paste the result of iptables -S -t mangle here so I can take a look at it. But from what I see, the problem may not be NumCPUs. The problem is that the rules are not doing what they should do.
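A small sketch of that verification, run as root (the checks are generic; the actual rule and set names are whatever the script created):

```bash
#!/usr/bin/env bash
# Verify the firewall pieces are actually in place.

echo "--- mangle table rules ---"
iptables -S -t mangle

echo "--- ipsets (headers only) ---"
ipset list -t

# If no sets exist at all, multi.sh did not run cleanly.
if ! ipset list -t | grep -q '^Name:'; then
    echo "No ipsets found - re-run the rules script and watch for errors"
fi
```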
-
No, that's not normal. If you run multi.sh after having Tor run for a while, you'd see something like that, but after about 10 minutes all of it should disappear. Just noticed: I can't see your public IP in the result of your conntrack.sh, only a local IP, 10.0.0.225. Are you behind a router and on a local LAN? If that's the case, the IP addresses in your ipv4.txt and ipv6.txt are wrong. The destination for the incoming packets should be your local IP. Chances are your ban list ipset is empty too.
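A quick way to double-check which local address Tor is actually bound to (a sketch; run as root so ss can show process names):

```bash
# Which address/port is the tor process listening on?
ss -tlnp | grep -i tor

# What IPv4 addresses does this machine actually have?
ip -4 addr show
```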
-
Looking at the results again, I see that your public IPv6 address seems to be the destination of the packets, so I'd keep that in ipv6.txt for now, but definitely change the IPv4.
-
That IP seems to have very few connections, and they're probably local. The public IPv6 has 411 connections, which tells me that IP is the main destination, so keep that. Just change the IPv4 for now and we'll monitor your system to see if any other changes are necessary.
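For reference, a rough one-liner using the conntrack CLI (which these scripts rely on) to count tracked TCP connections per original destination address; the address with the largest count is the real destination of incoming traffic:

```bash
# Run as root. Prints "count address", busiest destination first.
conntrack -L -p tcp 2>/dev/null \
  | awk '{ for (i = 1; i <= NF; i++) if ($i ~ /^dst=/) { print substr($i, 5); break } }' \
  | sort | uniq -c | sort -rn | head
```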
-
Well, again to clarify: the IPv6 with 411 connections in that image goes in ipv6.txt, and 10.0.0.225 goes in ipv4.txt. If you have done port forwarding and your local ORPort is different from your public ORPort, the local one goes there too. You don't have to restart either. If you run multi.sh and we're right, those connections will start to disappear one by one. If you see that happen, then it's working.
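If it helps, the two files would then contain something like the following. This is hypothetical: the IPv6 address is a documentation placeholder, 9001 stands in for your ORPort, and the exact line format (with or without the port) should match whatever the script's README specifies:

```
# ipv4.txt - local destination address, plus the local ORPort
# if it differs from the public one
10.0.0.225 9001

# ipv6.txt - replace with your real public IPv6 address
2001:db8::1 9001
```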
-
It works perfectly! The only items in the list are loopbacks and one other, which is another machine I'm running. Thanks so much for your help.
-
My pleasure. Glad it worked out. Eventually, once you run conntrack.sh, in the top section where it shows IPs with more than 2 connections, you should see your own IPs and two or three IPs from snowflake.
-
Hey! I'm having a strange issue where I have many worker threads (NumCPUs 24) that simply idle and do mostly nothing, yet the main process eats an entire core and still throws many overload messages, dropping NTor onion skins while the worker threads use basically no CPU. Also possibly worth mentioning: after updating to v5, I let it run for a day and checked compare.sh, yet no relays were in the list. htop screenshot below:
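For anyone hitting the same thing, a quick way to confirm the drops from the logs (a sketch; assumes Tor runs under systemd as a service named tor, and the exact log wording varies between Tor versions):

```bash
# Look for overload and onion-skin-drop indicators in the last day of logs.
journalctl -u tor --since "1 day ago" \
  | grep -Ei 'overload|onion ?skin|too slow|circuit creation'
```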