Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

High CPU load on x64 dockerhost system #3822

Open
sxa opened this issue Nov 25, 2024 · 4 comments
Open

High CPU load on x64 dockerhost system #3822

sxa opened this issue Nov 25, 2024 · 4 comments

Comments

@sxa
Copy link
Member

sxa commented Nov 25, 2024

Machine is frequently running with a load of around 60 while the test runs are in progress. The machine has 16 cores. Noting that we have over ten test containers on this system so we may need to revise the CPU caps (and make sure they are working as expected).

We could also look at reducing the priority of the Solaris vagrant machines since that is a low priority platform.

Noting also that the riscv64 builds are running with a particularly high CPU % in the containers:

CONTAINER ID   NAME                CPU %      MEM USAGE / LIMIT     MEM %     NET I/O           BLOCK I/O         PIDS
36134aa4628d   distracted_galois   1176.88%   7.998GiB / 62.79GiB   12.74%    218MB / 515kB     6.38GB / 6.04GB   359
@sxa
Copy link
Member Author

sxa commented Dec 24, 2024

@Haroon-Khel If you get some gaps between JSF runs can you see if there's something wrong with the configuration of the containers on this host. We may need to drop the number we're running too :'(

@Haroon-Khel
Copy link
Contributor

Taking a look

@Haroon-Khel
Copy link
Contributor

At the moment we limit cpu usage to 0-3,

command: "{{ docker_run_command }} --restart unless-stopped -p {{ docker_port }}:22 --cpuset-cpus='0-3' --memory=6G --detach --name {{ docker_image | upper }}.{{ docker_port }}{{ arm32_suffix | upper }} aqa_{{ docker_image }}{{ arm32_suffix }}"

We could look at reducing this to 0-2

And yes theres certainly room for dropping the number of running containers outside of a release period, ive stopped https://ci.adoptium.net/computer/test-docker-ubuntu2004-x64-2/

@sxa
Copy link
Member Author

sxa commented Dec 24, 2024

At the moment we limit cpu usage to 0-3,

command: "{{ docker_run_command }} --restart unless-stopped -p {{ docker_port }}:22 --cpuset-cpus='0-3' --memory=6G --detach --name {{ docker_image | upper }}.{{ docker_port }}{{ arm32_suffix | upper }} aqa_{{ docker_image }}{{ arm32_suffix }}"

We could look at reducing this to 0-2

And yes theres certainly room for dropping the number of running containers outside of a release period, ive stopped https://ci.adoptium.net/computer/test-docker-ubuntu2004-x64-2/

Bearing in mind the machine only has 16 cores, so if we have "0-3" (Presumably 4 cores?) for each container, that's very quickly going to exceeed the amount of CPU available. I'd be tempted to drop them to 0-1 which I presume would be 2 cores each, and make sure we have no more than 8 containers (Bear in mind that this also doesn't include build work, or the Solaris work on the machine which will push it a bit higher at times)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: No status
Development

No branches or pull requests

2 participants