(RHEL-9322) cgroup: drastically simplify caching of cgroups members mask #295
base: rhel-8.10.0
Conversation
I think we shouldn't just cherry-pick the single commit from the lengthy series in systemd/systemd#10894 and hope for the best. I have a gut feeling this will introduce regressions in our handling of cgroups. At the very least, we shouldn't be introducing dead code.
src/core/unit.h
Outdated
CGroupMask cgroup_members_mask;
CGroupMask cgroup_realized_mask;    /* In which hierarchies does this unit's cgroup exist? (only relevant on cgroupsv1) */
CGroupMask cgroup_enabled_mask;     /* Which controllers are enabled (or more correctly: enabled for the children) for this unit's cgroup? (only relevant on cgroupsv2) */
CGroupMask cgroup_invalidated_mask; /* A mask specifiying controllers which shall be considered invalidated, and require re-realization */
cgroup_invalidated_mask isn't used anywhere. Also, the comment contains a typo: "specifiying".
I attempted to backport the whole series at first, but abandoned that approach in the end. The PR depends on previous PR(s). It would likely take several days to sort it out and backport everything needed.
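For context, here is a minimal self-contained sketch, with simplified types, of how the rest of the upstream series is supposed to consume this field: invalidating a unit's cgroup ORs the affected controllers into cgroup_invalidated_mask and queues the unit for re-realization. The helper name unit_add_to_cgroup_realize_queue() mirrors the upstream naming, but the code below is only an illustration, not the backported implementation.

```c
#include <stdbool.h>
#include <stdint.h>

typedef uint64_t CGroupMask;   /* one bit per controller, as in cgroup-util.h */

typedef struct Unit {
        CGroupMask cgroup_invalidated_mask;
        bool in_cgroup_realize_queue;
        /* ... */
} Unit;

/* Illustrative stand-in for the real realize queue. */
void unit_add_to_cgroup_realize_queue(Unit *u) {
        u->in_cgroup_realize_queue = true;
}

/* Sketch: mark controllers of 'u' as invalidated and schedule re-realization.
 * The real function also widens the mask for cgroupv1 compat pairs. */
void unit_invalidate_cgroup(Unit *u, CGroupMask m) {
        if (m == 0)
                return;

        if ((u->cgroup_invalidated_mask & m) == m)  /* nothing new to invalidate */
                return;

        u->cgroup_invalidated_mask |= m;
        unit_add_to_cgroup_realize_queue(u);
}
```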
… controllers being pulled in

This shouldn't make much difference in real life, but is a bit cleaner.

(cherry picked from commit 442ce77)
Related: RHEL-9322
…on a cgroup

This changes cg_enable_everywhere() to return which controllers are enabled for the specified cgroup. This information is then used to correctly track the enablement mask currently in effect for a unit. Moreover, when we try to turn off a controller, and this works, then this indicates that the parent unit might successfully turn it off now, too, as our unit might have kept it busy.

So far, when realizing cgroups, i.e. when syncing up the kernel representation of relevant cgroups with our own idea, we would strictly work from the root to the leaves. This is generally a good approach, as when controllers are enabled this has to happen in root-to-leaves order. However, when controllers are disabled this has to happen in the opposite order: in leaves-to-root order (this is because a controller can only be enabled in a child if it is already enabled in the parent, and if it shall be disabled in the parent then it has to be disabled in the child first, otherwise it is considered busy when we attempt to remove it in the parent).

To make things complicated, when invalidating a unit's cgroup membership, systemd can actually turn off some controllers previously turned on at the very same time as it turns on other controllers previously turned off. In such a case we have to work leaves-to-root *and* root-to-leaves right after each other. With this patch this is implemented: we still generally operate root-to-leaves, but as soon as we notice we successfully turned off a controller previously turned on for a cgroup, we'll re-enqueue the cgroup realization for all parents of the unit, thus implementing leaves-to-root where necessary.

(cherry picked from commit 27adcc9)
Related: RHEL-9322
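A rough, self-contained sketch of the leaves-to-root step described above, assuming realization can report which controllers it actually managed to turn off: when that set is non-empty, every ancestor gets queued for another realization pass so the freed controllers can be disabled further up as well. The parent pointer and queue helper are simplified stand-ins, not the real systemd data structures.

```c
#include <stdint.h>
#include <stdbool.h>

typedef uint64_t CGroupMask;

typedef struct Unit Unit;
struct Unit {
        Unit *parent;                    /* slice the unit lives in; NULL for the root */
        CGroupMask cgroup_enabled_mask;  /* controllers currently enabled for our children */
        bool in_cgroup_realize_queue;
};

/* Illustrative queue helper. */
void unit_add_to_cgroup_realize_queue(Unit *u) {
        u->in_cgroup_realize_queue = true;
}

/* Called after one unit's cgroup was realized.  'new_enabled' is what the
 * kernel actually accepted in cgroup.subtree_control for this cgroup. */
void unit_after_realize(Unit *u, CGroupMask new_enabled) {
        /* Controllers that used to be enabled but were turned off successfully. */
        CGroupMask turned_off = u->cgroup_enabled_mask & ~new_enabled;

        u->cgroup_enabled_mask = new_enabled;

        if (turned_off == 0)
                return;

        /* Leaves-to-root: give every ancestor a chance to drop the controller
         * too, now that this unit no longer keeps it busy. */
        for (Unit *p = u->parent; p; p = p->parent)
                unit_add_to_cgroup_realize_queue(p);
}
```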
After creating a cgroup we need to initialize its "cgroup.subtree_control" file with the controllers its children want to use. Currently we do so whenever the mkdir() on the cgroup succeeded, i.e. when we know the cgroup is "fresh". Let's update the condition slightly so that we also do so when internally we assume a cgroup doesn't exist yet, even if it already does (maybe left over from a previous run). This shouldn't change anything IRL, but makes things a bit more robust.

(cherry picked from commit 1fd3a10)
Related: RHEL-9322
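For illustration, the cgroup.subtree_control mechanics the message refers to, as a small standalone sketch: on cgroup v2, a parent enables controllers for its children by writing "+controller" tokens into that file. The path and controller list below are made-up examples, and a real caller would need proper error handling for controllers the kernel refuses to enable.

```c
#include <stdio.h>

/* Enable the given controllers for the children of 'cgroup_path'.
 * Returns 0 on success, -1 on failure. */
int enable_subtree_controllers(const char *cgroup_path, const char *const *controllers) {
        char path[4096];
        snprintf(path, sizeof(path), "%s/cgroup.subtree_control", cgroup_path);

        FILE *f = fopen(path, "w");
        if (!f)
                return -1;

        for (const char *const *c = controllers; *c; c++)
                fprintf(f, "+%s ", *c);   /* e.g. "+cpu +memory +pids" */

        /* The kernel may reject the write; the error surfaces when the
         * stdio buffer is flushed on fclose(). */
        return fclose(f) == 0 ? 0 : -1;
}

int main(void) {
        /* Hypothetical cgroup created beforehand with mkdir(); path is an example only. */
        const char *controllers[] = { "cpu", "memory", "pids", NULL };
        return enable_subtree_controllers("/sys/fs/cgroup/example.slice", controllers) < 0;
}
```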
Previously we tried to be smart: when a new unit appeared and it only added controllers to the cgroup mask, we'd update the cached members mask in all parents by ORing in the controller flags in their cached values. Unfortunately this was quite broken, as we missed some conditions when this cache had to be reset (for example, when a unit got unloaded); moreover, the optimization doesn't work when a controller is removed anyway (as in that case the parent has no choice but to iterate through all children to check whether any other, remaining child unit still needs it).

Hence, let's simplify the logic substantially: instead of updating the cache on the right events (which we didn't get right), let's simply invalidate the cache and generate it lazily when we encounter it later. This should actually result in better behaviour as we don't have to calculate the new members mask for a whole subtree whenever we have the suspicion something changed, but can delay it to the point where we actually need the members mask.

This allows us to simplify things quite a bit, which is good, since validating this cache for correctness is hard enough.

Fixes: #9512
(cherry picked from commit 5af8805)
Resolves: RHEL-9322
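A minimal sketch of the lazy scheme described above, with simplified types standing in for the real Unit bookkeeping: the cached members mask carries a validity flag, queries recompute it from the children on demand, and nothing attempts to keep it incrementally up to date anymore.

```c
#include <stdint.h>
#include <stdbool.h>

typedef uint64_t CGroupMask;

typedef struct Unit Unit;
struct Unit {
        Unit *children;                 /* first child (simplified sibling list) */
        Unit *next_sibling;
        CGroupMask own_mask;            /* controllers this unit itself needs */
        CGroupMask cgroup_members_mask; /* cached: controllers needed by this unit or any descendant */
        bool cgroup_members_mask_valid;
};

/* Lazily (re)compute the members mask: our own controllers plus everything
 * any descendant needs.  Recomputed only when the cache was invalidated, at
 * the moment somebody actually asks for it. */
CGroupMask unit_get_members_mask(Unit *u) {
        if (u->cgroup_members_mask_valid)
                return u->cgroup_members_mask;

        CGroupMask m = u->own_mask;
        for (Unit *c = u->children; c; c = c->next_sibling)
                m |= unit_get_members_mask(c);

        u->cgroup_members_mask = m;
        u->cgroup_members_mask_valid = true;
        return m;
}
```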
This way we can correctly ensure that when a unit that requires some controller goes away, we propagate the removal of it all the way up, so that the controller is turned off in all the parents too.

(cherry picked from commit b8b6f32)
Related: RHEL-9322
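The upward propagation mentioned here is the counterpart to the lazy cache sketched above; again a sketch with simplified types: when a unit goes away, the cached members masks of all its ancestors are dropped, so the next realization pass recomputes them and can switch the now-unneeded controllers off up the tree.

```c
#include <stdbool.h>

typedef struct Unit Unit;
struct Unit {
        Unit *parent;                    /* slice the unit lives in; NULL at the root */
        bool cgroup_members_mask_valid;  /* see the lazy getter sketched above */
};

/* Drop the cached members masks of a unit and all of its ancestors.
 * Called e.g. when a unit is unloaded, so that the controllers it needed
 * can be switched off further up once nothing else requires them. */
void unit_invalidate_cgroup_members_masks(Unit *u) {
        for (; u; u = u->parent)
                u->cgroup_members_mask_valid = false;
}
```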
Force-pushed from 78e2717 to 83ada01
(cherry picked from commit 43738e0) Related: RHEL-9322
Force-pushed from 83ada01 to b719f06
The CI failures looked related, but to make sure they are not just flukes, @lnykryn force-pushed the feature branch to rerun the checks.
Unit tests seem to be failing because of #434, but I'm not sure about CentOS CI.
The CentOS CI failure is concerning. @dtardon, can you please have a look?
Oh well, the CentOS CI for this repo is currently FUBAR, since CentOS Stream 8 is no more. I'll need to figure something out to make it work again...
@mrc0mmand Maybe we could arrange a "Red Hat Developer Subscription for Teams", i.e. a self-service subscription; then we could generate an activation key, store it as a GH secret, and use it to enable all the needed repos (e.g. CRB) in the UBI RHEL container image.
I mean, yeah, that would work for the container stuff (and I plan to actually do that), but it wouldn't work for CentOS CI, since, as the name suggests, it only supports CentOS. So I'll move the C8S job to C9S, which should work just fine. The environment will be different, but it should be enough to make sure we haven't royally screwed something up (we run the same tests internally on actual RHEL 8 anyway).
CentOS CI should be back (albeit with some compromises, see systemd/systemd-centos-ci@f415a75). I'll look into the container stuff next.
A quick workaround for the current C8S containers is in #440.
Previously we tried to be smart: when a new unit appeared and it only added controllers to the cgroup mask, we'd update the cached members mask in all parents by ORing in the controller flags in their cached values. Unfortunately this was quite broken, as we missed some conditions when this cache had to be reset (for example, when a unit got unloaded); moreover, the optimization doesn't work when a controller is removed anyway (as in that case the parent has no choice but to iterate through all children to check whether any other, remaining child unit still needs it).

Hence, let's simplify the logic substantially: instead of updating the cache on the right events (which we didn't get right), let's simply invalidate the cache and generate it lazily when we encounter it later. This should actually result in better behaviour as we don't have to calculate the new members mask for a whole subtree whenever we have the suspicion something changed, but can delay it to the point where we actually need the members mask.

This allows us to simplify things quite a bit, which is good, since validating this cache for correctness is hard enough.

Fixes: #9512
(cherry picked from commit 5af8805)
Resolves: #2096371