Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge release/2.6 into google/2.6 #15759

Open
wants to merge 6 commits into
base: google/2.6
Choose a base branch
from
Open

Merge release/2.6 into google/2.6 #15759

wants to merge 6 commits into from

Conversation

mjmac
Copy link
Contributor

@mjmac mjmac commented Jan 22, 2025

wangshilong and others added 6 commits January 13, 2025 18:09
Change the name to more closely reflect the underlying
SWIM status, and reduce user confusion. An engine that
has been marked DEAD by SWIM cannot participate in pool
services, and has most likely already SIGKILL-ed itself.

Update "engines become unavailable" in admin/troubleshooting.md to link
to the new "suspect ranks" description.

Signed-off-by: Li Wei <[email protected]>
Signed-off-by: Michael MacDonald <[email protected]>
Signed-off-by: Wang Shilong <[email protected]>
Co-authored-by: Michael MacDonald <[email protected]>
Co-authored-by: Li Wei <[email protected]>
When a DTX leader globally commit the DTX, it is possible that some
DTX participant(s) cannot commit such DTX entry because of kinds of
issues, such as network or space trouble. Under such case, the DTX
leader needs to keep the active DTX entry persistently for further
commit/resync. But it does not means related modification attched
to such DTX entry on the leader target cannot be committed, instead,
we can commit related modification with only keeping the DTX header.
That is enough for the DTX leader to do further DTX commit/resync
to handle related former failed DTX participant(s).

The benefit is that VOS aggregation on the leader target will not
be affected by remote DTX commit failure.

Signed-off-by: Fan Yong <[email protected]>
Currently, DAOS supports EC (Erasure Coding) object classes with redundancy levels of N+1 and N+2.
In certain scenarios, users may wish to use N+3 for enhanced redundancy and safety. Generally,
DAOS’s EC and rebuild mechanisms are designed to handle various parity levels effectively.

With the introduction of new object classes, comprehensive testing should be conducted to
ensure that these changes do not introduce any unexpected issues or disrupt existing functionalities.

Extend test cases to cover EC_4P3X object classes as a min test coverage.

Signed-off-by: Wang Shilong <[email protected]>
The evt recx trace is used for vos aggregation debugging, and it's currently
reset on akey iteration callback, but the akey iteration callback could be
skipped in some cases, for example, when evt aggregation hit an aborted recx,
it'll start over in evtree level without the recx trace reset, that could
lead to integer overflow on the 'int ap_trace_count'.

This patch moved the ap_trace_count reset to merge window open/close to ensure
the evt recx trace always being reset properly.

Signed-off-by: Niu Yawei <[email protected]>
Co-authored-by: Niu Yawei <[email protected]>
Features: pool
Change-Id: I9e7d49d5c31dc048546de2010a88fcca5ff4a9d2
Signed-off-by: Michael MacDonald <[email protected]>
Copy link

Errors are component not formatted correctly,Ticket number prefix incorrect,PR title is malformatted. See https://daosio.atlassian.net/wiki/spaces/DC/pages/11133911069/Commit+Comments,Unable to load ticket data
https://daosio.atlassian.net/browse/Merge

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

5 participants