Merge release/2.6 into google/2.6 #15794

Merged
merged 5 commits into from
Jan 27, 2025

jolivier23
Contributor

wangshilong and others added 5 commits January 24, 2025 12:35
It has been observed that pool_svc_step_up_cb may encounter a
-DER_NOTLEADER and pass it to ds_pool_failed_add. This error is a
replica error and may be transient; it doesn't indicate that the PS is
unavailable. This patch addresses the observed scenario by replacing the
ds_pool_failed_add call from pool_svc_step_up_cb with a special
up-but-with-error mode for the PS, which can only serve requests by
returning an error.

  - Add pool_svc.ps_error for indicating the special up-but-with-error
    mode. Check and return it in pool_svc_lookup_leader. Handle it
    specially in callers of pool_svc_lookup.

  - Use this new mode only for a conservative set of errors. Including
    an error by mistake is worse than missing an error.

  - Add pool UUIDs to a few log messages to make future debugging
    easier.

The ds_pool_failed_add mechanism should be used for replica errors only,
and such errors should not immediately stop PS clients from trying
other replicas. That issue is relatively tricky and is not addressed by
this patch.
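
To illustrate the up-but-with-error mode described above, here is a minimal sketch, not the actual patch. Only the ps_error field name and the idea that the leader lookup checks and returns it come from the commit message. The struct layout, the sketch_* function names, the error codes, and the membership of the conservative error set are all hypothetical.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical error codes; the real DER_* values are not reproduced here. */
enum {
    SKETCH_NOTLEADER      = -1, /* transient replica error */
    SKETCH_ERR_PERSISTENT = -2  /* stand-in for an error on the conservative list */
};

/* Hypothetical, heavily reduced model of a pool service replica. */
struct sketch_pool_svc {
    bool up;       /* this replica has stepped up as leader */
    int  ps_error; /* nonzero: up, but every request is answered with this error */
};

/* Conservative allow-list (assumed): only listed errors enter the special mode. */
static bool sketch_enters_error_mode(int rc)
{
    return rc == SKETCH_ERR_PERSISTENT;
}

/* Called with the result of the step-up work. */
static int sketch_step_up(struct sketch_pool_svc *svc, int rc)
{
    assert(svc != NULL);
    svc->up = false;
    svc->ps_error = 0;
    if (rc == 0) {
        svc->up = true;
    } else if (sketch_enters_error_mode(rc)) {
        /* Stay up, but remember the error so that requests can report it. */
        svc->up = true;
        svc->ps_error = rc;
        rc = 0;
    }
    /* Any other error (e.g. a transient one) simply fails the step-up. */
    return rc;
}

/* Leader lookup: checks ps_error and returns it instead of the service. */
static int sketch_lookup_leader(struct sketch_pool_svc *svc,
                                struct sketch_pool_svc **out)
{
    assert(svc != NULL && out != NULL);
    *out = NULL;
    if (!svc->up)
        return SKETCH_NOTLEADER;
    if (svc->ps_error != 0)
        return svc->ps_error;
    *out = svc;
    return 0;
}
```

In this sketch a transient error such as SKETCH_NOTLEADER never sets ps_error, which mirrors the stated preference for missing an error over including one by mistake.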

Signed-off-by: Li Wei <[email protected]>
…15772)

Increase timeout for unit tests with memcheck

Signed-off-by: Lei Huang <[email protected]>
Improve concurrent POOL_QUERY, POOL_CONNECT, and POOL_TGT_QUERY_MAP efficiency by giving them a chance to share the same pool map buffer and pool map buffer bulk handle.

Introduce a pool space query cache on the service leader to avoid space query flooding. The cache expires after 2 seconds by default. The expiration time can be changed via DAOS_POOL_SPACE_CACHE_INTVL, and setting it to zero disables the space cache.
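
The expiration behavior described above can be sketched roughly as follows. This is an illustration under assumptions, not the code from this change: only the DAOS_POOL_SPACE_CACHE_INTVL name, the 2-second default, and the zero-disables rule come from the commit message. The sketch_* names, the struct fields, the assumption that the variable is given in whole seconds, and the fallback to the default on unparsable input are all hypothetical.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>

#define SKETCH_DEFAULT_INTVL 2 /* seconds, per the commit message */

struct sketch_space {
    uint64_t total;
    uint64_t free;
};

struct sketch_space_cache {
    struct sketch_space space;
    uint64_t            fetched_at; /* time of last refresh, in seconds */
    bool                valid;
    uint64_t            intvl;      /* expiration in seconds; 0 disables the cache */
};

/* Parse an interval string; fall back to the default on missing or bad input. */
static uint64_t sketch_parse_intvl(const char *s)
{
    char *end;
    unsigned long long v;

    if (s == NULL || *s < '0' || *s > '9')
        return SKETCH_DEFAULT_INTVL;
    v = strtoull(s, &end, 10);
    if (*end != '\0')
        return SKETCH_DEFAULT_INTVL;
    return (uint64_t)v;
}

/* Read the interval from the environment (unit assumed to be seconds). */
static uint64_t sketch_cache_intvl(void)
{
    return sketch_parse_intvl(getenv("DAOS_POOL_SPACE_CACHE_INTVL"));
}

/* Stand-in for the expensive space query; counts how often it runs. */
static int sketch_fanout_calls;

static void sketch_fanout_query(struct sketch_space *out)
{
    sketch_fanout_calls++;
    out->total = 100;
    out->free = 40;
}

/* Serve from the cache while it is fresh; otherwise refresh it. */
static void sketch_space_query(struct sketch_space_cache *c, uint64_t now,
                               struct sketch_space *out)
{
    assert(c != NULL && out != NULL);
    if (c->intvl != 0 && c->valid && now - c->fetched_at < c->intvl) {
        *out = c->space;
        return;
    }
    sketch_fanout_query(out);
    c->space = *out;
    c->fetched_at = now;
    c->valid = true;
}
```

A caller would initialize intvl once from sketch_cache_intvl() and pass a monotonic timestamp as now. With intvl set to zero every call falls through to the expensive query, which corresponds to the cache being disabled.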

Signed-off-by: Li Wei <[email protected]>
Signed-off-by: Niu Yawei <[email protected]>
Co-authored-by: Xuezhao Liu <[email protected]>
Co-authored-by: Liang Zhen <[email protected]>
Co-authored-by: Dalton Bohning <[email protected]>
Tag first release candidate for 2.6.3.

Signed-off-by: Dalton Bohning <[email protected]>
…le/2.6

Change-Id: Ibf7e11fd2917bc9758fdcf94d819420bfe20edbe
Signed-off-by: Jeff Olivier <[email protected]>
Errors are: component not formatted correctly, ticket number prefix incorrect, PR title is malformatted (see https://daosio.atlassian.net/wiki/spaces/DC/pages/11133911069/Commit+Comments), unable to load ticket data.
https://daosio.atlassian.net/browse/Merge

@jolivier23 jolivier23 merged commit fe0e820 into google/2.6 Jan 27, 2025
48 of 52 checks passed
@jolivier23 jolivier23 deleted the jeffolivier/google/2.6 branch January 27, 2025 21:48