DAOS-16685 dfuse: Change event queue poll to use NOWAIT. #15377
PR-repos: mercury@PR-125:4 Test-provider: ucx+dc_x Skip-build-leap15-icc: true Skip-build-el8-gcc: true Skip-func-test-vm: true Skip-unit-tests: true Skip-func-hw-test-medium-verbs-provider: true Skip-func-hw-test-medium-ucx-provider: false Required-githooks: true Signed-off-by: Joseph Moore <[email protected]>
Ticket title is 'LRZ: Linux commands over dfuse hanging (crt_hg_req_send_cb, obj_shard_coll_query_cb timeouts)'
PR-repos: mercury@PR-125:4 Test-provider: ucx+dc_x Skip-build-leap15-icc: true Skip-build-el8-gcc: true Skip-func-test-vm: true Skip-unit-tests: true Skip-func-hw-test-medium-verbs-provider: true Skip-func-hw-test-medium-ucx-provider: false Features: dfuse Required-githooks: true Signed-off-by: Joseph Moore <[email protected]>
Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15377/4/execution/node/922/log
PR-repos: mercury@PR-125:5 Test-provider: ucx+ud_x Skip-build-leap15-icc: true Skip-build-el8-gcc: true Skip-func-test-vm: true Skip-unit-tests: true Skip-func-hw-test-medium-verbs-provider: true Skip-func-hw-test-medium-ucx-provider: false Required-githooks: true Signed-off-by: Joseph Moore <[email protected]>
Test stage Functional Hardware Medium UCX Provider completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15377/6/execution/node/876/log
Required-githooks: true Signed-off-by: Joseph Moore <[email protected]>
PR-repos: mercury@PR-125:5 Test-provider: ucx+ud_x Skip-build-leap15-icc: true Skip-build-el8-gcc: true Skip-func-test-vm: true Skip-unit-tests: true Skip-func-hw-test-medium-verbs-provider: true Skip-func-hw-test-medium-ucx-provider: false Features: dfuse Required-githooks: true Signed-off-by: Joseph Moore <[email protected]>
Test stage Functional Hardware Large completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15377/8/execution/node/878/log
Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15377/8/execution/node/1019/log
Test stage Functional Hardware Large completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15377/9/execution/node/829/log
Test stage Functional Hardware Medium UCX Provider completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15377/9/execution/node/875/log
Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15377/9/execution/node/921/log
Skip-func-hw-test-medium-ucx-provider: false Features: dfuse Required-githooks: true Signed-off-by: Joseph Moore <[email protected]>
src/client/dfuse/dfuse_core.c
@@ -32,7 +32,6 @@ dfuse_progress_thread(void *arg)
cont:
errno = 0;
rc = sem_wait(&eqt->de_sem);
Can you remove this change, please?
Yep, leftover from some changes I made for debugging.
Skip-func-hw-test-medium-ucx-provider: false Features: dfuse Required-githooks: true Signed-off-by: Joseph Moore <[email protected]>
Test stage Functional Hardware Large completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-15377/13/testReport/
Test stage Functional Hardware Medium UCX Provider completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15377/13/execution/node/1563/log
Test stage Functional Hardware Medium UCX Provider completed with status FAILURE. https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-15377/14/display/redirect
Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-15377/14/display/redirect
Test stage Functional Hardware Medium Verbs Provider completed with status FAILURE. https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-15377/14/display/redirect
Test stage Functional Hardware Large completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-15377/14/testReport/
lgtm, but we should keep the ticket open (or add a new one) to investigate why the version with DAOS_EQ_WAIT causes problems. I am concerned about things magically fixing themselves like this when the WAIT variant should have worked fine to start with.
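For readers following the WAIT versus NOWAIT distinction raised above, here is a rough single-threaded sketch of the two polling modes. Everything in it (`mock_eq`, `mock_eq_poll`, `POLL_WAIT`, `POLL_NOWAIT`, `mock_eq_complete_one`) is a hypothetical stand-in invented for illustration; it is not the DAOS API and not the code changed by this PR. It only assumes the general semantics that a non-blocking poll returns immediately with zero events when nothing has completed, while a blocking poll would sleep until something does.

```c
#include <assert.h>

/* Hypothetical stand-ins for illustration; none of these names come from DAOS. */
enum poll_mode { POLL_NOWAIT, POLL_WAIT };

struct mock_eq {
	int completed; /* events finished and ready to be reaped */
	int inflight;  /* events submitted but not yet finished */
};

/* Simulate the completion of one in-flight event. */
void mock_eq_complete_one(struct mock_eq *eq)
{
	if (eq->inflight > 0) {
		eq->inflight--;
		eq->completed++;
	}
}

/*
 * Reap up to `max` completed events and return how many were reaped.
 * With POLL_NOWAIT an empty queue yields 0 immediately.
 * With POLL_WAIT an empty queue returns -1 here, marking the point at
 * which a real blocking poll would put the calling thread to sleep.
 */
int mock_eq_poll(struct mock_eq *eq, enum poll_mode mode, int max)
{
	int n;

	assert(max > 0);

	if (eq->completed == 0)
		return (mode == POLL_NOWAIT) ? 0 : -1;

	n = eq->completed < max ? eq->completed : max;
	eq->completed -= n;
	return n;
}
```

In this mock, a caller using `POLL_NOWAIT` regains control even when no event is ready and can decide what to do next, whereas a caller using `POLL_WAIT` depends on a completion eventually arriving. Why the blocking variant misbehaved in practice is the open question for the follow-up ticket.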
this is tested here:
PR-repos: mercury@PR-125:4
Test-provider: ucx+dc_x
Skip-build-leap15-icc: true
Skip-build-el8-gcc: true
Skip-func-test-vm: true
Skip-unit-tests: true
Skip-func-hw-test-medium-verbs-provider: true
Skip-func-hw-test-medium-ucx-provider: false
Required-githooks: true
Before requesting gatekeeper:
Features: (or Test-tag*) commit pragma was used or there is a reason documented that there are no appropriate tags for this PR.
Gatekeeper: