Support timeout based search request cancellation #817

sohami · 2021-06-04T03:05:21Z

Is your feature request related to a problem? Please describe.
Currently the optional "timeout" parameter in the SearchRequest applies to the individual child shard level search requests not at the parent search request. The shard request to a node is sent in multiple batches based on "maxConcurrentRequestsPerNode" parameter. So if a search request results in sending N such batches, the parent request timeout will essentially be N*batchNumber. Also the timeout is only honored in query phase and not in Fetch phase. If there is a long running search for which client doesn't want to wait for the result anymore, they have to use the task API to cancel such request. In cases, when user doesn't initiate the cancellation the previous search will still be consuming the cluster resources until it completes.

Describe the solution you'd like
The proposal is to have a separate parameter in search request like "cancel_after_timeinterval" which can be set by the user both at request level and at cluster level. Based on this new parameter, after the timeout expiry the search requests will be cancelled automatically using the cancellation framework. This will help: 1) to reduce the wasted resource usage. 2) automatic cancellation mechanism for the search request, where client doesn't have to explicitly use the task API to cancel it.

Describe alternatives you've considered

Use the existing timeout parameter in search request instead of new parameter. This may create confusion for current users because of change in behavior and may break certain client applications. Current timeout behavior is to return partial results, however with cancellation no results will be returned since fetch phase will error out for cancelled search requests.

Additional context
Add any other context or screenshots about the feature request here.

Bukhtawar · 2021-06-17T12:25:34Z

I guess we should also have a mechanism to support partial results we have gotten before the timeout unless
allow_partial_search_results is set to false

sohami · 2021-07-02T06:26:50Z

@Bukhtawar: Cancellation mechanism will be helpful to terminate the workload in events when it is of no use to the client and it wants to stop the workload from consuming any more resources ASAP. If client submits a cancellation request for a task externally using task API then also the search request is failed in fetch phase irrespective of allow_partial_search_results flag. I would prefer to keep the behavior same in both the use cases.

To provide more context, the allow_partial_search_results flag is only honored in query phase after getting results (docId) from all or subset of the shards. If the search request is cancelled before fetch phase then even send of fetch request from coordinator to shards will be failed with task cancellation exception.

AmiStrn · 2021-07-02T14:54:33Z

I was looking into a similar task recently, I support this requirement 100%. What do we need to get started on this? @sohami are you in the process of writing the code for this yet?

sohami · 2021-07-02T21:18:22Z

@AmiStrn - yes. I was waiting on some initial feedback. Will raise the PR as soon as I am done

sohami · 2021-07-19T20:44:07Z

@AmiStrn - I am dividing the change into 2 separate PRs. First one introduces the request level parameter support. Second one will consume the parameter to schedule cancellation task (WIP). Would be great if you can help review this.

AmiStrn · 2021-07-19T20:48:39Z

Would be great if you can help review this.

Gladly:)

Bukhtawar · 2021-07-19T20:55:29Z

@sohami I don't think size 0 aggregations have a fetch phase. Does partial results make sense here

I guess it should work out of the box since we plan on simply cancelling

sohami · 2021-07-19T22:56:48Z

@Bukhtawar - Yes in cases when there is no fetch phase and some of the shards successfully completed the execution before cancellation, then partial results will be returned if allow_partial_search_results is set to true. But note that the cancellation on timeout is basically an intent from client that it is not waiting on any results anymore and just wants the request to terminate. In case client wants to get the partial results, then current search timeout parameter should be used. That will enforce timeout only in search phase and irrespective of query type with size >=0, it will return the partial results.

AmiStrn · 2021-07-27T16:54:06Z

@dblock how do we get assignee's from the maintainers to review and approve PR's such as #986 ? (I had commented on the PR, but have not approved since I'm not a maintainer)

edit: didn't notice that there is quite some lag with the PRs :) Please let us know what to expect in terms of a timeline on this.

dblock · 2021-07-27T18:35:09Z

@dblock how do we get assignee's from the maintainers to review and approve PR's such as #986 ? (I had commented on the PR, but have not approved since I'm not a maintainer)
edit: didn't notice that there is quite some lag with the PRs :) Please let us know what to expect in terms of a timeline on this.

Can't promise an SLA rn, but we do also have some automation that is nagging maintainers that PRs are open for longer than we would like without action. For now, if you feel that no traction has been had on an issue, feel free to tag me and I'll go find someone to review.

dblock · 2021-07-27T18:38:46Z

edit: didn't notice that there is quite some lag with the PRs :) Please let us know what to expect in terms of a timeline on this.

While you're not a maintainer you did a solid review of that PR, so nobody (including myself) felt the need to jump in. Thank you. I will click buttons after the next iteration to get more of the tests to run, and take a closer look at the code as well if needed.

dblock · 2021-08-12T15:04:33Z

#986 implements much of this, leaving to @sohami to close when you feel like it does everything you want or close and open smaller issues for the max_-related business.

kkewwei · 2024-04-05T09:58:36Z

I guess we should also have a mechanism to support partial results we have gotten before the timeout unless allow_partial_search_results is set to false

@sohami @AmiStrn we also have the same need: OS should returns partial results after the timeout in coordinate node. The ideal situation is that the coordinate node returns partial results(if allow_partial_search_results is true) and send cancel request when timeout. If possible, I would like to try to implement it.

…pensearch-project#817) * Bump org.owasp.dependencycheck from 9.0.8 to 9.0.9 in /java-client Bumps org.owasp.dependencycheck from 9.0.8 to 9.0.9. --- updated-dependencies: - dependency-name: org.owasp.dependencycheck dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <[email protected]> * Update changelog Signed-off-by: dependabot[bot] <[email protected]> --------- Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: dependabot[bot] <dependabot[bot]@users.noreply.github.com>

kkewwei · 2024-06-26T09:19:37Z

@sohami, @AmiStrn please have a look, whether it should be supported, I'm pleasure to implement it.

AmiStrn · 2024-07-08T05:37:25Z

i think it should be supported. sorry i was not more clear about that, i did a 👍 on your commend in agreement with it. I think @sohami is the one to say if they are not going to work on this.
@sohami are you actively working on this task?

kkewwei · 2024-09-30T02:29:16Z

i think it should be supported. sorry i was not more clear about that, i did a 👍 on your commend in agreement with it. I think @sohami is the one to say if they are not going to work on this. @sohami are you actively working on this task?

@sohami, If you are not working on it, I will try to implement it?

kkewwei · 2024-11-19T09:16:46Z

@Bukhtawar @AmiStrn @sohami, please have a look when you are free. #16681

sohami added the enhancement Enhancement or improvement to existing feature or request label Jun 4, 2021

sohami mentioned this issue Jul 19, 2021

Part 1: Support for cancel_after_timeinterval parameter in search and msearch request #986

Merged

2 tasks

sohami mentioned this issue Aug 12, 2021

Part 1: Support for cancel_after_time_interval parameter in search and… #1085

Merged

2 tasks

sohami mentioned this issue Jan 7, 2022

Add support for cancel_after_time_interval opensearch-project/opensearch-api-specification#273

Closed

jed326 mentioned this issue Jun 28, 2023

Add early termination support for concurrent segment search #8306

Merged

6 tasks

sohami mentioned this issue Mar 29, 2024

[Feature Request] cancel_after_time_interval and timeout parameters should remain one #11642

Open

kkewwei linked a pull request Nov 19, 2024 that will close this issue

Coordinator can return partial results after the timeout when allow_partial_search_results is true #16681

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support timeout based search request cancellation #817

Support timeout based search request cancellation #817

sohami commented Jun 4, 2021

Bukhtawar commented Jun 17, 2021

sohami commented Jul 2, 2021

AmiStrn commented Jul 2, 2021

sohami commented Jul 2, 2021

sohami commented Jul 19, 2021 •

edited

Loading

AmiStrn commented Jul 19, 2021

Bukhtawar commented Jul 19, 2021 •

edited

Loading

sohami commented Jul 19, 2021

AmiStrn commented Jul 27, 2021 •

edited

Loading

dblock commented Jul 27, 2021

dblock commented Jul 27, 2021

dblock commented Aug 12, 2021

kkewwei commented Apr 5, 2024 •

edited

Loading

kkewwei commented Jun 26, 2024

AmiStrn commented Jul 8, 2024 •

edited

Loading

kkewwei commented Sep 30, 2024

kkewwei commented Nov 19, 2024

Support timeout based search request cancellation #817

Support timeout based search request cancellation #817

Comments

sohami commented Jun 4, 2021

Bukhtawar commented Jun 17, 2021

sohami commented Jul 2, 2021

AmiStrn commented Jul 2, 2021

sohami commented Jul 2, 2021

sohami commented Jul 19, 2021 • edited Loading

AmiStrn commented Jul 19, 2021

Bukhtawar commented Jul 19, 2021 • edited Loading

sohami commented Jul 19, 2021

AmiStrn commented Jul 27, 2021 • edited Loading

dblock commented Jul 27, 2021

dblock commented Jul 27, 2021

dblock commented Aug 12, 2021

kkewwei commented Apr 5, 2024 • edited Loading

kkewwei commented Jun 26, 2024

AmiStrn commented Jul 8, 2024 • edited Loading

kkewwei commented Sep 30, 2024

kkewwei commented Nov 19, 2024

sohami commented Jul 19, 2021 •

edited

Loading

Bukhtawar commented Jul 19, 2021 •

edited

Loading

AmiStrn commented Jul 27, 2021 •

edited

Loading

kkewwei commented Apr 5, 2024 •

edited

Loading

AmiStrn commented Jul 8, 2024 •

edited

Loading