GRPC Document API benchmark #16711

amberzsy · 2024-11-23T06:24:48Z

Please describe the end goal of this project

Based on the GRPC document API implementation, benchmark the various document APIs under different scenarios.

Supporting References

#15190

Issues

Bulk API benchmark.
Index document benchmark.
Update document benchmark.
Get document benchmark.

Related component

Indexing

amberzsy · 2024-12-09T19:28:02Z

assigned to @karenyrx.

dblock · 2024-12-16T17:06:43Z

[Catch All Triage - 1, 2, 3]

karenyrx · 2025-01-02T21:26:23Z

Results

Latency benchmark results from testing on a cluster with a single data node and a small GRPC payload:

Summary

GRPC latency of the first request is 1.2 - 1.25x higher than HTTP.
GRPC latency of the subsequent requests is 1.5 - 5x lower than HTTP.
For both GRPC and HTTP, CPU, memory, and IO usage were similar, based on internal dashboard observations.
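
To make the first-request vs. subsequent-request comparison above concrete, here is a minimal sketch of how the two numbers could be separated in a client-side harness. It is not the harness used for these results. `send_request` is a hypothetical callable standing in for one GRPC or HTTP call, and using the median for subsequent requests is an assumption.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latencies(send_request: Callable[[], None], n_requests: int = 20) -> Dict[str, float]:
    """Time n_requests sequential calls; report the first separately from the rest."""
    if n_requests < 2:
        raise ValueError("need at least two requests to separate first from subsequent")
    samples_ms: List[float] = []
    for _ in range(n_requests):
        start = time.perf_counter()
        send_request()  # hypothetical: one GRPC or HTTP request
        samples_ms.append((time.perf_counter() - start) * 1000.0)
    return {
        # The first request typically includes connection setup.
        "first_ms": samples_ms[0],
        # Assumption: the median summarizes the remaining requests.
        "subsequent_median_ms": statistics.median(samples_ms[1:]),
    }


def latency_ratio(a_ms: float, b_ms: float) -> float:
    """Return how many times larger a_ms is than b_ms."""
    return a_ms / b_ms
```

Running `measure_latencies` once per protocol and passing the matching fields to `latency_ratio` gives figures in the same "Nx higher/lower" form as the summary.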

Next steps:

1. Confirm where the improvements come from: HTTP2, Protobuf, or something else. To do so:
   i) Instrument the codebase with detailed metrics to break down the latency of GRPC vs. HTTP internal operations.
   ii) Confirm whether the results align with the HTTP2 + JSON benchmarking results (to be performed by @reta).
2. Test with a larger payload (e.g. a 5MB request) to gain more confidence.
3. Submit a PR for Protobuf and GRPC support for the Bulk endpoint in OpenSearch (in parallel with steps 1 and 2).
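
For the larger-payload test, a rough sketch of generating an HTTP-side bulk body of a chosen size follows. It assumes the standard NDJSON layout of the `_bulk` endpoint (an action line followed by a document line, each newline-terminated). The index name, document fields, and `build_bulk_body` itself are hypothetical, and the equivalent GRPC request message is not shown because the thread does not specify it.

```python
import json
from typing import Tuple


def build_bulk_body(index_name: str, target_bytes: int, field_bytes: int = 1024) -> Tuple[bytes, int]:
    """Build an NDJSON _bulk body of at least target_bytes; return (body, doc_count)."""
    # Action line shared by every document (hypothetical index name).
    action = json.dumps({"index": {"_index": index_name}})
    pairs = []
    size = 0
    doc_count = 0
    while size < target_bytes:
        # Hypothetical document shape: an id plus filler text.
        doc = json.dumps({"id": doc_count, "text": "x" * field_bytes})
        pair = f"{action}\n{doc}\n"
        pairs.append(pair)
        size += len(pair.encode("utf-8"))
        doc_count += 1
    return "".join(pairs).encode("utf-8"), doc_count
```

For example, `build_bulk_body("bench", 5 * 1024 * 1024)` yields a body of at least 5MB that ends with the trailing newline the bulk format requires.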

mgodwan · 2025-01-06T15:29:28Z

@karenyrx Thanks for the benchmarks. The idea is indeed promising.
I would like to understand and discuss how you plan to map the document schema using Protobuf, and how dynamic mappings would work in such scenarios.

amberzsy added the Meta (Meta issue, not directly linked to a PR) and untriaged labels Nov 23, 2024

opensearch-infra bot added this to OpenSearch Roadmap Nov 23, 2024

github-project-automation bot moved this to New in OpenSearch Roadmap Nov 23, 2024

finnegancarroll mentioned this issue Nov 25, 2024

[META] gRPC Server #16556

Closed

1 task

mch2 added this to Performance Roadmap Nov 25, 2024

github-project-automation bot moved this to Todo in Performance Roadmap Nov 25, 2024

amberzsy mentioned this issue Dec 5, 2024

[META] Productionalizing Client/Server GRPC in OpenSearch #16787

Open

18 tasks

amberzsy changed the title ~~[META] GRPC poc and benchmark - [Indexing]~~ GRPC Document API benchmark Dec 5, 2024

dblock added Search:Performance and removed untriaged labels Dec 16, 2024

peterzhuamazon added this to Search Project Board Dec 19, 2024

github-project-automation bot moved this to 🆕 New in Search Project Board Dec 19, 2024

finnegancarroll mentioned this issue Dec 23, 2024

[META] Add Production-Ready Features to gRPC Transport Plugin #16906

Closed

1 task
