
Bulk import performance reported incorrectly #4135

Closed
patchwork01 opened this issue Jan 24, 2025 · 0 comments · Fixed by #4145
Assignees
Labels
bug Something isn't working
Milestone

Comments

@patchwork01 (Collaborator) commented Jan 24, 2025

Description

Before the change, EmrBulkImportPerformanceST reliably achieved over 3.5 million records per second; it is now around 2.2 million records per second. The test has failed on every performance run since the morning of 22nd January. The last successful performance test was on the morning of 20th January.

The PR where this was changed also stopped setting the maximum connections to S3 explicitly in the system tests. It was previously set to 25 connections; it now uses the default of 100. This could have caused the problem. UPDATE: After retesting on m7i, the performance was still at 2.2 million records/s, and it was still the same with 25 max connections on m7g. It seems the move to Graviton and the max connections change were not the problem.

There was also a PR at a similar time that changed how the start and finish times are reported in the job tracker. This seems likely to have caused the problem. Instead of taking the times the job started and finished in the Spark driver, it appears to take the start time to be the time the job was received in the starter lambda.

Steps to reproduce

  1. Run EmrBulkImportPerformanceST
  2. See error

Expected behaviour

The test should pass.

For performance calculation, the reporting code should count the time a bulk import job started as the time it started in the Spark driver, rather than the time it was accepted in the job starter lambda.
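To illustrate why the choice of start time matters, here is a minimal, hypothetical sketch (the class and method names are illustrative, not Sleeper's actual reporting code, and the record count and timestamps are made up). Measuring from the lambda receive time includes cluster startup delay in the elapsed time, which deflates the computed rate:

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical sketch of the rate calculation. The rate should be computed
// from the Spark driver's own start and finish times, not from the time the
// job was accepted by the job starter lambda.
public class BulkImportRateSketch {

    static double recordsPerSecond(long recordsWritten, Instant start, Instant finish) {
        double seconds = Duration.between(start, finish).toMillis() / 1000.0;
        return recordsWritten / seconds;
    }

    public static void main(String[] args) {
        // Illustrative timestamps: the EMR cluster takes time to start,
        // so the driver begins well after the lambda receives the job.
        Instant lambdaReceived = Instant.parse("2025-01-22T09:00:00Z");
        Instant driverStart = Instant.parse("2025-01-22T09:05:00Z");
        Instant driverFinish = Instant.parse("2025-01-22T09:10:00Z");
        long records = 1_050_000_000L;

        // Measured from the lambda receive time, the elapsed time is inflated
        // and the reported rate drops, even though the actual import is unchanged.
        double fromLambda = recordsPerSecond(records, lambdaReceived, driverFinish);
        double fromDriver = recordsPerSecond(records, driverStart, driverFinish);
        System.out.printf("from lambda: %.0f records/s, from driver: %.0f records/s%n",
                fromLambda, fromDriver);
    }
}
```

With these made-up numbers, the driver-based measurement gives 3.5 million records/s, while the lambda-based measurement reports only 1.75 million records/s for the same job, which is the shape of the regression described above.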

Background

Possibly introduced by:

Here are all the PRs that were merged between the passing test and the failing test:

@patchwork01 patchwork01 added the bug Something isn't working label Jan 24, 2025
@patchwork01 patchwork01 added this to the 0.28.0 milestone Jan 24, 2025
@patchwork01 patchwork01 self-assigned this Jan 24, 2025
@patchwork01 patchwork01 changed the title Bulk import performance test failing after switch to Graviton Bulk import performance reported incorrectly Jan 27, 2025