wrccdc "delete -where" lake operation sees 43% perf hit at super#5502 changes #5578

Open
philrz opened this issue Jan 15, 2025 · 0 comments

philrz commented Jan 15, 2025

tl;dr

In the SuperDB data lake operations run in Autoperf, the step change shown here in the wrccdc "delete -where" queries is correlated with the arrival of the changes in #5502.

Image

Details

The run time of this lake maintenance operation went from 1.29 minutes to 1.86 minutes, a 43% performance hit. As usual when a shift like this shows up in the macro trend, it's likely an entirely appropriate cost for the other benefits of the change. However, since no anticipated perf hit was called out in the PR notes, I've opened this issue to document the transition in case anyone wants to look closer.
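For reference, here is a minimal sketch of how the slowdown percentage can be derived from the two run times. The function name `percent_slowdown` is hypothetical, and the inputs are the rounded values quoted above. Those rounded values yield roughly 44%, so the 43% figure presumably comes from the unrounded timings (an assumption, since the raw numbers are not shown here).

```python
def percent_slowdown(before: float, after: float) -> float:
    """Return the increase in run time as a percentage of the baseline."""
    if before <= 0:
        raise ValueError("baseline run time must be positive")
    return (after - before) / before * 100.0


# Rounded run times in minutes, as quoted in this issue.
before_min = 1.29
after_min = 1.86

print(f"{percent_slowdown(before_min, after_min):.1f}% slower")  # prints "44.2% slower"
```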

The exact operation against the full wrccdc data set looks like:

super db delete -use wrccdc -where '_path=="loaded_scripts"'

At the moment Autoperf is still running against the SuperDB data lake, so the lake is still using the sequential runtime against BSUP data.
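For anyone who wants to look closer, a rough way to time the operation locally is sketched below. This is not Autoperf's actual harness, which is not shown here. The helper `time_command` and the constant `DELETE_CMD` are hypothetical names, and the sketch assumes the `super` binary is on the `PATH` with the wrccdc data already loaded.

```python
import subprocess
import time

# The command quoted in this issue, split into an argument list.
DELETE_CMD = [
    "super", "db", "delete",
    "-use", "wrccdc",
    "-where", '_path=="loaded_scripts"',
]


def time_command(argv: list[str]) -> float:
    """Run argv to completion and return the elapsed wall-clock seconds."""
    start = time.perf_counter()
    # check=True raises CalledProcessError if the command exits non-zero.
    subprocess.run(argv, check=True)
    return time.perf_counter() - start
```

Calling `time_command(DELETE_CMD) / 60` gives the run time in minutes. Because the operation deletes data, each timed run needs a freshly loaded wrccdc data set for the results to be comparable.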

@philrz philrz changed the title wrccdc "delete -where" lake opertion sees 43% perf hit at super#5502 changes wrccdc "delete -where" lake operation sees 43% perf hit at super#5502 changes Jan 16, 2025