Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add the batch size to reduce the memory load during writing to Elastic Search. #1339

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

CompareSan
Copy link

@CompareSan CompareSan commented Jan 31, 2025

Related Issues

Proposed Changes:

Add a batch size to reduce the size of the elasticsearch_actions list and consequently reduce the memory load.

How did you test it?

I used memray to profile the memory load and these are the results based on indexing 500k documents versus indexing the same amount of documents but with a batch size of 5k. The memory load is 64% less. (358MB to 130MB).

Screenshot 2025-01-31 at 14 33 23 Screenshot 2025-01-31 at 14 33 43

Notes for the reviewer

Checklist

@CompareSan CompareSan requested a review from a team as a code owner January 31, 2025 13:42
@CompareSan CompareSan requested review from mpangrazzi and removed request for a team January 31, 2025 13:42
@CLAassistant
Copy link

CLAassistant commented Jan 31, 2025

CLA assistant check
All committers have signed the CLA.

@github-actions github-actions bot added type:documentation Improvements or additions to documentation integration:elasticsearch and removed type:documentation Improvements or additions to documentation labels Jan 31, 2025
@mpangrazzi
Copy link
Contributor

@CompareSan can we close this as well? See #1338

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Writing large number of documents to elastic search
3 participants