Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Huge memory allocations #100

Closed
Molkree opened this issue Sep 20, 2021 · 1 comment
Closed

Huge memory allocations #100

Molkree opened this issue Sep 20, 2021 · 1 comment

Comments

@Molkree
Copy link
Collaborator

Molkree commented Sep 20, 2021

I've just hit an unfortunate situation where OOIPy blows up RAM usage when getting one second of data.
Minimal reproducible example:

import datetime

from ooipy.request import hydrophone_request

start_time_str = "2021-09-19T12-16-49"
end_time_str = "2021-09-19T12-16-50"
node = "PC01A"

start_time = datetime.datetime.strptime(start_time_str, "%Y-%m-%dT%H-%M-%S")
end_time = datetime.datetime.strptime(end_time_str, "%Y-%m-%dT%H-%M-%S")

hydrophone_data = hydrophone_request.get_acoustic_data(
    start_time, end_time, node, verbose=True
)

In Google Colab I see ~10GB allocation and then one more ~10GB which crashes the kernel.

image

The same thing crashed our processing in GitHub VMs.

Here's why I narrowed it down to just one second of data:
image

@John-Ragland
Copy link
Member

I think that this issue is caused by how data is organized from OOI's side of things. They have actually since fixed each mseed file to be 5 minutes (and they are doing some type of gap filling).

I'm closing this for now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants