Use cupy to measure memory leak #777

bdice · 2024-09-06T18:20:49Z

While adding support for Python 3.12 in #773, we found a problem where GPUtil does not support Python 3.12 (#775).

This PR removes the dependency on GPUtil and replaces it with cupy, which is already a dependency.

Closes #775. Closes #776.

jameslamb

I'm much happier with this change than #776. Thank you!

grlee77

Agree that using cuda-python is preferable to GPUtil. We can also get this info from CuPy without adding the dependency, though (see suggestion). Let me know what you think.

grlee77 · 2024-09-06T19:16:13Z

dependencies.yaml

+            packages:
+              - cuda-python>=12.0,<13.0a0
+          - matrix: {cuda: "11.*"}
+            packages: &test_cuda_python_cu11


out of curiosity, what does this &test_cuda_python_cu11 notation do?

This is a YAML anchor. We use this to define a list that we re-use in *test_cuda_python_cu11. We need a "fallback" list when the cuda selector is not defined.

grlee77 · 2024-09-06T19:23:22Z

python/cucim/tests/performance/clara/test_read_region_memory_usage.py

+    def get_used_gpu_memory_mib():
+        """Get the used GPU memory in MiB."""
+        status, free, total = cuda.cudart.cudaMemGetInfo()
+        if status != cuda.cudart.cudaError_t.cudaSuccess:
+            raise RuntimeError("Failed to get GPU memory info.")
+        memory_used = (total - free) / (2**20)
+        return memory_used
+
+    status, num_gpus = cuda.cudart.cudaGetDeviceCount()
+    if status != cuda.cudart.cudaError_t.cudaSuccess or num_gpus == 0:


We can also potentially just use CuPy for this and avoid having to add the cuda-python dependency:

add up top:

import cupy as cp

then can use here:

def get_used_gpu_memory_mib(): """Get the used GPU memory in MiB.""" dev = cp.cuda.Device() free, total = dev.mem_info memory_used = (total - free) / (2**20) return memory_used num_gpus = cp.cuda.runtime.getDeviceCount() if num_gpus == 0:

Great. I'll implement that and remove the cuda-python dependency.

jameslamb

re-approving, awesome that the net result here is -1 dependencies 😁

grlee77

Looks great, thanks!

gigony · 2024-09-06T20:21:53Z

Thanks @bdice for fixing the issue!

jameslamb · 2024-09-06T20:36:21Z

/merge

Use cuda-python to measure memory leak

1b83eb1

bdice requested review from a team as code owners September 6, 2024 18:20

bdice requested review from jameslamb and grlee77 September 6, 2024 18:20

bdice added bug Something isn't working non-breaking Introduces a non-breaking change labels Sep 6, 2024

bdice self-assigned this Sep 6, 2024

Clarify units are MiB.

8075ba3

jameslamb approved these changes Sep 6, 2024

View reviewed changes

grlee77 requested changes Sep 6, 2024

View reviewed changes

Use cupy instead of cuda-python.

a509f2d

bdice changed the title ~~Use cuda-python to measure memory leak~~ Use cupy to measure memory leak Sep 6, 2024

bdice requested a review from grlee77 September 6, 2024 19:41

jameslamb approved these changes Sep 6, 2024

View reviewed changes

grlee77 approved these changes Sep 6, 2024

View reviewed changes

gigony approved these changes Sep 6, 2024

View reviewed changes

rapids-bot bot merged commit 73e6d93 into rapidsai:branch-24.10 Sep 6, 2024
45 checks passed

jakirkham added this to the v24.10.00 milestone Oct 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use cupy to measure memory leak #777

Use cupy to measure memory leak #777

bdice commented Sep 6, 2024 •

edited

Loading

jameslamb left a comment

grlee77 left a comment

grlee77 Sep 6, 2024

bdice Sep 6, 2024

grlee77 Sep 6, 2024

bdice Sep 6, 2024

jameslamb left a comment

grlee77 left a comment

gigony commented Sep 6, 2024

jameslamb commented Sep 6, 2024

Use cupy to measure memory leak #777

Use cupy to measure memory leak #777

Conversation

bdice commented Sep 6, 2024 • edited Loading

jameslamb left a comment

Choose a reason for hiding this comment

grlee77 left a comment

Choose a reason for hiding this comment

grlee77 Sep 6, 2024

Choose a reason for hiding this comment

bdice Sep 6, 2024

Choose a reason for hiding this comment

grlee77 Sep 6, 2024

Choose a reason for hiding this comment

bdice Sep 6, 2024

Choose a reason for hiding this comment

jameslamb left a comment

Choose a reason for hiding this comment

grlee77 left a comment

Choose a reason for hiding this comment

gigony commented Sep 6, 2024

jameslamb commented Sep 6, 2024

bdice commented Sep 6, 2024 •

edited

Loading