offload parameters during forward propagation but the CUDA memory actually be not freed #2360

zengzh95 · 2022-09-26T08:41:06Z

zengzh95
Sep 26, 2022

When I swap the parameters or activations of a module to CPU after its forward propagation, the CUDA memory actually be not freed.
But once the backward propagation of this module is performed, we can release the CUDA memory occupied by parameters or activations.

Is this related to the Autograd mechanism of PyTorch?
How can we free parameters or activations before backward propagation in PyTorch?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

offload parameters during forward propagation but the CUDA memory actually be not freed #2360

{{title}}

Replies: 0 comments

Select a reply

offload parameters during forward propagation but the CUDA memory actually be not freed #2360

zengzh95 Sep 26, 2022

Replies: 0 comments

zengzh95
Sep 26, 2022