OOM for TECO GAN #224

Open
stalagmite7 opened this issue Jun 24, 2021 · 9 comments · Fixed by #225

Comments

@stalagmite7

Seems like using even a height of 360 (while maintaining aspect ratio) for TeCoGAN gives runtime OOM errors; what's the largest size I can use to try to upscale to 4K? I imagine that if I want to upscale to 4K, I would use 1080p as the resolution for my input, but that's too big for the GPU to handle; is there a way to use only the CPU for this?

@TakuyaNarihira
Contributor

Thanks for reporting.

It's probably because the clear_buffer option is not specified in the forward() calls in the following code block.
https://github.com/sony/nnabla-examples/blob/master/GANs/tecogan/generate.py#L83-L85

With .forward(clear_buffer=True), it will aggressively release unused memory in the network.

Could you try this quickly?

```python
            pre_gen_warp.forward(clear_buffer=True)
            pre_warp.data.copy_from(pre_gen_warp.data)
        outputs.forward(clear_buffer=True)
```

We'll also verify soon that it works properly and reduces memory usage.

@stalagmite7
Author

stalagmite7 commented Jun 24, 2021 via email

@stalagmite7
Author

Tried this, got an invalid configuration error from CUDA:

```
Error during forward propagation:
  TransposeCuda <-- ERROR
Traceback (most recent call last):
  File "generate.py", line 105, in <module>
    main()
  File "generate.py", line 84, in main
    pre_gen_warp.forward(clear_buffer=True)
  File "_variable.pyx", line 564, in nnabla._variable.Variable.forward
RuntimeError: target_specific error in forward_impl
/home/gitlab-runner/builds/zxvvzZDJ/0/nnabla/builders/all/nnabla-ext-cuda/src/nbla/cuda/function/./generic/transpose.cu:184
(cudaGetLastError()) failed with "invalid configuration argument" (cudaErrorInvalidConfiguration).
```

A cursory check suggests it could be a number-of-blocks error from CUDA. I'll need to dig in further on my end later today.

@TakuyaNarihira
Contributor

TakuyaNarihira commented Jun 25, 2021

Looks like it exceeds the limit on the number of blocks. We should introduce a grid-strided loop in the CUDA kernel. I created an issue in sony/nnabla-ext-cuda#321 (let's continue there on this specific matter).
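For context, a grid-strided loop lets a fixed-size grid of threads cover an arbitrarily large array, so a kernel launch never needs more blocks than the hardware limit allows. The real fix belongs in the CUDA kernel (tracked in sony/nnabla-ext-cuda#321); the sketch below is only an illustration of the indexing pattern in plain Python, with hypothetical helper names:

```python
# Sketch of a grid-strided loop: each "thread" starts at its global index
# and advances by the total grid size, so any number of elements is covered
# by a fixed, hardware-safe number of threads.

def grid_stride_indices(thread_id, grid_size, n_elements):
    """Indices that one thread would process under a grid-stride loop."""
    return list(range(thread_id, n_elements, grid_size))

def simulate_kernel(n_elements, grid_size):
    """Check that every element is touched exactly once across all threads."""
    touched = [0] * n_elements
    for tid in range(grid_size):
        for i in grid_stride_indices(tid, grid_size, n_elements):
            touched[i] += 1
    return touched

# 10 elements covered completely by a "grid" of only 4 threads:
assert simulate_kernel(10, 4) == [1] * 10
```

The same pattern in CUDA replaces `thread_id` with `blockIdx.x * blockDim.x + threadIdx.x` and `grid_size` with `gridDim.x * blockDim.x`, which is what removes the dependence of block count on problem size.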

Btw, how long is your input video sequence?

@stalagmite7
Author

Checking back in: I know it says the fix has been deployed, but the OOM error persists. Like I asked before, what is the maximum size I can upscale a video to? I am trying 1080p -> 4K but I get OOM errors. It seems to work for smaller video sizes, so does that mean 1080p inputs won't be handled by this implementation?

@Srinidhi-Srinivasa
Contributor

Srinidhi-Srinivasa commented Feb 1, 2022

> Checking back in: I know it says the fix has been deployed, but the OOM error persists. Like I asked before, what is the maximum size I can upscale a video to? I am trying 1080p -> 4K but I get OOM errors. It seems to work for smaller video sizes, so does that mean 1080p inputs won't be handled by this implementation?

@stalagmite7, could you share more information about your computation environment?

@Srinidhi-Srinivasa
Contributor

Srinidhi-Srinivasa commented Feb 3, 2022

> Checking back in: I know it says the fix has been deployed, but the OOM error persists. Like I asked before, what is the maximum size I can upscale a video to? I am trying 1080p -> 4K but I get OOM errors. It seems to work for smaller video sizes, so does that mean 1080p inputs won't be handled by this implementation?

@stalagmite7
The following are approximate peak memory requirements for running TeCoGAN:

| Resolution | Peak memory usage (MB) |
|------------|------------------------|
| 144p       | 708                    |
| 280p       | 2816                   |
| 360p       | 4074                   |
| 480p       | 6818                   |

Please note that it may not be possible to run TeCoGAN at any resolution higher than this, even on GPUs with up to 32 GB of memory.
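As a rough back-of-the-envelope check (my own extrapolation, not an official figure from the maintainers), the table above works out to roughly 0.017-0.019 MB per input pixel, which puts a 1080p input well past 32 GB. The 16:9 widths below are assumptions, and the 280p row is omitted because its width is ambiguous:

```python
# Rough linear extrapolation from the measurements above.
# Widths assume 16:9 frames; these are estimates, not official benchmarks.

measurements = {
    (256, 144): 708,    # 144p
    (640, 360): 4074,   # 360p
    (854, 480): 6818,   # 480p
}

mb_per_pixel = [mb / (w * h) for (w, h), mb in measurements.items()]
avg = sum(mb_per_pixel) / len(mb_per_pixel)

# Estimated peak memory for a 1920x1080 input frame.
est_1080p_mb = avg * 1920 * 1080
print(f"~{avg:.4f} MB/pixel, 1080p estimate: {est_1080p_mb / 1024:.1f} GB")
```

The estimate lands above 32 GB, which is consistent with the OOM reported for 1080p inputs even after the clear_buffer fix.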

The current pre-trained weights are in NHWC (channel-last) format, which is not supported by the CPU version. However, it is possible to run inference on CPU only, by transposing the weights into NCHW format and setting the "channel_last" flag to "False" in the PF.conv functions.
The following are reference codes for that:
Memory-Layout-Conversion
convert_parameter_format.py
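The core of that conversion is a transpose of each convolution weight from a channel-last layout to a channel-first one; the linked convert_parameter_format.py script does this for the real parameter files, but the idea can be sketched with NumPy. The (O, H, W, I) axis order below is an assumption for illustration:

```python
import numpy as np

# Illustration only: convert one conv weight from an assumed channel-last
# layout (out, height, width, in) to channel-first (out, in, height, width).
# The repo's convert_parameter_format.py applies this to every parameter.

def nhwc_weight_to_nchw(w_nhwc):
    return np.ascontiguousarray(np.transpose(w_nhwc, (0, 3, 1, 2)))

# Hypothetical weight: 16 output maps, 3x3 kernel, 4 input maps.
w = np.arange(16 * 3 * 3 * 4, dtype=np.float32).reshape(16, 3, 3, 4)
w_chw = nhwc_weight_to_nchw(w)
assert w_chw.shape == (16, 4, 3, 3)
```

After converting the weights, the model definition must also be built with channel_last=False so the layouts agree at inference time.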

@stalagmite7
Author

Sorry it took me so long; the GPU is an Nvidia 3060 Ti. The input video, as I mentioned, was 1080p; you're saying this is too high for TeCoGAN to process, then?

@Srinidhi-Srinivasa
Contributor

> Sorry it took me so long; the GPU is an Nvidia 3060 Ti. The input video, as I mentioned, was 1080p; you're saying this is too high for TeCoGAN to process, then?

Yes.
