-
Notifications
You must be signed in to change notification settings - Fork 103
ColossalAI cannot run the shufflenet_v2_x1_0 model as torch do #139
Comments
Hi, could you provide your training code for us to reproduce this bug? Besides, could you double-check your dataset settings? |
I have tried our code with a simple change of model from resnet to shufflenet. It takes about 32521MiB with |
Hi @songyuc, you can uninstall your current
There was a bug in previous release that takes up extra GPU memory. With our latest version, |
Thank you for the guide! I will try it later. |
🐛 Describe the bug
models.shufflenet_v2_x1_0
can be trained withBATCH_SIZE = 16384
, which cannot be run successfully with ColossalAI.The information is below:
Environment
CUDA: 11.4
The text was updated successfully, but these errors were encountered: