-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RuntimeError: Mismatch in Input Channels for Convolution Layer #4
Comments
From the error message, it seems that the issue is caused by a mismatch in the model architecture. We can investigate this from the following two perspectives:
Additionally, to better assist you in diagnosing the issue, could you please provide more error screenshots? This will help me pinpoint the exact cause of the problem. For your information, if you are trying to perform text-to-video (T2V) tasks, I plan to release LoRA fine-tuning and full fine-tuning support for CogVideo 1.5 in the next version. This will allow you to fine-tune the model for T2V tasks effectively. |
Thank you for your response, the issue has been resolved. |
Thanks for the response! I wasn’t aware that I can't use images. Is it generally the case that images cannot be used as a dataset for Loras in video models? I'm a newcomer, and it's really hard to grasp these connections right away. |
During model training, a runtime error occurs due to a channel mismatch in a convolutional layer:
RuntimeError: Given groups=1, weight of size [3072, 32, 2, 2], expected input[4, 33, 32, 32] to have 32 channels, but got 33 channels instead
This error indicates that the model expects input with 32 channels but receives input with 33 channels instead, causing the training process to fail.
Already Checked
The dataset consists of 512x512 images with 24-bit depth and has been verified to have 3 channels (RGB). All images meet these criteria.
The image sizes and channels were re-verified using a script, and they match the expected specifications.
Question
What steps can be taken to identify the root cause of this channel mismatch? Could this issue be related to model configuration or preprocessing steps? Any guidance on resolving this would be greatly appreciated.
The text was updated successfully, but these errors were encountered: