RuntimeError: The size of tensor a (64) must match the size of tensor b (65) at non-singleton dimension 2 #12

michael554466 · 2024-05-08T12:10:58Z

When I try to run the chatbot.py, it spits out this error when trying to generate the response:

Traceback (most recent call last):
File "L:\Projects\Python\GitHub-Repos\fcc-intro-to-llms\chatbot.py", line 199, in
generated_chars = decode(m.generate(context.unsqueeze(0), max_new_tokens=150)[0].tolist())
File "L:\Projects\Python\GitHub-Repos\fcc-intro-to-llms\chatbot.py", line 176, in generate
logits, loss = self.forward(index_cond)
File "L:\Projects\Python\GitHub-Repos\fcc-intro-to-llms\chatbot.py", line 156, in forward
x = self.blocks(x) # (B,T,C)
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\container.py", line 217, in forward
input = module(input)
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "L:\Projects\Python\GitHub-Repos\fcc-intro-to-llms\chatbot.py", line 121, in forward
y = self.sa(x)
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "L:\Projects\Python\GitHub-Repos\fcc-intro-to-llms\chatbot.py", line 88, in forward
out = torch.cat([h(x) for h in self.heads], dim=-1) # (B, T, F) -> (B, T, [h1, h1, h1, h1, h2, h2, h2, h2, h3, h3, h3, h3])
File "L:\Projects\Python\GitHub-Repos\fcc-intro-to-llms\chatbot.py", line 88, in
out = torch.cat([h(x) for h in self.heads], dim=-1) # (B, T, F) -> (B, T, [h1, h1, h1, h1, h2, h2, h2, h2, h3, h3, h3, h3])
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\module.py", line 1532, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "L:\Projects\Python\cuda\lib\site-packages\torch\nn\modules\module.py", line 1541, in _call_impl
return forward_call(*args, **kwargs)
File "L:\Projects\Python\GitHub-Repos\fcc-intro-to-llms\chatbot.py", line 67, in forward
wei = wei.masked_fill(self.tril[:T, :T] == 0, float('-inf')) # (B, T, T)
RuntimeError: The size of tensor a (64) must match the size of tensor b (65) at non-singleton dimension 2

Any idea on how to fix this plz?
Gpu: 4070 12gb
torch: 2.3.0+cu121

ahmadsm1 · 2024-05-13T19:54:23Z

Did you get this working by any chance?

vVlLaAdD05 · 2024-08-06T09:59:11Z

This worked for me:

In the GPTModel class, generator function, do you have the index_cond variable?
index_cond = index[:, -block_size:]
logits, loss = self.forward(index_cond)

For me, the error occurred because the T dimension, when running this code: index = torch.cat((index, next_index), dim=1) #(B, T+1), exceeded the block_size.
In your example, T reaches the value 65, but your block_size is 64. However, T must NOT exceed block_size.

Check this and reply with an update to your issue.
I hope this helps you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RuntimeError: The size of tensor a (64) must match the size of tensor b (65) at non-singleton dimension 2 #12

RuntimeError: The size of tensor a (64) must match the size of tensor b (65) at non-singleton dimension 2 #12

michael554466 commented May 8, 2024

ahmadsm1 commented May 13, 2024

vVlLaAdD05 commented Aug 6, 2024 •

edited

Loading

RuntimeError: The size of tensor a (64) must match the size of tensor b (65) at non-singleton dimension 2 #12

RuntimeError: The size of tensor a (64) must match the size of tensor b (65) at non-singleton dimension 2 #12

Comments

michael554466 commented May 8, 2024

ahmadsm1 commented May 13, 2024

vVlLaAdD05 commented Aug 6, 2024 • edited Loading

vVlLaAdD05 commented Aug 6, 2024 •

edited

Loading