Earlier I was able to run inference with the previous code at batch sizes up to 64, but now it fails even at batch size 16. What do I need to change to run it as before?
The earlier version of the text-generation code (run_generation.py) did not have flash attention, and it had an argument called attn_softmax_bf16. I don't know exactly why I can no longer run batch sizes beyond 16.
I only ran the commands that were in the README.
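For reference, the invocation was along these lines. This is a reconstruction from memory of the README, not the exact command; the model name and all flags other than --batch_size and --attn_softmax_bf16 are assumptions:

```bash
# Hypothetical reconstruction of the README command for run_generation.py.
# The model name and the flags other than --batch_size / --attn_softmax_bf16
# are assumptions and may differ from the README actually used.
python run_generation.py \
    --model_name_or_path meta-llama/Llama-2-7b-hf \
    --use_hpu_graphs \
    --use_kv_cache \
    --bf16 \
    --attn_softmax_bf16 \
    --max_new_tokens 100 \
    --batch_size 64
```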
The attached file error.txt contains the error I got while running.