-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Slow speed on Windows #10
Comments
That's very slow and suspect it is not using your GPU. On my system (Apple MBP M2 Max with 96 GiB RAM), memory usage at 4096 token context length is 15.16 GiB, which would fit entirely within your 24 GiB 4090. |
I did install the appropriate GPU torch and Task Manager shows it is the GPU and not the CPU being used. Task Manager also shows dedicated GPU memory is 21.9/24.0 so not maxed out there. For the install I basically use these pip commands to get the requirements, gradio, and then swap CPU torch out for GPU torch.
|
For any other WIndows users (or to help test this issue) here is an install.bat and run.bat. Save them both to an empty directory, command prompt into that directory, run install.bat, then run run.bat to start it. install.bat
run.bat
|
Testing the first example prompt gives this error after clicking it
because it does not put the prompt text into the "Type a message" field? If I reload the UI and manually type the prompt "Create a 3D model of a wooden hammer" into the "Type a message" field it does then start without error. |
There are two fields of pre-written prompts, ones above the entry box and ones below. The ones above give me an error, but the ones below seem to work. |
Someone posted then deleted a suggestion to try flash-attn. |
Are you using |
How do I set that? I do not see either in app.py. |
Try this: Unfortunately I get an OOM error after switching to bf16 on a 4070 ti =( |
I run on NVIDIA L20 GPUs, it does not work! |
How fast is this supposed to generate the OBJ vertex points? I have it installed locally (Windows with a 24GB 4090), the gradio starts, and when I prompt it the vertex generation seems to take around 10 seconds per line/vertex.
Is this normal? Any tps to speed it up?
Thanks.
The text was updated successfully, but these errors were encountered: