Randomly the machine gets stuck on loading model #112

Open
Sapessii opened this issue Sep 19, 2024 · 2 comments

Comments

@Sapessii

Hi,

As the title suggests, this happens completely at random: the machine gets stuck on

Using model weights format ['*.safetensors']

and I have to manually terminate the worker and restart it.

Do you have any suggestions?

@avacaondata

I have also experienced this a couple of times and haven't found a way to prevent it. I'm sorry I can't help you here :( @Sapessii, maybe you should ping one of the maintainers for advice, so they can point you to best practices for avoiding this. Good luck :)

@nielsrolf

Bumping this because I'm also experiencing it a lot; lately it happens pretty much 100% of the time when I try to deploy 70b models.


3 participants