How to save the quanted model #50

cquxl · 2024-11-08T05:17:19Z

I run into problems when saving the quantized model. The code is here:

    if args.save_model:
        model.save_pretrained(args.save_model)
        tokenizer.save_pretrained(args.save_model)

Could you please provide the right method to save the quantized model?

sashkboos · 2024-11-08T12:55:30Z

Thanks @cquxl for your issue.

What is the problem with that? What error do you get when saving the model?

cquxl · 2024-11-09T06:54:18Z

Thanks for your reply.

After saving the model with the code above, I get an error like "ValueError: You are trying to save a non contiguous tensor".
I worked around it with the following code:

    if args.save_model:
        for name, param in model.named_parameters():
            if not param.is_contiguous():
                param.data = param.data.contiguous()
        model.save_pretrained(args.save_model)
        tokenizer.save_pretrained(args.save_model)

This saves the model. However, when I load it with transformers.AutoModelForCausalLM.from_pretrained("saved model "), I get another error: "ValueError: Trying to set a tensor of shape torch.Size([1]) in "weight" (which has shape torch.Size([8192])), this looks incorrect.".
Could you please tell me how to save the quantized model, as OmniQuant does, and then use the saved model for evaluation with lm_eval?
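
For reference, the contiguity workaround described above can be sketched as a small standalone helper. This is only an illustration on a toy layer, not the repository's own save routine: the names `make_parameters_contiguous` and `toy` are hypothetical, and the sketch only addresses the "non contiguous tensor" error when saving, not the shape mismatch seen when loading.

```python
import torch
from torch import nn


def make_parameters_contiguous(model: nn.Module) -> int:
    """Replace non-contiguous parameter data in place.

    Returns the number of parameters that were rewritten.
    Hypothetical helper, written for illustration only.
    """
    fixed = 0
    for _, param in model.named_parameters():
        if not param.is_contiguous():
            param.data = param.data.contiguous()
            fixed += 1
    return fixed


# Toy stand-in for a quantized model: assign a transposed view so the
# weight has the right shape (3, 4) but a non-contiguous memory layout.
toy = nn.Linear(4, 3, bias=False)
toy.weight.data = torch.randn(4, 3).t()

was_contiguous = toy.weight.is_contiguous()
num_fixed = make_parameters_contiguous(toy)
```

Note that `named_parameters()` does not cover buffers, so if a model keeps non-contiguous tensors in buffers, those would probably need the same treatment before calling `save_pretrained`.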
