Anole-HF cannot be converted to Anole-Torch accurately #45

Hritikbansal · 2024-10-25T23:32:40Z

Hi,

I converted Anole-HF (https://huggingface.co/leloy/Anole-7b-v0.1-hf) to Anole-torch using bin_to_pth.py script.

I observe that the {q/k}_norm.weight, {q/k}_norm.bias, self_attn.{q/k}_proj.weight params do not match between the converted Anole-torch (from HF) and the original Anole-torch (Anole-7b-v0.1) shared as a part of this repo.

I believe that this is due to some special operations (permute and t()) that convert_chameleon_weights_to_hf.py performs on the Torch checkpoint to give the HF checkpoint, which the bin_to_pth.py does not revert.

Can you fix the bin_to_pth file for accurate conversion?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Anole-HF cannot be converted to Anole-Torch accurately #45

Anole-HF cannot be converted to Anole-Torch accurately #45

Hritikbansal commented Oct 25, 2024 •

edited

Loading

Anole-HF cannot be converted to Anole-Torch accurately #45

Anole-HF cannot be converted to Anole-Torch accurately #45

Comments

Hritikbansal commented Oct 25, 2024 • edited Loading

Hritikbansal commented Oct 25, 2024 •

edited

Loading