[QUESTION]　To convert a Llama 3.1 70B checkpoint in torch dcp format to the HuggingFace format, #1245

kaiyama12345679 · 2024-10-19T05:16:45Z

kaiyama12345679
Oct 19, 2024

I've pre-trained Llama-3.1 70B using Megatron-LM (8 tensor parallel, 2 pipeline parallel, using distributed optimizer), and I have confirmed that the checkpoint is saved in the torch dcp format.
However, I am not sure how to convert this checkpoint into a format that can be uploaded to HuggingFace. If anyone knows how to do this, I would greatly appreciate your help.

zhangyilalala · 2024-11-19T09:21:17Z

zhangyilalala
Nov 19, 2024

@kaiyama12345679 Hi~ Did you find a solution later? I'm encountering the same issue.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QUESTION]　To convert a Llama 3.1 70B checkpoint in torch dcp format to the HuggingFace format, #1245

{{title}}

Replies: 1 comment

{{title}}

Select a reply

[QUESTION] To convert a Llama 3.1 70B checkpoint in torch dcp format to the HuggingFace format, #1245

kaiyama12345679 Oct 19, 2024

Replies: 1 comment

zhangyilalala Nov 19, 2024

[QUESTION]　To convert a Llama 3.1 70B checkpoint in torch dcp format to the HuggingFace format, #1245

kaiyama12345679
Oct 19, 2024

zhangyilalala
Nov 19, 2024