[QUESTION] To convert a Llama 3.1 70B checkpoint in torch dcp format to the HuggingFace format, #1245
Unanswered
kaiyama12345679
asked this question in
Q&A
Replies: 1 comment
-
@kaiyama12345679 Hi~ Did you find a solution later? I'm encountering the same issue. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I've pre-trained Llama-3.1 70B using Megatron-LM (8 tensor parallel, 2 pipeline parallel, using distributed optimizer), and I have confirmed that the checkpoint is saved in the torch dcp format.
However, I am not sure how to convert this checkpoint into a format that can be uploaded to HuggingFace. If anyone knows how to do this, I would greatly appreciate your help.
Beta Was this translation helpful? Give feedback.
All reactions