[QUESTION] When will model have _extra_state
?
#1104
Unanswered
1049451037
asked this question in
Q&A
Replies: 1 comment
-
I had faced this error too, recently, and had to write a downstream script to remove this from every layer -- looks like the Mcore format is out of sync with what MegatronLM consumes? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
After updating to the main branch of Megatron-LM recently, I met this error when loading model:
The checkpoint is transformed by the
tools/checkpoint/convert.py
, and loaded bypretrain_gpt.py
.Beta Was this translation helpful? Give feedback.
All reactions