There may be a bug in train_gpt.py (https://github.com/hpcaitech/ColossalAI-Examples/blob/main/language/gpt/train_gpt.py) #196
Comments
My config is below.
Currently, a config file is not necessary. See this for the latest CAI GPT example.
If I want to add pipeline parallelism and sequence parallelism (SP) to the GPT model, how should I run the code? I am confused because the code and the documentation differ, and your code appears to use tensor parallelism (TP). BTW, I talked with your colleague, and he told me that SP cannot be combined with TP.
He is right, SP and TP cannot work together :(
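For context, here is a minimal sketch of why the two modes exclude each other, assuming the config-file style used by older ColossalAI examples. There, the `tensor` entry of the `parallel` dict takes a single `mode`, and sequence parallelism is selected through that same field, so one run can pick either a tensor-parallel mode such as `'1d'` or `'sequence'`, not both. The variable names `parallel_tp` and `parallel_sp` and the sizes are illustrative only and are not taken from this issue.

```python
# Hypothetical config fragments in the older config-file style.
# In a real config file the dict would be named `parallel`;
# two names are used here only to show the alternatives side by side.

# Option A: 1D tensor parallelism across 4 GPUs (size is a placeholder)
parallel_tp = dict(
    pipeline=1,
    tensor=dict(size=4, mode='1d'),
)

# Option B: sequence parallelism across 4 GPUs (size is a placeholder)
parallel_sp = dict(
    pipeline=1,
    tensor=dict(size=4, mode='sequence'),
)
```

Because both options occupy the same `mode` slot, choosing SP replaces the TP setting rather than adding to it.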
🐛 Describe the bug
I tried to run a config using train_gpt.py. I added a model to gpt.py.
I also changed the webtext dataset to this.
When I run this model, ColossalAI takes only about 1 s per iteration, but the same model on Megatron-LM takes about 100 s per iteration.
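A per-iteration time is only comparable between two frameworks when each iteration processes the same number of tokens. The sketch below is one way to normalize the two timings into tokens per second. The helper `tokens_per_second` is hypothetical, and the batch sizes and sequence length are placeholders, since the issue does not state the actual settings.

```python
def tokens_per_second(global_batch_size: int, seq_len: int,
                      seconds_per_iter: float) -> float:
    """Return the number of tokens processed per second for one iteration."""
    if seconds_per_iter <= 0:
        raise ValueError("seconds_per_iter must be positive")
    return global_batch_size * seq_len / seconds_per_iter


# Placeholder settings, NOT taken from the issue: the two runs are assumed
# to differ in global batch size, which is one possible source of the gap.
run_a = tokens_per_second(global_batch_size=8, seq_len=1024,
                          seconds_per_iter=1.0)
run_b = tokens_per_second(global_batch_size=512, seq_len=1024,
                          seconds_per_iter=100.0)

print(f"run A: {run_a:.2f} tokens/s")
print(f"run B: {run_b:.2f} tokens/s")
```

If the normalized throughputs are close, the 1 s versus 100 s gap mostly reflects different amounts of work per iteration rather than a bug in either script.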
Environment
No response