Skip to content

Update llama32 vision (mllama) use attention bias #6560

Update llama32 vision (mllama) use attention bias

Update llama32 vision (mllama) use attention bias #6560

Optional_L2_Megatron_GPT_Pretraining_and_Resume_Training_PP2  /  main

succeeded Nov 22, 2024 in 3m 51s