Skip to content

Update llama32 vision (mllama) use attention bias #6560

Update llama32 vision (mllama) use attention bias

Update llama32 vision (mllama) use attention bias #6560

L2_Megatron_GPT_SFT_Eval_inference_seq_len_greaterThan_training_seq_len  /  main

succeeded Nov 23, 2024 in 1m 56s