
[Kernel] Upload a MoE config file for Mixtral8x7B 8GPU on AMD_Instinct_MI300X_OAM machine (fp16) #261

Open
wants to merge 4 commits into main
Conversation

Jacob0226

This PR adds a tuned MoE config file that covers a broader range of batch sizes.
The Jira ticket GPUAI-2067 adopts this config for better MoE performance.

@shajrawi
Collaborator

@divakar-amd Can you please review? See details here: https://ontrack-internal.amd.com/browse/GPUAI-2067

@Jacob0226 I cannot access the documentation wiki in JIRA. Can you please explain how the tuning results were gathered?

@Jacob0226
Author

> @divakar-amd Can you please review? See details here: https://ontrack-internal.amd.com/browse/GPUAI-2067
>
> @Jacob0226 I cannot access the documentation wiki in JIRA. Can you please explain how the tuning results were gathered?

I shared the ticket with both of you.
I used the script benchmarks/kernels/benchmark_mixtral_moe_rocm.py to measure the runtime at various batch-size (bs) values.
Mixtral8x7B needs more bs entries, and this new MoE config covers bs values from 1 to 32768. The original config in the vllm repo only goes up to a maximum bs of 4000.
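For context, the tuned fused-MoE configs in vLLM are JSON files keyed by batch size, with each entry holding Triton kernel launch parameters. The sketch below uses placeholder values rather than the tuned numbers from this PR, plus a simple nearest-batch-size lookup to show how such a table is typically consumed.

```python
import json

# Minimal sketch of the config file shape (placeholder values, NOT the
# tuned numbers from this PR): vLLM's fused-MoE configs are JSON objects
# keyed by batch size, each entry holding Triton kernel launch parameters.
example_config = {
    "1":     {"BLOCK_SIZE_M": 16,  "BLOCK_SIZE_N": 64,  "BLOCK_SIZE_K": 64,
              "GROUP_SIZE_M": 1, "num_warps": 4, "num_stages": 2},
    "4096":  {"BLOCK_SIZE_M": 64,  "BLOCK_SIZE_N": 128, "BLOCK_SIZE_K": 64,
              "GROUP_SIZE_M": 8, "num_warps": 8, "num_stages": 2},
    "32768": {"BLOCK_SIZE_M": 128, "BLOCK_SIZE_N": 128, "BLOCK_SIZE_K": 64,
              "GROUP_SIZE_M": 8, "num_warps": 8, "num_stages": 2},
}

def pick_config(config: dict, num_tokens: int) -> dict:
    """Return the tuned entry whose batch size is closest to num_tokens,
    roughly how a kernel falls back when the current batch size has no
    exact entry in the table."""
    best = min((int(k) for k in config), key=lambda k: abs(k - num_tokens))
    return config[str(best)]

print(json.dumps(pick_config(example_config, 3000), indent=2))
```

Covering batch sizes up to 32768 matters because the nearest-entry fallback gets coarser the further the actual batch size is from the largest tuned entry.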

@Jacob0226
Author

The Jira ticket may contain more information than you need here.
When I benchmarked Mixtral with vLLM, I found I needed more bs entries in the MoE config, so I used benchmark_mixtral_moe_rocm.py to do the tuning.
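As a rough, hypothetical skeleton of what such a tuning sweep looks like (not the actual code of benchmark_mixtral_moe_rocm.py): sweep the bs values, time the kernel at each, and record the results. `run_fused_moe` below is a stand-in fp16 matmul of a Mixtral-like hidden size so the sketch stays self-contained; it requires a GPU (ROCm builds of PyTorch expose the same `cuda` device API).

```python
import torch

HIDDEN = 4096  # Mixtral 8x7B hidden size

def run_fused_moe(x: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
    # Placeholder for the real fused-MoE kernel call being tuned.
    return x @ w

def time_once(bs: int, device: str) -> float:
    """Time one kernel invocation at batch size `bs`, in milliseconds."""
    x = torch.randn(bs, HIDDEN, dtype=torch.float16, device=device)
    w = torch.randn(HIDDEN, HIDDEN, dtype=torch.float16, device=device)
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    run_fused_moe(x, w)              # warm-up
    torch.cuda.synchronize()
    start.record()
    run_fused_moe(x, w)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end)

if __name__ == "__main__":
    device = "cuda"  # ROCm PyTorch also uses the "cuda" device string
    batch_sizes = [1, 2, 4, 8, 16, 32, 64, 128, 256, 512, 1024,
                   2048, 4096, 8192, 16384, 32768]
    for bs in batch_sizes:
        print(f"bs={bs:6d}  {time_once(bs, device):8.3f} ms")
```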
