
Add Shopee-SlimMoA-v1 to AlpacaEval #398

Merged · 9 commits merged into tatsu-lab:main on Aug 26, 2024

Conversation

LLM-Alignment-sh (Contributor)

We have built and evaluated a new, faster, and more accurate MoA (Mixture-of-Agents) method. Could you help us get it included in the leaderboard? Thanks!

| model                              | length_controlled_winrate | win_rate | standard_error | n_total | avg_length |
|------------------------------------|---------------------------|----------|----------------|---------|------------|
| Shopee-SlimMoA-v1                  | 77.45 | 75.61 | 1.27 | 805 | 1994 |
| gpt-4o-2024-05-13                  | 57.46 | 51.33 | 1.47 | 805 | 1873 |
| gpt-4-turbo-2024-04-09             | 55.02 | 46.12 | 1.47 | 805 | 1802 |
| gpt-4o-mini-2024-07-18             | 50.73 | 44.65 | 1.46 | 805 | 1861 |
| gpt4_1106_preview                  | 50.00 | 50.00 | 0.00 | 805 | 2049 |
| claude-3-opus-20240229             | 40.51 | 29.11 | 1.39 | 805 | 1388 |
| Meta-Llama-3.1-405B-Instruct-Turbo | 39.26 | 39.11 | 1.43 | 805 | 1988 |
| Meta-Llama-3.1-70B-Instruct-Turbo  | 38.06 | 39.13 | 1.43 | 805 | 2044 |
| claude-3-sonnet-20240229           | 34.87 | 25.56 | 1.34 | 805 | 1420 |
| Meta-Llama-3-70B-Instruct          | 34.42 | 33.18 | 1.39 | 805 | 1919 |
| gemini-pro                         | 24.38 | 18.18 | 1.16 | 805 | 1456 |
| Mixtral-8x7B-Instruct-v0.1         | 23.69 | 18.26 | 1.19 | 805 | 1465 |
| Meta-Llama-3-8B-Instruct           | 22.92 | 22.57 | 1.26 | 805 | 1899 |
| Meta-Llama-3.1-8B-Instruct-Turbo   | 20.85 | 21.84 | 1.25 | 802 | 2181 |
| Mistral-7B-Instruct-v0.2           | 17.11 | 14.72 | 1.08 | 805 | 1676 |
| alpaca-7b                          |  5.88 |  2.59 | 0.49 | 805 |  396 |
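
For readers reproducing numbers like these: leaderboard entries are scored with the alpaca_eval annotator over the 805-instruction AlpacaEval 2.0 set, and the evaluator reports both the raw and the length-controlled win rates shown above. The snippet below is a minimal sketch of that scoring step, assuming the model's generations have already been saved to a JSON outputs file; the file path is hypothetical (it is not part of this PR), and `weighted_alpaca_eval_gpt4_turbo` is assumed to be the default GPT-4-Turbo annotator used for the official leaderboard.

```python
import subprocess

# Hypothetical path to the model's generations on the AlpacaEval 2.0 set
# (805 instructions); not taken from this PR's diff.
model_outputs = "results/Shopee-SlimMoA-v1/model_outputs.json"

# Invoke the alpaca_eval CLI. "weighted_alpaca_eval_gpt4_turbo" is assumed to
# be the default leaderboard annotator; the CLI prints both win_rate and
# length_controlled_winrate for the given outputs.
subprocess.run(
    [
        "alpaca_eval",
        "--model_outputs", model_outputs,
        "--annotators_config", "weighted_alpaca_eval_gpt4_turbo",
    ],
    check=True,
)
```

Shelling out to the documented CLI keeps the sketch independent of the package's internal Python API.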

YannDubs (Collaborator)

Great job @LLM-Alignment-sh 💯

YannDubs merged commit 4da8a95 into tatsu-lab:main on Aug 26, 2024
2 checks passed
LLM-Alignment-sh added a commit to LLM-Alignment-sh/alpaca_eval that referenced this pull request on Aug 28, 2024
* update

* update

* update

* update

* update

* update

* update

* update

---------

Co-authored-by: qingtao.yu <[email protected]>
Co-authored-by: Yann Dubois <[email protected]>