llama : improve BPE pre-processing + LLaMA 3 and Deepseek support #11445
Job | Run time |
---|---|
26m 53s | |
29m 39s | |
28m 27s | |
15m 13s | |
15m 25s | |
15m 54s | |
20m 9s | |
10m 7s | |
14m 43s | |
9m 12s | |
8m 33s | |
3h 14m 15s |
Job | Run time |
---|---|
26m 53s | |
29m 39s | |
28m 27s | |
15m 13s | |
15m 25s | |
15m 54s | |
20m 9s | |
10m 7s | |
14m 43s | |
9m 12s | |
8m 33s | |
3h 14m 15s |