
[Question] The locally deployed deepseek-v3 loses 5 points compared to the API #212

Open
Wen1163204547 opened this issue Jan 3, 2025 · 7 comments

Comments

@Wen1163204547

I deployed deepseek-v3 locally on 8x H20 and tested LiveBench-0831 with temperature=0 and no system prompt. The result shows a 5-point drop compared to the API. Is this released model the same as the one behind the API?
[screenshot: LiveBench-0831 score comparison]
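For context, requests matching the described setup (temperature=0, no system prompt) against a local vLLM deployment would look roughly like this. This is a minimal sketch, assuming vLLM's OpenAI-compatible server on its default local endpoint; the model name and prompt are placeholders, not the actual benchmark harness:

```python
from openai import OpenAI

# vLLM exposes an OpenAI-compatible API; the base_url and api_key below are
# the usual defaults for a local deployment (placeholders, not the
# commenter's actual setup).
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

resp = client.chat.completions.create(
    model="deepseek-v3",  # placeholder served-model name
    messages=[
        # no system message, matching the test setup
        {"role": "user", "content": "..."},
    ],
    temperature=0,  # greedy decoding, as in the benchmark run
)
print(resp.choices[0].message.content)
```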

@GeeeekExplorer
Contributor

Which inference engine are you using in the local deployment?

@Wen1163204547
Author

@GeeeekExplorer vllm-0.6.6.post1

@qq1469617613

Could you share a deployment guide for the local setup? I'd also like to try deploying it locally.

@Wen1163204547
Author

> Could you share a deployment guide for the local setup? I'd also like to try deploying it locally.

I've done some custom integration, so I can only share the parameter configuration:
max_num_seqs: 32
quantization: fp8
max_model_len: 9000
trust_remote_code: true
tensor_parallel_size: 8
enable_chunked_prefill: true
gpu_memory_utilization: 0.98
max_num_batched_tokens: 1024
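
For reference, these settings map directly onto vLLM's engine arguments. A minimal offline-inference sketch, assuming vLLM 0.6.x and a placeholder model path:

```python
from vllm import LLM, SamplingParams

# Engine arguments mirror the configuration listed above;
# the model path is a placeholder.
llm = LLM(
    model="/path/to/DeepSeek-V3",
    quantization="fp8",
    tensor_parallel_size=8,
    max_model_len=9000,
    max_num_seqs=32,
    max_num_batched_tokens=1024,
    enable_chunked_prefill=True,
    gpu_memory_utilization=0.98,
    trust_remote_code=True,
)

# temperature=0 matches the benchmark setup described in this thread.
outputs = llm.generate(["..."], SamplingParams(temperature=0, max_tokens=256))
print(outputs[0].outputs[0].text)
```

The same options can be passed as flags to the OpenAI-compatible server (e.g. `--tensor-parallel-size 8 --quantization fp8`), which is the more common way to run an evaluation client against the model.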

@chenatu

chenatu commented Jan 8, 2025

How many tokens per second do you get with 8x H20?

@Wen1163204547
Author

@GeeeekExplorer I tried vllm-0.6.6.post1 and vllm-0.6.6; both score around 53, while the API reaches about 57. Is the API serving the same model as the open-source release?

@Wen1163204547
Author

> How many tokens per second do you get with 8x H20?

Use two H100 machines instead; they're much faster than the H20.
