Nice work! I have a question related to the above one. When dealing with a large-scale corpus, the KV cache will likely be stored in remote storage, which incurs network-communication costs. In that case, do you still see a clear win in inference speed-up?
It's not just the network cost: the more data you have, the slower it will be.
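One common mitigation for the remote-storage cost raised above is a small local read-through LRU layer, so hot KV entries avoid repeated network round-trips. This is only a sketch, not code from this repository; `ReadThroughCache` and the `fetch_remote` callable are hypothetical names standing in for whatever remote lookup the system actually uses.

```python
from collections import OrderedDict

class ReadThroughCache:
    """In-memory LRU layer in front of a (hypothetical) remote KV store.

    Reads are served locally when possible; the network cost of
    `fetch_remote` is paid only on a cache miss.
    """

    def __init__(self, fetch_remote, capacity=1024):
        self.fetch_remote = fetch_remote  # callable: key -> value (network call)
        self.capacity = capacity
        self.entries = OrderedDict()      # insertion order doubles as LRU order
        self.remote_fetches = 0           # counts actual remote round-trips

    def get(self, key):
        if key in self.entries:
            self.entries.move_to_end(key)  # mark as most recently used
            return self.entries[key]
        value = self.fetch_remote(key)     # miss: pay the network cost once
        self.remote_fetches += 1
        self.entries[key] = value
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the least recently used entry
        return value
```

Sizing `capacity` to the working set of hot documents keeps most lookups off the network; repeated reads of the same key hit the local layer and never reach remote storage.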
Xiaohua Yan ***@***.***> wrote on Monday, December 16, 2024 at 12:37:
When the number of files is large, KV cache lookups slow down considerably. Are there any ways to optimize this?