How to increase cpu usage of query node #38674
Replies: 2 comments 3 replies
-
To raise CPU usage, you can increase the replica number when loading the collection.
With replica_number=4, the cluster requires 4X memory capacity. QPS will increase to 800. You need more clients sending requests in parallel. DISKANN requires higher disk I/O than other index types. If you don't believe disk I/O is the bottleneck, you can try:
-
I had a similar issue; here's my perspective:
-
I want to maximize search throughput with the DiskANN index. According to the Milvus benchmark report, search throughput increases proportionally with CPU cores when scaling up query nodes. However, when I tried scaling up, the CPU usage of the query nodes stayed very low (around 10%).
Which component in the pipeline could be a potential bottleneck before the query nodes?
Currently, we are measuring QPS with a multi-vector search (5 vectors per request) and a limit of 10, and the QPS is around 200. It seems unlikely that disk I/O is the bottleneck.
The reasoning is based on this discussion, which assumes about 10 disk I/Os for each unit of limit.
With a 5-vector search, a limit of 10, and 200 QPS, the I/O generated per second would be 200 * 10 * 10 * 5 = 100K I/Os. However, since there are 100 query node replicas, each query node sees only about 1,000 I/Os per second.
This is significantly below the IOPS capacity of the disk in use, so I concluded that disk I/O is not the bottleneck. If there is any flaw in this reasoning, please let me know.
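For reference, the estimate above can be written as a small Python sketch. The function and parameter names are hypothetical, the figure of 10 I/Os per returned result is the assumption taken from the linked discussion (not a measured value), and the load is assumed to spread evenly across query nodes.

```python
def estimated_iops(qps, vectors_per_request, limit, ios_per_result, query_nodes):
    """Rough disk I/O estimate for the search workload described above.

    ios_per_result is an assumed constant (10 in the linked discussion),
    and requests are assumed to be spread evenly over all query nodes.
    """
    total = qps * vectors_per_request * limit * ios_per_result
    per_node = total / query_nodes
    return total, per_node


# Numbers from this post: 200 QPS, 5 vectors per request, limit 10,
# an assumed 10 I/Os per result, and 100 query node replicas.
total, per_node = estimated_iops(200, 5, 10, 10, 100)
print(f"total I/Os per second: {total}, per query node: {per_node}")
```

If the real I/O count per result is higher than assumed, or the load is not evenly balanced, the per-node figure scales accordingly, so it is worth comparing against the disk metrics on a query node.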
Thank you.