Milbus Speed Up wway #38573
-
I am currently using Milvus as a vector database. However, when utilizing vector search with HNSW, IVF_FLAT, and GPU_CAGRA, the average search time was around 0.5 seconds, with almost no noticeable difference among them. Of course, I only tested with a single data point, but is it normal for the search times to be so consistent? Additionally, the parameters for each index type are as follows: m: 16 nlist: 128 intermediate_graph_degree: 64 metric_type: L2 |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
I guess your collection is in "Strong" consistency level. Consistency level is to control the data visibility for search/query in the scenario that data is inserted/updated continually during search/query. Let's say there is a client continually sends insert requests to add new data. Another client calls search(). Since insert action is asynchronously, the inserted data might be invisible to search requests and user might get unexpected results. Consistency level allows users to control the data visibility for search requests. Let's say a search request is received at time point "t1". Strong level ------- Ensure the data inserted before "t1" is visible for this search request. Since the data is consumed asynchronously, the search request needs to wait a short time, typically 200ms ~ 400ms. Try the "Eventually" level:
Normally, for small collection contains no more than one million rows with index. a search latency should be around 10 milliseconds. |
Beta Was this translation helpful? Give feedback.
I guess your collection is in "Strong" consistency level.
Consistency level is to control the data visibility for search/query in the scenario that data is inserted/updated continually during search/query.
https://milvus.io/docs/consistency.md#Consistency-Level
Let's say there is a client continually sends insert requests to add new data. Another client calls search(). Since insert action is asynchronously, the inserted data might be invisible to search requests and user might get unexpected results. Consistency level allows users to control the data visibility for search requests.
Let's say a search request is received at time point "t1".
Strong level ------- Ensure the data inserted bef…