Non Uniform Performance with GPU - Indexing running over days even with GPU. #39110
Replies: 2 comments
-
|
Beta Was this translation helpful? Give feedback.
-
@rohitreddy1698 Hi, please replace the IVF_PQ in the build algorithm with NN_DESCENT. IVF_PQ performs a brute-force search-like method to construct the adjacency list, which can be very time-consuming on cards like the T4. |
Beta Was this translation helpful? Give feedback.
-
Hi,
I was trying to run some tests on GPU vs CPU Index performance.
I am trying to use the Zilliz Vector DB Bench Tool to run the benchmarking and capture performance metrics.
I have GPU Cluster setup on GKE :
Index Nodes : 3 n1-highmem-8 , nodes with 4 GPUs per node as index nodes :
4 NVIDIA T4 GPUs per node - 16* 4 GiB Memory = 64 GiB
1 ) When I run GPU IVF PQ for 100M Vectors 768 D with parameters :
nlist=10000, m=64 , n=16 - Is running over days. And ended in timeout.
On Index status is being build :
But on Grafana GPU Utilisation is show as 0. Attaching Image for reference.
Also is there a way to increase the Recall : with nlist=10000, m=32 , n=16 - recall is 31.42
For GPU CAGRA for same 100M Vectors 768 D dataset
Screenshot for parameters used.
The Indexing time is 5x more as compared to HNSW
Load Duration Output from Zilliz Vector DB Bench Tool :
GPU CAGRA : 195,700 s
HNSW : 37,292 s
The Same observed even for 10 M dataset :
GPU CAGRA : 32220 s
HNSW : 17430 s
But as per blogs from Milvus and in open source blogs I think CAGRA should be performing better.
Can you please help us on this.
Regards,
Rohit Mothe
Beta Was this translation helpful? Give feedback.
All reactions