Given a vector dataset
-
Product Quantization (PQ) : PQ [TPAMI'11] is a typical quantization algorithm that is widely used in large billion-scale indices [NeurIPS'19,ECCV'18].
-
Optimized Product Quantization (OPQ) : OPQ [TPAMI'14] allows jointly optimizes bit allocation and reconstruction error.
-
Link&Code : Link&Code[CVPR'18] design a new PQ-based method specifically for HNSW.
-
Catalyst : Catalyst[ICLR'19]utilize a compression network optimize quantization for similarity search, and DIM-RED [ICML'20] combine this method with graph-based ANNS methods.
Dataset | # Dimension | # Base | # Query | # Train |
---|---|---|---|---|
BIGANN (link) | 128 | 1,000,000 | 100,000 | 500,000 |
DEEP (link) | 96 | 1,000,000 | 100,000 | 500,000 |
GIST (link)) | 960 | 1,000,000 | 10,000 | 500,000 |
UKBENCH(link) | 128 | 1,000,000 | 200 | 97,907 |
SIFT (link) | 128 | 1,000,000 | 10,000 | 100,000 |