Description
The fp16 support via the Faiss scalar quantizer (SQfp16) reduces memory usage by 50% while providing on-par performance and mostly similar recall compared to fp32 vectors. However, fp16 has a range limitation: every input vector value must fall within [-65504, 65504], which is a bottleneck to adopting fp16 as the default datatype in place of fp32.
To overcome this limitation, we can add support for BFloat16 via the Faiss scalar quantizer (SQbf16). BFloat16 offers the same range as fp32 by trading off precision (7 mantissa bits, roughly 2 to 3 significant decimal digits) while still using 16 bits per dimension, so it retains the 50% memory reduction.
Intel AVX-512 also provides a BF16 instruction set, which can be used to further improve performance on newer-generation processors.
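The range limitation is easy to demonstrate at the bit level. A minimal sketch using Python's standard-library `struct` module (its `"e"` format encodes IEEE-754 half precision, i.e. fp16):

```python
import struct

# 65504 is the largest finite value representable in fp16;
# it packs without error.
struct.pack("<e", 65504.0)

# A vector component just outside that range cannot be encoded at all,
# so fp16 cannot be a drop-in default for arbitrary fp32 inputs.
try:
    struct.pack("<e", 70000.0)
except OverflowError:
    print("70000.0 does not fit in fp16")
```

This is the same constraint SQfp16 imposes on input vectors: any dimension outside [-65504, 65504] cannot be quantized.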
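Faiss implements the actual SQbf16 encoding internally; as an illustration of why bf16 sidesteps the fp16 range limit, here is a minimal sketch of the float32-to-bfloat16 conversion (bf16 is simply the top 16 bits of a float32, so it inherits the full fp32 exponent range), with round-to-nearest-even on the discarded low bits:

```python
import struct

def f32_to_bf16_bits(x: float) -> int:
    """Encode a float as a 16-bit bfloat16 pattern: keep the top 16 bits
    of the IEEE-754 float32 representation, rounding to nearest even.
    (Sketch only: does not special-case NaN payloads.)"""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits += 0x7FFF + ((bits >> 16) & 1)  # round-to-nearest-even increment
    return bits >> 16

def bf16_bits_to_f32(b: int) -> float:
    """Expand a bfloat16 bit pattern back to float32 by zero-padding
    the low 16 mantissa bits."""
    return struct.unpack("<f", struct.pack("<I", b << 16))[0]

# 70000.0 overflows fp16's [-65504, 65504] range, but round-trips
# through bf16 with only a small relative error (~0.2% here):
x = 70000.0
y = bf16_bits_to_f32(f32_to_bf16_bits(x))
```

The trade-off is visible in the round-trip: the value survives (unlike fp16), but only about 2 to 3 decimal digits of precision are retained, which is the recall-versus-memory trade this proposal accepts.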