RaBitQ implementation #4235

alexanderguzhva · 2025-03-11T12:24:15Z

This is a reference implementation of the https://arxiv.org/pdf/2405.12497

Jianyang Gao, Cheng Long, "RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search".

The goal is to correctly set up the internals using Faiss.

The following comments for the implementation:

The code does not include the computations for the symmetric distance, because it is absent in the original article. This can be added later, though.
The original RaBitQ includes random matrix rotation as a part of it, but I've decided to rely on external faiss::IndexPreTransform and faiss::RandomRotationMatrix facilities.
Certain features required internal changes in faiss::IndexIVF, but I did that as least invasive as possible, without breaking the backward compatibility.
Not sure about naming convensions, maybe certain classes and structures need to be renamed
METRIC_INNER_PRODUCT is supported as well
More unit tests are needed?
I did not bring any hardware-specific optimizations, bcz this is a reference implementation. Certain simdlib facilities may be added later, if needed

Here's how to use IndexRaBitQ

        ds = datasets.SyntheticDataset(...)

        index_rbq = faiss.IndexRaBitQ(ds.d, faiss.METRIC_L2)
        index_rbq.qb = 8

        # wrap with random rotations
        rrot = faiss.RandomRotationMatrix(ds.d, ds.d)
        rrot.init(rrot_seed)

        index_cand = faiss.IndexPreTransform(rrot, index_rbq)
        index_cand.train(ds.get_train())
        index_cand.add(ds.get_database())

Here's how to use IndexIVFRaBitQ

        ds = datasets.SyntheticDataset(...)

        index_flat = faiss.IndexFlat(ds.d, faiss.METRIC_L2)
        index_rbq = faiss.IndexIVFRaBitQ(index_flat, ds.d, nlist, faiss.METRIC_L2)
        index_rbq.qb = 8

        # wrap with random rotations
        rrot = faiss.RandomRotationMatrix(ds.d, ds.d)
        rrot.init(rrot_seed)

        index_cand = faiss.IndexPreTransform(rrot, index_rbq)
        index_cand.train(ds.get_train())
        index_cand.add(ds.get_database())

mdouze

Looks good! Thanks for deferring the SIMD optimization to later.
I left a few comments.

mdouze · 2025-03-11T13:02:38Z

faiss/IndexIVF.h

     */
    virtual InvertedListScanner* get_InvertedListScanner(
            bool store_pairs = false,
            const IDSelector* sel = nullptr) const;

+    /** Get a scanner for this index (store_pairs means ignore labels).


I would prefer to replace get_InvertedListScanner altogether with the version that takes IVFSearchParameters (and no sel, since IDSelector is a field of IVFSearchParameters)

what about the backward compatibility? This is why I introduced get_InvertedListScanner_2. I mean that I can upgrade the method signature Faiss-wide, but what about the external code?

mdouze · 2025-03-11T13:04:23Z

faiss/impl/RaBitQuantizer.cpp

+    FAISS_ASSERT(codes != nullptr);
+    FAISS_ASSERT(x != nullptr);
+
+    if (n == 0) {


move this test before the asserts

mdouze · 2025-03-11T13:05:21Z

faiss/impl/RaBitQuantizer.cpp

+
+struct FactorsData {
+    // ||or - c||
+    float factor_0 = 0;


why not give the fields proper names? Otherwise you might as well use float factors[4]

mdouze · 2025-03-11T13:07:13Z

faiss/impl/RaBitQuantizer.cpp

+        const uint8_t* query_j = rearranged_rotated_qq.data() + j * di_8b;
+
+        // process 64-bit popcounts
+        unsigned long long count = 0;


please use explicity sized integer types (eg uint64_t)

will do. As far as I remember, I've used unsigned long long, bcz there were problems with compilations on MacOS for these __builtin_popcount functions

mdouze · 2025-03-11T13:16:22Z

faiss/impl/RaBitQuantizer.cpp

+    float factor_2 = 0;
+    // ||or||^2
+    float factor_3 = 0;
+};


In my implementation there are only 2 floats per database vector. Do you store additional ones for efficiency?

alexanderguzhva · 2025-03-11T18:36:38Z

@mdouze two more comments after the discussion

First. IndexIVF::get_InvertedListScanner() signature is made into

    virtual InvertedListScanner* get_InvertedListScanner(
            bool store_pairs = false,
            const IDSelector* sel = nullptr,
            const IVFSearchParameters* params = nullptr) const;

because of the following logic that can override sel

void IndexIVF::search_preassigned(...) const {
    ...
    IDSelector* sel = params ? params->sel : nullptr;
    const IDSelectorRange* selr = dynamic_cast<const IDSelectorRange*>(sel);
    if (selr) {
        if (selr->assume_sorted) {
            sel = nullptr; // use special IDSelectorRange processing
        } else {
            selr = nullptr; // use generic processing
        }
    }
    ....
}

Please let me know your thoughts.

Second. RaBitQ uses 3 factors

struct FactorsData {
    float or_minus_c_l2sqr = 0;
    float dp_multiplier = 0;
    // this is needed to support BOTH L2 and IP on the same data
    float or_l2sqr = 0;
};

The third one or_l2sqr is needed to support both L2 and IP, similar to how PQ / SQ can use the same data for different metrics. So, these three numbers per vector are independent from a chosen metric. These three factors can be reduced to two, if we make a decision to make factors dependent from the chosen metric.
Please let me know if you'd like to have 2 or 3 factors per vector. Just double-checking.

Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>

facebook-github-bot added the CLA Signed label Mar 11, 2025

mdouze reviewed Mar 11, 2025

View reviewed changes

alexanderguzhva mentioned this pull request Mar 11, 2025

[WIP] RaBitQ implementation zilliztech/knowhere#1115

Open

alexanderguzhva force-pushed the rabitq branch from 1eb0fa8 to 68c3c36 Compare March 11, 2025 18:30

alexanderguzhva added 6 commits March 11, 2025 14:56

RaBitQ implementation

8133e4d

Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>

fix formatting and compilation

3622518

Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>

fix windows compilation

8127397

Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>

update factors

d6ebd89

Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>

modify IndexIVF interface

418b94a

Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>

fix formatting

346830c

Signed-off-by: Alexandr Guzhva <alexanderguzhva@gmail.com>

alexanderguzhva force-pushed the rabitq branch from d1d5012 to 346830c Compare March 11, 2025 18:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RaBitQ implementation #4235

RaBitQ implementation #4235

alexanderguzhva commented Mar 11, 2025

mdouze left a comment

mdouze Mar 11, 2025

alexanderguzhva Mar 11, 2025

mdouze Mar 11, 2025

mdouze Mar 11, 2025

mdouze Mar 11, 2025

alexanderguzhva Mar 11, 2025

mdouze Mar 11, 2025

alexanderguzhva commented Mar 11, 2025

RaBitQ implementation #4235

Are you sure you want to change the base?

RaBitQ implementation #4235

Conversation

alexanderguzhva commented Mar 11, 2025

mdouze left a comment

Choose a reason for hiding this comment

mdouze Mar 11, 2025

Choose a reason for hiding this comment

alexanderguzhva Mar 11, 2025

Choose a reason for hiding this comment

mdouze Mar 11, 2025

Choose a reason for hiding this comment

mdouze Mar 11, 2025

Choose a reason for hiding this comment

mdouze Mar 11, 2025

Choose a reason for hiding this comment

alexanderguzhva Mar 11, 2025

Choose a reason for hiding this comment

mdouze Mar 11, 2025

Choose a reason for hiding this comment

alexanderguzhva commented Mar 11, 2025