- Providing an in-depth analysis of the graph-based ANNS workload, highlighting its optimization space.
- Offering a more detailed parameterized search strategy that allows for flexible trade-offs between search speed and accuracy.
- To incorporate hidden dimensions not embedded into vectors (e.g., temporal information), we propose a new proximity graph construction algorithm and a graph memory layout optimization.
katex: true
tags:
- Paper Notes
- Vector Search
Perspectives¶
约 250 个字 4 张图片 预计阅读时间 1 分钟
- This paper reveals that it is not worthwhile to exploit intra-iteration and intra-query parallelism.
- Not every iteration in searching contributes equally to the final result, so it is not necessary to wait for all iterations to finish.
- The paper reveals that the input queries and their ANNS results exhibit a notable temporal correlation in many real-world datasets.
Core Ideas¶
Parameterized Beam Search¶
Recency-aware Graph Construction¶
Graph Layout Optimization¶
Selective Neighbor Scan¶
This early termination of neighbor scanning has two advantages:
- it avoids fetching unpromising vector data from the memory;
- unpromising distance computations are also circumvented.
Vertex reordering¶
To avoid random access and simplify the selection logic, we propose a graph layout optimization. The idea is that, after the graph is constructed, we sort each vertex’s out-neighbor list based on the timestamp.
Then, at each iteration of the beam search, only a fraction of the vertex’s out-neighbors is fetched and evaluated.