#15024: Improve prefix sum in Lucene99HnswVectorsReader by leng25 · Pull Request #15790 · apache/lucene

leng25 · 2026-03-02T19:18:09Z

Summary

This PR implements the optimization suggested in #15024, replacing the two-step prefix sum loop in Lucene99HnswVectorsReader with a single-pass accumulator variant that avoids redundant memory reads.

Before:

currentNeighborsBuffer[0] = dataIn.readVInt();
for (int i = 1; i < arcCount; i++) {
  currentNeighborsBuffer[i] = currentNeighborsBuffer[i - 1] + dataIn.readVInt();
}

After:

int sum = 0;
for (int i = 0; i < arcCount; i++) {
  sum += dataIn.readVInt();
  currentNeighborsBuffer[i] = sum;
}

This is a follow-up to #15027 by @yossev who proposed the same fix. Since that PR went stale (merge conflicts, formatting), I'm resubmitting with conflicts resolved, formatting fixed via ./gradlew tidy, and benchmark results included.

I found this while looking for a good first issue to learn the contribution process — happy to adjust anything based on feedback!

Benchmark Results

Benchmarks were run using luceneutil KNN benchmark (knnPerfTest.py).

Machine: Intel Core i5-10210U, 8 logical cores, ~15 GB RAM
Dataset: cohere-v3-wikipedia-en 1024d, 400k docs, 10k queries, 8-bit quantized, dot_product

Baseline:

recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  beamWidth  quantized  visited  index(s)  index_docs/s  force_merge(s)  num_segments  index_size(MB)
 0.977        9.920   9.893        0.997  400000   100     100       64        250     8 bits     7955    486.32        822.50          437.90             1         2015.68

Candidate (this PR):

recall  latency(ms)  netCPU  avgCpuCount    nDoc  topK  fanout  maxConn  beamWidth  quantized  visited  index(s)  index_docs/s  force_merge(s)  num_segments  index_size(MB)
 0.977        9.861   9.833        0.997  400000   100     100       64        250     8 bits     7955    486.32        822.50          437.90             1         2015.68

Recall is identical. Results are from a single run so small differences may fall within normal measurement variance.

kaivalnp

LGTM

Looks like this improvement is in the range of noise for knnPerfTest.py, but is good-to-have anyways.

kaivalnp · 2026-03-05T16:33:07Z

lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsReader.java

      dataIn.seek(graphLevelNodeOffsets.get(targetIndex + graphLevelNodeIndexOffsets[level]));
      arcCount = dataIn.readVInt();
      assert arcCount <= currentNeighborsBuffer.length : "too many neighbors: " + arcCount;
+      int sum = 0;


nit: Would prefer this variable inside the if block below

Done, move inside the if block

kaivalnp · 2026-03-05T16:34:53Z

lucene/CHANGES.txt


 Optimizations
 ---------------------
+* GITHUB#15024: Improve prefix sum computation in Lucene99HnswVectorsReader for faster neighbor decoding. (Luis Negrin)


This entry is under 11.0.0 -- can you move it to 10.5.0? (I can help with merge + backport)

Move into 10.5.0, would appreciate help with the merge + backport

kaivalnp

LGTM

(cherry picked from commit fb2b916)

leng25 and others added 2 commits February 28, 2026 13:57

Improve prefix sum in Lucene99HnswVectorsReader

0b322e4

Merge branch 'apache:main' into improve-hnsw-prefix-sum

94352d6

github-actions bot added the module:core/codecs label Mar 2, 2026

github-actions bot added this to the 11.0.0 milestone Mar 2, 2026

leng25 mentioned this pull request Mar 2, 2026

Improve prefix sum in Lucene99HnswVectorsReader #15024

Closed

kaivalnp reviewed Mar 5, 2026

View reviewed changes

Move CHANGES.txt entry from 11.0.0 to 10.5.0

b63e8b9

github-actions bot modified the milestones: 11.0.0, 10.5.0 Mar 9, 2026

kaivalnp approved these changes Mar 9, 2026

View reviewed changes

kaivalnp linked an issue Mar 9, 2026 that may be closed by this pull request

Improve prefix sum in Lucene99HnswVectorsReader #15024

Closed

kaivalnp merged commit fb2b916 into apache:main Mar 9, 2026
13 checks passed

kaivalnp pushed a commit that referenced this pull request Mar 9, 2026

Improve prefix sum in Lucene99HnswVectorsReader (#15024) (#15790)

b5e8bc2

(cherry picked from commit fb2b916)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#15024: Improve prefix sum in Lucene99HnswVectorsReader#15790

#15024: Improve prefix sum in Lucene99HnswVectorsReader#15790
kaivalnp merged 3 commits intoapache:mainfrom
leng25:improve-hnsw-prefix-sum

leng25 commented Mar 2, 2026

Uh oh!

kaivalnp left a comment

Uh oh!

kaivalnp Mar 5, 2026

Uh oh!

leng25 Mar 9, 2026

Uh oh!

kaivalnp Mar 5, 2026

Uh oh!

leng25 Mar 9, 2026

Uh oh!

kaivalnp left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

leng25 commented Mar 2, 2026

Summary

Benchmark Results

Uh oh!

kaivalnp left a comment

Choose a reason for hiding this comment

Uh oh!

kaivalnp Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

leng25 Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

kaivalnp Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

leng25 Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

kaivalnp left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants