Create int bfloat16 vector formats by thecoop · Pull Request #136627 · elastic/elasticsearch

thecoop · 2025-10-15T14:27:39Z

Create classes for quantized int vector formats that support bfloat16 and direct IO. This uses the new Lucene104 quantizer.

elasticsearchmachine · 2025-10-15T14:28:06Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

server/src/main/java/org/elasticsearch/index/codec/vectors/es93/ES93Int8FlatVectorFormat.java

...r/src/main/java/org/elasticsearch/index/codec/vectors/ES814ScalarQuantizedVectorsFormat.java

benwtrent

I need to remember why I started that Flat vs Format pattern for bit vectors that you have continued here, I am not sure its actually warranted.

...in/java/org/elasticsearch/index/codec/vectors/es93/ES93ScalarQuantizedFlatVectorsFormat.java

This reverts commit 064392f.

thecoop · 2025-10-24T09:02:08Z

I've removed the intermediate format, and pushed everything into the top-level formats

john-wagster

this lgtm. It did pop into my head that I don't remember where we've validated that Lucene104ScalarQuantizedVectorScorer is overall better and we don't need some option to be able to roll back but it's also not clear that that has anything to do with this PR just an errant thought I figured I'd write down.

thecoop · 2025-11-10T09:27:11Z

We haven't yet. We'll be able to change the format after this has merged, so we can validate the performance of the new formats before we GA it

thecoop added 2 commits October 15, 2025 15:11

Add HNSW scalar quantized bfloat16 implementation

c10fd76

Add flat format

3a6f7fc

thecoop added >non-issue :Search Relevance/Vectors Vector search labels Oct 15, 2025

elasticsearchmachine added v9.3.0 Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch labels Oct 15, 2025

Add int8 implementation

a85b7ca

benwtrent reviewed Oct 16, 2025

View reviewed changes

server/src/main/java/org/elasticsearch/index/codec/vectors/es93/ES93Int8FlatVectorFormat.java Outdated Show resolved Hide resolved

thecoop added 4 commits October 20, 2025 11:37

Rename class

a638b59

Improve tests

419032d

Merge branch 'main' into int-hnsw-bfloat16

038db83

Fix module reference

1287a0b

thecoop force-pushed the int-hnsw-bfloat16 branch from 66172e3 to 1287a0b Compare October 20, 2025 11:16

thecoop added 2 commits October 23, 2025 08:41

Merge branch 'lucene_snapshot' into int-hnsw-bfloat16

bd55e05

Update to use new Lucene104 format

7c343c5

thecoop changed the base branch from main to lucene_snapshot October 23, 2025 08:52

thecoop and others added 4 commits October 23, 2025 10:20

Update more tests

4fa4338

Merge branch 'lucene_snapshot' into int-hnsw-bfloat16

84f5bf4

Class renames

4815bc2

[CI] Auto commit changes from spotless

94ef4f1

thecoop requested a review from benwtrent October 23, 2025 14:05

thecoop added 2 commits October 23, 2025 15:59

Merge branch 'main' into int-hnsw-bfloat16

61de804

Update for ElementType change

6720384

thecoop requested review from a team as code owners October 23, 2025 15:20

Merge branch 'lucene_snapshot' into int-hnsw-bfloat16

5c0e171

benwtrent reviewed Oct 23, 2025

View reviewed changes

...r/src/main/java/org/elasticsearch/index/codec/vectors/ES814ScalarQuantizedVectorsFormat.java Show resolved Hide resolved

benwtrent reviewed Oct 23, 2025

View reviewed changes

...in/java/org/elasticsearch/index/codec/vectors/es93/ES93ScalarQuantizedFlatVectorsFormat.java Outdated Show resolved Hide resolved

Revert "Use the reader in Lucene BWC"

115507a

This reverts commit 064392f.

Remove intermediate class

44ecd39

Merge branch 'lucene_snapshot' into int-hnsw-bfloat16

cbba58b

thecoop mentioned this pull request Oct 24, 2025

Tidy up Lucene104ScalarQuantizedVectorsWriter apache/lucene#15357

Merged

thecoop requested a review from benwtrent October 30, 2025 09:10

thecoop added 7 commits October 30, 2025 09:10

Merge branch 'lucene_snapshot' into int-hnsw-bfloat16

f989a30

Merge branch 'lucene_snapshot' into int-hnsw-bfloat16

33fe50d

Use public constructor

f6ee769

Merge branch 'lucene_snapshot' into int-hnsw-bfloat16

18e27e2

We don't need a separate search impl here

9795d0f

Merge branch 'main' into int-hnsw-bfloat16

e330b42

Merge branch 'lucene_snapshot' into int-hnsw-bfloat16

6741271

john-wagster approved these changes Nov 7, 2025

View reviewed changes

thecoop merged commit 851cb80 into elastic:lucene_snapshot Nov 10, 2025
34 checks passed

thecoop deleted the int-hnsw-bfloat16 branch November 10, 2025 09:27

thecoop added a commit to thecoop/elasticsearch that referenced this pull request Nov 17, 2025

Create int bfloat16 vector formats (elastic#136627)

2a9e2e7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create int bfloat16 vector formats#136627

Create int bfloat16 vector formats#136627
thecoop merged 26 commits intoelastic:lucene_snapshotfrom
thecoop:int-hnsw-bfloat16

thecoop commented Oct 15, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Oct 15, 2025

Uh oh!

Uh oh!

Uh oh!

benwtrent left a comment

Uh oh!

Uh oh!

thecoop commented Oct 24, 2025

Uh oh!

john-wagster left a comment

Uh oh!

thecoop commented Nov 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

thecoop commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Oct 15, 2025

Uh oh!

Uh oh!

Uh oh!

benwtrent left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

thecoop commented Oct 24, 2025

Uh oh!

john-wagster left a comment

Choose a reason for hiding this comment

Uh oh!

thecoop commented Nov 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

thecoop commented Oct 15, 2025 •

edited

Loading