Create int bfloat16 vector formats #136627

Merged
thecoop merged 26 commits into elastic:lucene_snapshot from thecoop:int-hnsw-bfloat16
Nov 10, 2025

@thecoop
Member

@thecoop thecoop commented Oct 15, 2025

Create classes for quantized int vector formats that support bfloat16 and direct IO. This uses the new Lucene104 quantizer.
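For readers unfamiliar with bfloat16: it keeps the sign bit, the 8 exponent bits, and the top 7 mantissa bits of an IEEE 754 float32, so a stored vector component takes 2 bytes instead of 4. The following is a minimal, self-contained sketch of that conversion with round-to-nearest-even. The class and method names (`BFloat16Sketch`, `floatToBFloat16`, `bFloat16ToFloat`) are hypothetical and are not taken from this PR or from Lucene; the actual formats may convert differently.

```java
// Hypothetical illustration only; not the code added by this PR.
public final class BFloat16Sketch {
    private BFloat16Sketch() {}

    /** Converts a float32 value to bfloat16 bits using round-to-nearest-even. */
    public static short floatToBFloat16(float value) {
        if (Float.isNaN(value)) {
            // Upper 16 bits of the canonical float32 NaN (0x7fc00000).
            return (short) 0x7fc0;
        }
        int bits = Float.floatToIntBits(value);
        // Rounding bias: 0x7fff plus the lowest bit that will be kept,
        // so that exact ties round to an even result.
        int lsb = (bits >>> 16) & 1;
        bits += 0x7fff + lsb;
        // Keep sign, exponent, and the top 7 mantissa bits.
        return (short) (bits >>> 16);
    }

    /** Expands bfloat16 bits back to float32 by zero-filling the low 16 bits. */
    public static float bFloat16ToFloat(short bf) {
        return Float.intBitsToFloat((bf & 0xffff) << 16);
    }

    public static void main(String[] args) {
        float original = 3.14159265f;
        short encoded = floatToBFloat16(original);
        float decoded = bFloat16ToFloat(encoded);
        System.out.println(original + " -> 0x"
            + Integer.toHexString(encoded & 0xffff) + " -> " + decoded);
    }
}
```

The round trip loses precision (only about 2 to 3 significant decimal digits survive), which is the trade-off for halving the size of the raw vector data.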

@elasticsearchmachine elasticsearchmachine added v9.3.0 Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch labels Oct 15, 2025
@elasticsearchmachine
Collaborator

Pinging @elastic/es-search-relevance (Team:Search Relevance)

@thecoop thecoop changed the base branch from main to lucene_snapshot October 23, 2025 08:52
@thecoop thecoop requested a review from benwtrent October 23, 2025 14:05
@thecoop thecoop requested review from a team as code owners October 23, 2025 15:20
Member

@benwtrent benwtrent left a comment

I need to remember why I started that Flat vs Format pattern for bit vectors, which you have continued here. I am not sure it's actually warranted.

@thecoop
Member Author

thecoop commented Oct 24, 2025

I've removed the intermediate format and pushed everything into the top-level formats.

Contributor

@john-wagster john-wagster left a comment

This LGTM. It did pop into my head that I don't remember where we've validated that Lucene104ScalarQuantizedVectorScorer is better overall and that we don't need some option to be able to roll back. It's also not clear that this has anything to do with this PR; it's just an errant thought I figured I'd write down.

@thecoop
Member Author

thecoop commented Nov 10, 2025

We haven't yet. We'll be able to change the format after this has merged, so we can validate the performance of the new formats before we GA it.

@thecoop thecoop merged commit 851cb80 into elastic:lucene_snapshot Nov 10, 2025
34 checks passed
@thecoop thecoop deleted the int-hnsw-bfloat16 branch November 10, 2025 09:27
thecoop added a commit to thecoop/elasticsearch that referenced this pull request Nov 17, 2025
Labels

>non-issue, :Search Relevance/Vectors (Vector search), Team:Search Relevance (Meta label for the Search Relevance team in Elasticsearch), v9.3.0

4 participants