Skip to content

DiskBBQ - Always block encode doc vectors#141598

Merged
tteofili merged 16 commits intoelastic:mainfrom
tteofili:dbbq_bulkencode_dv
Feb 4, 2026
Merged

DiskBBQ - Always block encode doc vectors#141598
tteofili merged 16 commits intoelastic:mainfrom
tteofili:dbbq_bulkencode_dv

Conversation

@tteofili
Copy link
Copy Markdown
Contributor

@tteofili tteofili commented Jan 30, 2026

This always block encodes doc vectors in DiskBBQ.
this also adjusts bulk scoring when the tail size % 4 != 0.

@benwtrent benwtrent added the :Search Relevance/Vectors Vector search label Jan 30, 2026
@tteofili
Copy link
Copy Markdown
Contributor Author

tteofili commented Feb 3, 2026

I've run some experiments yesterday with JinaV3 embeddings (FinancialQA, Trec-Covid), Cohere embeddings (Wikipedia) and E5-small (Quora). On average this gave a speedup of 50QPS across visit-percentage, filter-selectivity, quantize-bits etc.
I'll run some more experiments tomorrow.

@tteofili
Copy link
Copy Markdown
Contributor Author

tteofili commented Feb 4, 2026

here're the visualized results for FinancialQA and Trec-Covid (JinaV3 embeddings), the other datasets and models show similar trends.
Screenshot 2026-02-04 at 12 00 39
Screenshot 2026-02-04 at 12 01 39

@tteofili tteofili marked this pull request as ready for review February 4, 2026 11:05
@elasticsearchmachine elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Feb 4, 2026
@elasticsearchmachine
Copy link
Copy Markdown
Collaborator

Pinging @elastic/es-search-relevance (Team:Search Relevance)

@tteofili tteofili requested review from benwtrent, iverase and thecoop and removed request for benwtrent February 4, 2026 11:07
@elasticsearchmachine
Copy link
Copy Markdown
Collaborator

Hi @tteofili, I've created a changelog YAML for you.

@thecoop thecoop requested a review from ldematte February 4, 2026 11:14
Copy link
Copy Markdown
Contributor

@iverase iverase left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@tteofili tteofili enabled auto-merge (squash) February 4, 2026 12:20
@tteofili tteofili merged commit b068744 into elastic:main Feb 4, 2026
35 checks passed
kosabogi pushed a commit to kosabogi/elasticsearch that referenced this pull request Feb 4, 2026
mamazzol pushed a commit to mamazzol/elasticsearch that referenced this pull request Feb 5, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

>enhancement :Search Relevance/Vectors Vector search Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch v9.4.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants