Use XPerFieldDocValuesFormat in AbstractTSDBSyntheticIdCodec by tlrx · Pull Request #144744 · elastic/elasticsearch

tlrx · 2026-03-23T13:12:42Z

The TSDB optimized merge logic in DocValuesConsumerUtil requires the doc values producer to be an instance of XPerFieldDocValuesFormat.FieldsReader to access field-specific producers and use optimized merges.

Without explicitly overriding docValuesFormat() in AbstractTSDBSyntheticIdCodec, the codec would use Lucene's standard PerFieldDocValuesFormat, causing the optimized merge check to fail and fall back to the slower unoptimized path.

This commit fixes AbstractTSDBSyntheticIdCodec to also use XPerFieldDocValuesFormat.FieldsReader.

elasticsearchmachine · 2026-03-23T13:13:07Z

Pinging @elastic/es-storage-engine (Team:StorageEngine)

tlrx · 2026-03-24T09:16:24Z

Buildkite benchmark this with tsdb-metricsgen-270m please

fcofdez · 2026-03-24T09:42:32Z

I've executed the tsdb-metricsgen track with this change and compared against a baseline run without synthetic ids enabled, and these are the preliminary results (comparing against all runs):

Cumulative merge time of primary shards                                                                         -5.24%          20
Cumulative merge count of primary shards                                                                        +3.35%          20
Min cumulative merge time across primary shard                                                                  -7.59%          20
Median cumulative merge time across primary shard                                                               -2.98%          20
Max cumulative merge time across primary shard                                                                  -5.09%          20

Min Throughput                                                 index                                            -0.20%          20
Mean Throughput                                                index                                            +2.01%          20
Median Throughput                                              index                                            +1.96%          20
Max Throughput                                                 index                                            +2.09%          20

Then I pulled results from a previous run where this fix was not applied yet:

Cumulative merge time of primary shards                                                                                                                                +31.47%          25
Cumulative merge count of primary shards                                                                                                                                +3.35%          25
Min cumulative merge time across primary shard                                                                                                                         +25.14%          25
Median cumulative merge time across primary shard                                                                                                                      +35.17%          25
Max cumulative merge time across primary shard                                                                                                                         +33.93%          25

Min Throughput                                                                                                         index                                            -2.45%          25
Mean Throughput                                                                                                        index                                            +0.31%          25
Median Throughput                                                                                                      index                                            -0.05%          25
Max Throughput                                                                                                         index                                            +1.24%          25

As we can observe, with the change, the merge times are similar/better than the baseline.

romseygeek

LGTM.

We should look into making it easier to test that a Codec will use the optimized merge paths here as this is very easy to miss. I'll open an issue and see if I can build something useful.

romseygeek · 2026-03-24T09:53:36Z

I opened #144834

martijnvg

Thanks @tlrx!

tlrx · 2026-03-24T14:26:43Z

Benchmark results are here but contender has a higher merge time, which is surprising and not aligned with our other benchmarks 🤔

Buildkite benchmark this with tsdb-metricsgen-270m please

burqen

Good stuff! Just have small comment.

server/src/main/java/org/elasticsearch/index/codec/tsdb/AbstractTSDBSyntheticIdCodec.java

tlrx · 2026-03-25T08:20:30Z

Buildkite benchmark this with tsdb-metricsgen-270m please

elasticmachine · 2026-03-25T08:25:56Z

💚 Build Succeeded

Buildkite Build
Commit: 8a7fa67
Baseline: ef82867 (env ID 4680910c-2306-4be3-813e-c1d6f8d3a634)
Contender: 8a7fa67 (env ID 72bc1112-3b7c-49e1-8a5f-6eb8ecd89f91)
Benchmark results

This build ran two tsdb-metricsgen-270m benchmarks to evaluate performance impact of this PR.

History

💚 Build #510 succeeded 2a941de

tlrx · 2026-03-26T08:50:49Z

Thanks everyone!

* upstream/main: (146 commits) Revert "[Native] Gradle-related tweaks to improve handling of the simdvec native library (elastic#144539)" Fix ArrayIndexOutOfBoundsException in fetch phase with partial results (elastic#144385) ESQL: Correctly manage NULL data type for SUM (elastic#144942) [ESQL] Fixes GroupedTopNBenchmark not executing (elastic#144944) Fix reader context leak when query response serialization fails (elastic#144708) Validate individual offset values in BULK_OFFSETS bounds checks (elastic#144643) Merge main21 source set into main in simdvec (elastic#144921) [TEST] Unmute TsidExtractingIdFieldMapperTests (elastic#144848) [Native] Gradle-related tweaks to improve handling of the simdvec native library (elastic#144539) Fix `ThreadedActionListenerTests#testRejectionHandling` (elastic#144795) Add new DLM Frozen Tier Transition execution plugin and service (elastic#144595) Prometheus: execute query_range via parsed EsqlStatement plan (elastic#144416) Investigate `testBulkIndexingRequestSplitting` failure (elastic#144766) Add test utility for wrapping directories in FilterDirectory layer (elastic#143563) Fix ES|QL decay tests with negative scale (elastic#144657) Fix circuit breaker leak in percolator query construction (elastic#144827) Use XPerFieldDocValuesFormat in AbstractTSDBSyntheticIdCodec (elastic#144744) [DOCS] Document how reindex work in CPS (elastic#144016) Fix Int4 vector library tests failing on Java 21 (elastic#144830) [DiskBBQ] Fix index sorting on flush (elastic#144938) ...

…#144744) The TSDB optimized merge logic in DocValuesConsumerUtil requires the doc values producer to be an instance of XPerFieldDocValuesFormat.FieldsReader to access field-specific producers and use optimized merges. Without explicitly overriding docValuesFormat() in AbstractTSDBSyntheticIdCodec, the codec would use Lucene's standard PerFieldDocValuesFormat, causing the optimized merge check to fail and fall back to the slower unoptimized path. This commit fixes AbstractTSDBSyntheticIdCodec to also use XPerFieldDocValuesFormat.FieldsReader.

Use XPerFieldDocValuesFormat in AbstractTSDBSyntheticIdCodec

9ca35bd

tlrx added >non-issue :StorageEngine/TSDB You know, for Metrics v9.4.0 labels Mar 23, 2026

elasticsearchmachine added the Team:StorageEngine label Mar 23, 2026

tlrx and others added 3 commits March 24, 2026 09:47

Merge branch 'main' into 2026/03/23-xperfielddocvaluesformat

e87dc8b

adjust unit test

7211de6

[CI] Auto commit changes from spotless

2a941de

tlrx requested review from burqen, dnhatn, fcofdez, martijnvg and romseygeek and removed request for martijnvg March 24, 2026 09:13

elastic deleted a comment from elasticmachine Mar 24, 2026

fcofdez approved these changes Mar 24, 2026

View reviewed changes

romseygeek approved these changes Mar 24, 2026

View reviewed changes

martijnvg approved these changes Mar 24, 2026

View reviewed changes

tlrx added 2 commits March 24, 2026 12:06

Merge branch 'main' into 2026/03/23-xperfielddocvaluesformat

6b82617

Merge branch 'main' into 2026/03/23-xperfielddocvaluesformat

5cb6313

dnhatn approved these changes Mar 25, 2026

View reviewed changes

burqen approved these changes Mar 25, 2026

View reviewed changes

server/src/main/java/org/elasticsearch/index/codec/tsdb/AbstractTSDBSyntheticIdCodec.java Show resolved Hide resolved

Merge branch 'main' into 2026/03/23-xperfielddocvaluesformat

8a7fa67

tlrx added 3 commits March 25, 2026 15:30

Merge branch 'main' into 2026/03/23-xperfielddocvaluesformat

70eaa72

Merge branch 'main' into 2026/03/23-xperfielddocvaluesformat

704907a

Merge branch 'main' into 2026/03/23-xperfielddocvaluesformat

9ecdee7

tlrx merged commit fd5c450 into elastic:main Mar 26, 2026
36 checks passed

tlrx deleted the 2026/03/23-xperfielddocvaluesformat branch March 26, 2026 08:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use XPerFieldDocValuesFormat in AbstractTSDBSyntheticIdCodec#144744

Use XPerFieldDocValuesFormat in AbstractTSDBSyntheticIdCodec#144744
tlrx merged 10 commits intoelastic:mainfrom
tlrx:2026/03/23-xperfielddocvaluesformat

tlrx commented Mar 23, 2026 •

edited

Loading

Uh oh!

elasticsearchmachine commented Mar 23, 2026

Uh oh!

tlrx commented Mar 24, 2026

Uh oh!

fcofdez commented Mar 24, 2026

Uh oh!

romseygeek left a comment

Uh oh!

romseygeek commented Mar 24, 2026

Uh oh!

martijnvg left a comment

Uh oh!

tlrx commented Mar 24, 2026

Uh oh!

burqen left a comment

Uh oh!

Uh oh!

tlrx commented Mar 25, 2026

Uh oh!

elasticmachine commented Mar 25, 2026 •

edited

Loading

Uh oh!

Uh oh!

tlrx commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

Conversation

tlrx commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Mar 23, 2026

Uh oh!

tlrx commented Mar 24, 2026

Uh oh!

fcofdez commented Mar 24, 2026

Uh oh!

romseygeek left a comment

Choose a reason for hiding this comment

Uh oh!

romseygeek commented Mar 24, 2026

Uh oh!

martijnvg left a comment

Choose a reason for hiding this comment

Uh oh!

tlrx commented Mar 24, 2026

Uh oh!

burqen left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tlrx commented Mar 25, 2026

Uh oh!

elasticmachine commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💚 Build Succeeded

History

Uh oh!

Uh oh!

tlrx commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

tlrx commented Mar 23, 2026 •

edited

Loading

elasticmachine commented Mar 25, 2026 •

edited

Loading