[Streams] Add scalability performance journeys for Streams by rStelmach · Pull Request #252288 · elastic/kibana

rStelmach · 2026-02-09T11:51:37Z

Summary

Adds six @kbn/journeys performance journeys that validate the Streams feature at scale, covering the primary user flows across listing, detail, and management pages. These journeys are excluded from PR CI and run only in scheduled performance pipelines. The heavy wired-hierarchy journey (streams_wired_hierarchy) runs in a dedicated weekly Streams-only pipeline due to its large data setup.

Journeys

| Journey | Scale | What it exercises |
| --- | --- | --- |
| `streams_listing_page` | 5 000 classic + 3 wired children | Load, search, expand/collapse, navigate to detail |
| `streams_data_quality` | 3 wired children | Navigate to data quality tab, verify KPI metrics |
| `streams_processing_step` | 3 wired children | Open processor form, configure grok processor, save |
| `streams_retention` | 3 wired children | Open retention modal, toggle inherit, set custom retention |
| `streams_field_mapping` | 3 wired children + 200 fields | Open schema flyout, add keyword field, review & submit |
| `streams_wired_hierarchy` | 1 000 wired children | Expand/collapse large tree, search children, navigate to detail |

Data creation strategies

- **Classic streams at scale**: Uses ES `_bulk` API to auto-create 5 000 unmanaged data streams in batches of 250, bypassing the Streams backend global lock. Raises `cluster.max_shards_per_node` to accommodate the shard count.
- **Wired hierarchy at scale**: Uses batched content pack imports (20 batches of 50 children) with retry logic for transient 409/422 errors, followed by a final root routing update via the ingest API. Handles idempotency for "already exists" conflicts caused by timed-out-but-successful prior attempts.
- **Small wired hierarchy**: Serial fork of 3 children from `logs.otel` for the lightweight journeys.
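
The batching described for classic streams can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `chunk`, `buildBulkBody`, and the `logs-scale<N>-default` naming scheme are hypothetical, and the sketch assumes an index template with data streams enabled already matches those names so that the first write auto-creates each stream.

```typescript
// Hypothetical helpers: names and the stream naming scheme are illustrative only.

/** Split a list into consecutive batches of at most `size` items. */
function chunk<T>(items: T[], size: number): T[][] {
  if (!Number.isInteger(size) || size <= 0) {
    throw new Error(`batch size must be a positive integer, got ${size}`);
  }
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    batches.push(items.slice(i, i + size));
  }
  return batches;
}

/**
 * Build an NDJSON body for the Elasticsearch _bulk API that writes one seed
 * document per data stream name. Data streams accept only the `create` action,
 * and every document needs an `@timestamp` field.
 */
function buildBulkBody(streamNames: string[], timestamp: string): string {
  const lines: string[] = [];
  for (const name of streamNames) {
    lines.push(JSON.stringify({ create: { _index: name } }));
    lines.push(JSON.stringify({ '@timestamp': timestamp, message: 'seed' }));
  }
  // The _bulk body must be terminated by a newline.
  return lines.join('\n') + '\n';
}

// 5 000 stream names split into batches of 250, as in the description above.
const names = Array.from({ length: 5000 }, (_, i) => `logs-scale${i}-default`);
const batches = chunk(names, 250);
```

Each batch body would then be sent as one `_bulk` request. Writing directly to Elasticsearch this way is what lets the setup avoid the Streams backend lock mentioned above, since no Streams API call is involved.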

Infrastructure

- `.buildkite/pipelines/performance/streams_weekly.yml`: weekly Streams-only pipeline (`JOURNEYS_GROUP=streams`) on `kb-static-scalability-2`
- `.buildkite/pipeline-resource-definitions/kibana-streams-performance-weekly.yml` + `.buildkite/pipeline-resource-definitions/locations.yml`: Buildkite pipeline resource definition
- `streams_heavy_config.ts`: extended FTR config with 1-hour mocha timeout (covers `beforeSteps` data setup)
- `streams` journey group added to `run_performance_cli.ts` for `--group streams` execution
- Metrics are reported via `report_performance_metrics.sh`.

Follow-ups

Add a dedicated classic-stream mapping-at-scale journey that exercises very large field counts (up to 10k), in a separate PR.

Summary by CodeRabbit

Tests
- Added comprehensive Streams performance journeys (listing, data quality, field mapping, processing, retention, wired hierarchy), supporting heavy-profile runs with extended timeouts and a new journey group; included extensive setup utilities for bulk data, large wired hierarchies, and scaled test orchestration.
Chores
- Updated test metadata and scheduling to disable certain streams jobs in scheduled pipelines, expanded project references/dependencies, and added ownership entries for Streams performance tests.

Buildkite run

https://buildkite.com/elastic/kibana-single-user-performance/builds/19020#019d002d-9541-4221-b66d-ad31c1b71df0

delanni · 2026-03-20T09:14:45Z

+      pipeline_file: .buildkite/pipelines/performance/streams_weekly.yml
+      provider_settings:
+        trigger_mode: none
+        build_branches: true


I'm not sure if this should be true. Afaik, this means something like: "start a build when a new branch appears". Let's start with this and observe if it starts builds for random branches

Good catch, changed to `false`. It's intended to run on a weekly schedule on main, so there's no need to have `true` here.

elasticmachine · 2026-03-23T19:01:31Z

💛 Build succeeded, but was flaky

Buildkite Build
Commit: 442a790

Failed CI Steps

Jest Tests #5

Test Failures

[job] [logs] Jest Tests #5 / form payload & api errors should surface API errors and send the correct payload on success

Metrics [docs]

Unknown metric groups

ESLint disabled line counts

| id | before | after | diff |
| --- | --- | --- | --- |
| `@kbn/test-suites-xpack-performance` | 4 | 5 | +1 |

Total ESLint disabled count

| id | before | after | diff |
| --- | --- | --- | --- |
| `@kbn/test-suites-xpack-performance` | 4 | 5 | +1 |

History

💔 Build #414592 failed cd8fd2e
💔 Build #414370 failed b3db365
💛 Build #413524 was flaky 599ec21
💔 Build #413092 failed b21a7ef
💔 Build #410747 failed 5e4d806
💔 Build #410721 failed fd1a689

flash1293

Seems like the description isn't fully up to date anymore.

About the mapping - for classic streams we should test with more fields (up to 10k), it's something that does happen in practice (but can also happen on a separate PR).

What's our story for getting notified about these / acting on them?

flash1293 · 2026-03-24T13:48:36Z

+ *                   'import' for content pack bulk import (Phase 5B, scales to 1000+)
+ * @param count - Number of child streams to create
+ */
+export async function createLargeWiredHierarchy(


Looks like this isn't used anywhere?

Slipped through during cleanup, thanks for catching this

rStelmach · 2026-03-24T14:37:10Z

@flash1293

I think it's best to handle that in a separate PR, once this one is merged, we'll have the dedicated pipeline set up, which will make it easier to test on the exact environment these journeys will run in. I'll create a follow-up PR after this lands.

> What's our story for getting notified about these / acting on them?

There's a separate issue for that: https://github.com/elastic/streams-program/issues/938. Once this PR is merged and we have some data flowing, I'll start working on this

macroscopeapp · 2026-03-25T08:20:27Z

+  log.info('Wired stream hierarchy created');
+}
+
+async function ensureScaleParentStream(kibanaServer: KibanaServer, log: ToolingLog): Promise<void> {


🟡 Medium synthtrace_data/streams_data.ts:333

ensureScaleParentStream calls forkStream without retry logic for HTTP 422 lock contention. When the Streams backend is under load, the fork can fail immediately and propagate the error, causing setupLargeWiredHierarchy to fail even though other mutation operations in this file implement exponential backoff retries for the same condition.

🤖 Copy this AI Prompt to have your agent fix this:

In file x-pack/performance/synthtrace_data/streams_data.ts around line 333: `ensureScaleParentStream` calls `forkStream` without retry logic for HTTP 422 lock contention. When the Streams backend is under load, the fork can fail immediately and propagate the error, causing `setupLargeWiredHierarchy` to fail even though other mutation operations in this file implement exponential backoff retries for the same condition. Evidence trail: x-pack/performance/synthtrace_data/streams_data.ts lines 333-350 (ensureScaleParentStream with no retry); lines 127-161 (createSingleClassicStream with retry logic for isLockContentionError); lines 398-410 (fork loop with lock contention retry); lines 786-806 (setupLargeWiredHierarchy calling ensureScaleParentStream)
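
One way the missing retry could look is sketched below. This is only an illustration under stated assumptions: `withRetry`, `isTransientConflict`, and the `statusCode` error shape are hypothetical names, not the helpers in `streams_data.ts`, and treating both 409 and 422 as retryable follows the PR description rather than the actual implementation.

```typescript
// Hypothetical error shape and helper names; the PR's real helpers may differ.
interface HttpLikeError {
  statusCode?: number;
}

/** Treat HTTP 409 and 422 as transient, retryable conflicts (an assumption). */
function isTransientConflict(err: unknown): boolean {
  const status = (err as HttpLikeError | undefined)?.statusCode;
  return status === 409 || status === 422;
}

interface RetryOptions {
  maxAttempts: number;
  baseDelayMs: number;
  /** Injectable for tests; defaults to a real timer. */
  sleep?: (ms: number) => Promise<void>;
}

/** Run `operation`, retrying transient conflicts with exponential backoff. */
async function withRetry<T>(operation: () => Promise<T>, options: RetryOptions): Promise<T> {
  const sleep =
    options.sleep ?? ((ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms)));
  for (let attempt = 1; ; attempt++) {
    try {
      return await operation();
    } catch (err) {
      // Give up on non-transient errors or once the attempt budget is spent.
      if (!isTransientConflict(err) || attempt >= options.maxAttempts) {
        throw err;
      }
      // Backoff doubles each time: base, 2x base, 4x base, ...
      await sleep(options.baseDelayMs * 2 ** (attempt - 1));
    }
  }
}
```

Under these assumptions, the fork inside `ensureScaleParentStream` would be passed to the wrapper as the `operation` callback, so a 422 raised under lock contention is retried instead of failing `setupLargeWiredHierarchy` on the first attempt.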

macroscopeapp · 2026-03-25T08:21:17Z

Approvability

Verdict: Needs human review

1 blocking correctness issue found. CODEOWNERS file was modified by a non-owner — requires human review



Matches the upper bound called out in elastic#252288 review. Buildkite streams-performance pipeline will be triggered manually against this branch before merge, so we will see at 10k whether the schema editor loads within journey timeouts. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

rStelmach and others added 13 commits February 9, 2026 12:50

initial commit

4cb42e7

Changes from node scripts/eslint_all_files --no-cache --fix

c635655

Merge branch 'main' into 458-automated-scalability-testing

192ce2e

fix tests

c404915

Merge branch 'main' into 458-automated-scalability-testing

04eaa54

fix tests

5fb6e44

adjust the stream ingestion

93ea3ac

Changes from node scripts/eslint_all_files --no-cache --fix

0b20355

raise max shards

aa4a44b

Merge branch 'main' into 458-automated-scalability-testing

2237656

adjust streams creation

b2bca04

adjust max shardHeadroom

683ee05

Merge branch 'main' into 458-automated-scalability-testing

4da18ab

rStelmach closed this Feb 12, 2026

rStelmach reopened this Feb 12, 2026

rStelmach and others added 15 commits February 12, 2026 13:18

adjust max shardHeadroom

3bd3ab2

Merge branch 'main' into 458-automated-scalability-testing

1b5e465

adjust max shardHeadroom

926a52a

Merge branch 'main' into 458-automated-scalability-testing

58ff17b

add wired streams journey

0d2dc9d

Changes from node scripts/regenerate_moon_projects.js --update

5ee3aac

Changes from node scripts/eslint_all_files --no-cache --fix

087c18b

Merge branch 'main' into 458-automated-scalability-testing

3d27e89

Merge branch 'main' into 458-automated-scalability-testing

6f51503

Merge branch 'main' into 458-automated-scalability-testing

1582f16

merge conflicts

65905ca

increase wired streams number to 1000

87cbae6

adjust building stream children

a97ef99

fix streams creation

d864305

Changes from node scripts/eslint_all_files --no-cache --fix

be5b36d

rStelmach and others added 4 commits March 19, 2026 16:41

remove hardcoded delay

af717ea

Merge branch 'main' into 458-automated-scalability-testing

5f9d0cd

Merge branch 'main' into 458-automated-scalability-testing

b21a7ef

Merge branch 'main' into 458-automated-scalability-testing

599ec21

delanni reviewed Mar 20, 2026

dont automatically build on new branches

21eecda

rStelmach requested review from delanni and dmlemeshko March 20, 2026 11:43

rStelmach and others added 5 commits March 23, 2026 09:47

Merge branch 'main' into 458-automated-scalability-testing

b3db365

Merge branch 'main' into 458-automated-scalability-testing

cd8fd2e

fix CI

eb8fb62

Changes from node scripts/lint_ts_projects --fix

00ac6c1

Changes from node scripts/regenerate_moon_projects.js --update

442a790

delanni approved these changes Mar 24, 2026

flash1293 reviewed Mar 24, 2026

remove unused code

340fe70

dmlemeshko approved these changes Mar 24, 2026

Merge branch 'main' into 458-automated-scalability-testing

ba07441

macroscopeapp Bot reviewed Mar 25, 2026

Merge branch 'main' into 458-automated-scalability-testing

16adcf1

rStelmach merged commit c450545 into elastic:main Mar 27, 2026
18 checks passed

kibanamachine added the v9.4.0 label Mar 27, 2026

rStelmach mentioned this pull request May 22, 2026

[Streams] Add classic-stream field mapping performance journey #270636

Draft

rStelmach merged 71 commits into elastic:main from rStelmach:458-automated-scalability-testing
