[Entity Store] Implement logs pagination in CCS #266307
romulets merged 11 commits into elastic:main
Conversation
Force-pushed from bc70cb8 to 66895d4
/flaky scoutConfig:x-pack/solutions/security/plugins/entity_store/test/scout/api/playwright.config.ts:30

Flaky Test Runner: ✅ Build triggered - kibana-flaky-test-suite-runner#11983

Flaky Test Runner Stats: 🟠 Some tests failed - kibana-flaky-test-suite-runner#11983
❌ x-pack/solutions/security/plugins/entity_store/test/scout/api/playwright.config.ts: 29/30 tests passed.
florent-leborgne left a comment:
minimal docs change - LGTM
💛 Build succeeded, but was flaky
Failed CI Steps: Metrics [docs]

cc @romulets
Starting backport for target branches: 9.4
https://github.com/elastic/kibana/actions/runs/25119149957

💔 All backports failed. To create the backport manually, run the Backport tool (see the Backport tool documentation).

💚 All backports created successfully
Note: Successful backport PRs will be merged automatically after passing CI.
Backport: This will backport the following commits from `main` to `9.4`: [Entity Store] Implement logs pagination in CCS (#266307)
Summary
This PR introduces two major improvements to CCS (cross-cluster search) logs extraction:
log-slice pagination (mirroring what local extraction already had) and independent timestamp
management so CCS no longer relies on the caller to supply its time window.
A third fix resolves a subtle boundary bug in the time-window filter that caused log documents
to be silently dropped when all remaining logs share the same millisecond timestamp.
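The boundary bug can be illustrated with a minimal sketch (hypothetical names; the real filter is built into the ES|QL queries): a strict `@timestamp > checkpoint` filter drops every remaining log when they all share the checkpoint's millisecond, while a compound `(timestamp, id)` cursor keeps them.

```typescript
type Doc = { ts: number; id: string };

// Strict timestamp cursor: drops docs that share the checkpoint millisecond.
const afterTs = (docs: Doc[], ts: number): Doc[] => docs.filter((d) => d.ts > ts);

// Compound (timestamp, id) cursor: same-millisecond docs with a later id survive.
const afterCursor = (docs: Doc[], ts: number, id: string): Doc[] =>
  docs.filter((d) => d.ts > ts || (d.ts === ts && d.id > id));
```

With three docs all at `ts = 100` and a checkpoint of `(100, "a")`, the strict filter returns nothing while the compound cursor still returns the two unprocessed docs.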
1 — Log-slice pagination for CCS extraction
CCS extraction previously used a single-pass entity loop with no raw-log capping.
It now uses the same two-level pagination that local extraction uses:
Outer loop — log slices
A boundary probe (`buildLogPaginationCursorProbeEsql`) runs before each entity batch.
It sorts raw logs ascending by `(@timestamp, _id)`, takes the first `maxLogsPerPage` documents, and returns the last one as the inclusive slice end (`sliceEnd`) plus a `total_logs` count. When `total_logs ≤ maxLogsPerPage`, the window is exhausted and no further probe is needed.
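The probe's behavior can be sketched over an in-memory log array (a sketch with hypothetical names, not the actual ES|QL implementation):

```typescript
type RawLog = { timestamp: string; id: string };

// In-memory analogue of the boundary probe: sort ascending by
// (@timestamp, _id), take the first maxLogsPerPage docs, and return the
// last one as the inclusive slice end plus a total count.
function probeSliceEnd(logs: RawLog[], maxLogsPerPage: number) {
  const sorted = [...logs].sort(
    (a, b) => a.timestamp.localeCompare(b.timestamp) || a.id.localeCompare(b.id)
  );
  const page = sorted.slice(0, maxLogsPerPage);
  return {
    sliceEnd: page[page.length - 1], // inclusive upper bound for this slice
    totalLogs: sorted.length,        // total_logs in the window
    exhausted: sorted.length <= maxLogsPerPage, // no further probe needed
  };
}
```

When `exhausted` is true, the current slice covers the whole remaining window and the outer loop can stop probing.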
Inner loop — entity pages
Within each slice, entities are paginated by `(_firstSeenLog, entity.id)` up to `docsLimit` per query. The slice boundary (`sliceEnd`) is applied as a compound inclusive upper bound on every entity page.
State persistence
After each entity page, `checkpointTimestamp` and `paginationRecoveryId` are written so a mid-slice crash can be resumed on the next run without re-processing already-ingested entities.
After a slice completes, `checkpointTimestamp` advances to the slice end and `paginationRecoveryId` is cleared.
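Put together, the slice loop and its checkpointing can be sketched as an in-memory simulation (hypothetical shapes and names; the real client issues ES|QL queries, persists a saved object, and pages entities by a compound cursor rather than one pass per slice):

```typescript
type Log = { ts: number; id: string; entity: string };

// Simulated two-level pagination: outer loop over log slices of at most
// maxLogsPerPage docs, inner loop over the entities seen in each slice,
// with checkpoint state updated as described above.
function simulateExtraction(logs: Log[], maxLogsPerPage: number) {
  const state = { checkpointTimestamp: -1, paginationRecoveryId: null as string | null };
  const processed: string[] = []; // entities processed, in order
  let remaining = [...logs].sort((a, b) => a.ts - b.ts || a.id.localeCompare(b.id));
  while (remaining.length > 0) {
    // Outer loop: the probe takes the first maxLogsPerPage logs; the last
    // one is the inclusive slice end.
    const slice = remaining.slice(0, maxLogsPerPage);
    const sliceEnd = slice[slice.length - 1];
    // Inner loop: process entities first seen within the slice.
    const entities = Array.from(new Set(slice.map((l) => l.entity))).sort();
    for (const e of entities) {
      processed.push(e);
      // Mid-slice checkpoint: resumable after a crash without re-ingesting.
      state.paginationRecoveryId = e;
    }
    // Slice complete: advance the checkpoint, clear the recovery cursor.
    state.checkpointTimestamp = sliceEnd.ts;
    state.paginationRecoveryId = null;
    remaining = remaining.slice(maxLogsPerPage);
  }
  return { processed, state };
}
```

The same entity can appear in multiple slices; deduplication happens downstream at ingestion, not in the pagination loop.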
2 — Independent timestamp management for CCS
CCS extraction no longer receives `fromDateISO`/`toDateISO` from the caller. It now computes and owns its own time window using a new `CcsLogExtractionState` saved object.

`CcsExtractToUpdatesParams` changes

| Removed | Added |
|---|---|
| `fromDateISO` | `lookbackPeriod` — how far back to look on a fresh start (e.g. `'3h'`) |
| `toDateISO` | `delay` — trailing-edge delay applied to `now` for `toDateISO` (e.g. `'1m'`) |
| | `windowOverride?` — explicit `{ fromDateISO, toDateISO }` for API-triggered runs |

`CcsLogExtractionState` saved object (new)

| Field | Purpose |
|---|---|
| `checkpointTimestamp` | `_firstSeenLog` of the last processed entity; used as `fromDateISO` on the next run |
| `paginationRecoveryId` | Entity ID cursor for mid-slice crash recovery |

Window resolution (`resolveExtractionWindow`)

```
windowOverride set → use it directly; skip all state reads/writes (isOverride = true)
paginationRecoveryId set → effectiveFrom = checkpointTimestamp, recoveryId = paginationRecoveryId
checkpointTimestamp set → effectiveFrom = checkpointTimestamp (normal continuation)
otherwise → effectiveFrom = now − lookbackPeriod (fresh start)
toDateISO = now − delay (always, unless override)
```

API-triggered runs (`windowOverride` set) pass `skipStateUpdates = true` to both loops so they never corrupt the scheduled-run checkpoint.
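The precedence order above can be sketched as follows (hypothetical types and names; a sketch of the resolution logic, not the actual saved-object client; `lookbackMs`/`delayMs` stand in for the parsed `lookbackPeriod`/`delay` durations):

```typescript
interface State { checkpointTimestamp?: string; paginationRecoveryId?: string }
interface Window { fromDateISO: string; toDateISO: string; recoveryId?: string; isOverride: boolean }

// Sketch of the window-resolution precedence described above.
function resolveWindow(
  now: Date,
  lookbackMs: number,
  delayMs: number,
  state: State,
  windowOverride?: { fromDateISO: string; toDateISO: string }
): Window {
  if (windowOverride) {
    // Explicit window: skip all state reads/writes.
    return { ...windowOverride, isOverride: true };
  }
  const toDateISO = new Date(now.getTime() - delayMs).toISOString();
  if (state.paginationRecoveryId && state.checkpointTimestamp) {
    // Mid-slice crash recovery: resume from the checkpoint with the entity cursor.
    return {
      fromDateISO: state.checkpointTimestamp,
      toDateISO,
      recoveryId: state.paginationRecoveryId,
      isOverride: false,
    };
  }
  if (state.checkpointTimestamp) {
    // Normal continuation from the last checkpoint.
    return { fromDateISO: state.checkpointTimestamp, toDateISO, isOverride: false };
  }
  // Fresh start: look back lookbackPeriod from now.
  return {
    fromDateISO: new Date(now.getTime() - lookbackMs).toISOString(),
    toDateISO,
    isOverride: false,
  };
}
```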
Callers updated
- `LogsExtractionClient`: removes `fromDateISO`/`toDateISO` from the CCS call; passes `lookbackPeriod` and `delay` from config.
- `force_ccs_extract_to_updates` route: keeps `fromDateISO`/`toDateISO` in the request body (explicit intent) and forwards them as `windowOverride`.

Testing manually:
1. Start an ECH deployment on 9.4-SNAPSHOT.
2. Go to Stack Management > API Keys and generate a new `Cross-Cluster` API key. Save the provided credentials.
3. Start Kibana and Elasticsearch locally.
4. Add the stored credentials to the local deployment by running `.es/9.4.0/bin/elasticsearch-keystore add cluster.remote.${REMOTE_CLUSTER_NAME}.credentials` in your CLI. This command will prompt you for the credential.
5. Reload security settings from Kibana Dev Tools: `POST /_nodes/reload_secure_settings`
6. In the cloud console of your deployment, under Security, copy the proxy address at the bottom of the page.
7. Register a new cluster with the proxy address:

```
PUT _cluster/settings
{
  "persistent": {
    "cluster.remote.${REMOTE_CLUSTER_NAME}.mode": "proxy",
    "cluster.remote.${REMOTE_CLUSTER_NAME}.proxy_address": "${PROXY_ADDRESS}"
  }
}
```

8. Add data to the remote cluster and observe it being ingested in your environment!