Expose frozen indices information on the rule health endpoint by denar50 · Pull Request #219703 · elastic/kibana

denar50 · 2025-04-30T11:37:08Z

Summary

This is a follow-up PR that exposes the frozen_indices_queried_max_count metric on the rule health endpoint.
This metric is an aggregation of frozen_indices_queried_count, which is calculated on each rule execution. See #218435 for more details.

How to test this?

Run Elasticsearch locally with these additional parameters to enable the frozen data tier: -E path.repo="/tmp" -E xpack.searchable.snapshot.shared_cache.size=20GB.
Use this tutorial (https://docs.elastic.dev/security-soution/analyst-experience-team/eng-prod/how-to/configure-local-frozen-tier) to create the snapshot repository and an ILM policy. You can disable rollover for the ILM policy and configure indices to move to the frozen phase after 0 days.
Create an index manually and populate it with a couple of documents.
Assign the ILM policy to the index you created in the previous step and wait for it to move to the frozen phase. To speed up the process, you can lower the ILM poll interval:

PUT /_cluster/settings
{
  "persistent": {
    "indices.lifecycle.poll_interval": "10s"
  }
}

You can confirm that the index is in the frozen phase by calling:

GET <YOUR_IDX_HERE>/_ilm/explain

In the response, phase should be frozen and step should be complete.

Create a rule querying the frozen index.
Call the rule health endpoint with:

curl -X POST --user elastic:changeme "http://localhost:5601/internal/detection_engine/health/_rule?date_start=2025-04-29T09:07:39.489Z&date_end=2025-05-01T09:08:39.489Z" \
  -H "Content-Type: application/json" \
  -H "elastic-api-version: 1" \
  -H 'kbn-xsrf: 123' \
  -H "x-elastic-internal-origin: Kibana" \
  --data '{"rule_id":"2f9780b5-7819-4685-ab8e-d817d3701d10"}'

You should see frozen_indices_queried_max_count populated with 1.
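The two checks in the steps above (the ILM explain output and the metric in the health response) can be sketched in Python. This is a minimal sketch, not part of the PR: the helper names are hypothetical, and the sample payloads are trimmed, made-up shapes used only to exercise the helpers. Because the exact nesting of the health response is not shown here, the metric lookup walks the whole parsed JSON instead of assuming a path.

```python
from typing import Any, Iterator


def index_is_frozen(explain_response: dict, index_name: str) -> bool:
    """Return True if the ILM explain output reports phase=frozen and step=complete."""
    info = explain_response.get("indices", {}).get(index_name, {})
    return info.get("phase") == "frozen" and info.get("step") == "complete"


def find_metric_values(payload: Any, key: str) -> Iterator[Any]:
    """Yield every value stored under `key`, at any nesting depth of parsed JSON."""
    if isinstance(payload, dict):
        for name, value in payload.items():
            if name == key:
                yield value
            else:
                yield from find_metric_values(value, key)
    elif isinstance(payload, list):
        for item in payload:
            yield from find_metric_values(item, key)


# Hypothetical, heavily trimmed responses (assumed shapes, not real output).
sample_explain = {
    "indices": {"my-frozen-index": {"phase": "frozen", "step": "complete"}}
}
sample_health = {
    "history": [
        {"stats": {"frozen_indices_queried_max_count": 0}},
        {"stats": {"frozen_indices_queried_max_count": 1}},
    ]
}

print(index_is_frozen(sample_explain, "my-frozen-index"))
print(max(find_metric_values(sample_health, "frozen_indices_queried_max_count")))
```

Taking the maximum over all matches is a convenience for a quick manual check; it says nothing about which bucket of the response carries the value.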

elasticmachine · 2025-04-30T11:37:59Z

Pinging @elastic/security-detection-engine (Team:Detection Engine)

jkelas · 2025-05-08T09:20:07Z

The code looks fine, but I found an issue while testing: I couldn't get frozen_indices_queried_max_count to show 1; it was always 0. I paired up with the author, @denar50, and we worked through the code together. The author needs to retest in his own environment, so I passed him all the details of how I set up mine. Waiting for an update from @denar50.

elasticmachine · 2025-05-08T12:35:16Z

💛 Build succeeded, but was flaky

Buildkite Build
Commit: f7610d9

Failed CI Steps

Metrics [docs]

✅ unchanged

History

💛 Build #298707 was flaky 6543310
💚 Build #298428 succeeded ed21961
💚 Build #297613 succeeded b25b71b
💔 Build #297530 failed 510cf09

jkelas

After pairing up again with the author, we concluded that it didn't work the previous time because the time range for the query was incorrect. With that fixed, the behavior is correct: the response shows "frozen_indices_queried_max_count": 1 as expected.

The code looks OK.

Testing was done according to the instructions in the ticket.
My curl command looked like this:

curl -X POST --user elastic:changeme "http://localhost:5621/kbn/internal/detection_engine/health/_rule?date_start=2025-04-29T09:07:39.489Z&date_end=2025-05-09T09:08:39.489Z" \
  -H "Content-Type: application/json" \
  -H "elastic-api-version: 1" \
  -H 'kbn-xsrf: 123' \
  -H "x-elastic-internal-origin: Kibana" \
  --data '{"rule_id":"7ca6fb8f-3f5f-4fea-a577-72eccc8e001c"}' | jq

and at the end of the printout, in the last bucket, I can see this:

        "indexing_duration_ms": {
          "percentiles": {
            "50.0": 4,
            "95.0": 4,
            "99.0": 5.519999999999996,
            "99.9": 5.952000000000005
          }
        },
        "frozen_indices_queried_max_count": 1
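Since the earlier failure came down to the time range, one way to avoid hard-coded dates is to derive date_start and date_end from the current time. The following is a minimal sketch, assuming a local Kibana like the one in the curl command above; the function names and the lookback window are hypothetical, and the request is only built, not sent.

```python
import base64
import json
from datetime import datetime, timedelta, timezone
from urllib.parse import urlencode
from urllib.request import Request


def to_iso_millis(moment: datetime) -> str:
    """Format an aware datetime as UTC ISO 8601 with milliseconds and a 'Z' suffix."""
    utc = moment.astimezone(timezone.utc)
    return utc.strftime("%Y-%m-%dT%H:%M:%S.") + f"{utc.microsecond // 1000:03d}Z"


def build_rule_health_request(
    base_url: str,
    rule_id: str,
    end: datetime,
    lookback: timedelta,
    user: str = "elastic",
    password: str = "changeme",
) -> Request:
    """Build (but do not send) the POST request for the rule health endpoint."""
    # The window [end - lookback, end] must cover the rule executions of interest.
    query = urlencode(
        {"date_start": to_iso_millis(end - lookback), "date_end": to_iso_millis(end)},
        safe=":",
    )
    url = f"{base_url}/internal/detection_engine/health/_rule?{query}"
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    headers = {
        "Content-Type": "application/json",
        "elastic-api-version": "1",
        "kbn-xsrf": "123",
        "x-elastic-internal-origin": "Kibana",
        "Authorization": f"Basic {token}",
    }
    body = json.dumps({"rule_id": rule_id}).encode("utf-8")
    return Request(url, data=body, headers=headers, method="POST")


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    # Hypothetical one-day lookback; widen it if the rule ran earlier.
    request = build_rule_health_request(
        "http://localhost:5601",
        "7ca6fb8f-3f5f-4fea-a577-72eccc8e001c",
        end=now,
        lookback=timedelta(days=1),
    )
    print(request.full_url)
```

To send it, pass the returned object to urllib.request.urlopen and parse the body with json.load. The base URL may need a base path, as in the http://localhost:5621/kbn example above.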

kibanamachine · 2025-05-08T15:11:58Z

Starting backport for target branches: 8.19

https://github.com/elastic/kibana/actions/runs/14909813866

kibanamachine · 2025-05-08T15:27:51Z

💔 All backports failed

Status	Branch	Result
❌	8.19	Backport failed because of merge conflicts

Manual backport

To create the backport manually run:

node scripts/backport --pr 219703

Questions ?

Please refer to the Backport tool documentation

## Summary

This is a follow up PR to expose the metric `frozen_indices_queried_max_count` on the rule healthcheck endpoint.
This metric is an aggregation of the metric `frozen_indices_queried_count` which is calculated upon rule execution. Refer to [this PR](#218435) to see more details about it.

## How to test this?

- Run Elastic locally with these additional parameters in order to enable the frozen data tier: -E path.repo="/tmp" -E xpack.searchable.snapshot.shared_cache.size=20GB.
- Use [this tutorial](https://docs.elastic.dev/security-soution/analyst-experience-team/eng-prod/how-to/configure-local-frozen-tier) to create the snapshot repository and an ILM policy. You can disable rollover for the ILM policy and configure indices to be moved to frozen after 0 days.
- Create an index manually and populate it with a couple of documents.
- Assign the ILM policy to the index you created in the previous step and wait for it to be rolled to frozen. You can run this command to speed up the process:

```
PUT /_cluster/settings
{
  "persistent": {
    "indices.lifecycle.poll_interval": "10s"
  }
}
```

You can confirm that the index is indeed in frozen by calling

```
GET <YOUR_IDX_HERE>/_ilm/explain
```

`phase` should be `frozen` and `step` should be `complete`.

- Create a rule querying the frozen index.
- Call the rule health endpoint with:

```
curl -X POST --user elastic:changeme "http://localhost:5601/internal/detection_engine/health/_rule?date_start=2025-04-29T09:07:39.489Z&date_end=2025-05-01T09:08:39.489Z" \
  -H "Content-Type: application/json" \
  -H "elastic-api-version: 1" \
  -H 'kbn-xsrf: 123' \
  -H "x-elastic-internal-origin: Kibana" \
  --data '{"rule_id":"2f9780b5-7819-4685-ab8e-d817d3701d10"}'
```

You should see `frozen_indices_queried_max_count` populated with `1`.

(cherry picked from commit 0544125)

# Conflicts:
# x-pack/solutions/security/plugins/security_solution/common/api/detection_engine/rule_monitoring/detection_engine_health/health_endpoints.md
# x-pack/solutions/security/plugins/security_solution/server/lib/detection_engine/rule_monitoring/logic/detection_engine_health/event_log/aggregations/types.ts

denar50 · 2025-05-08T15:51:44Z

💚 All backports created successfully

Status	Branch	Result
✅	8.19

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation


…219703) (#220540)

# Backport

This will backport the following commits from `main` to `8.19`:

- [Expose frozen indices information on the rule health endpoint (#219703)](#219703)

### Questions ?

Please refer to the [Backport tool documentation](https://github.com/sorenlouv/backport)


denar50 requested a review from a team as a code owner April 30, 2025 11:37

denar50 requested a review from nikitaindik April 30, 2025 11:37

denar50 added the release_note:skip, backport:skip, and Team:Detection Engine labels Apr 30, 2025

denar50 force-pushed the security-team-12387-expose-frozen-indices-stats-on-rule-execution-summary-endpoint branch from dcf4162 to 510cf09 Compare April 30, 2025 11:42

denar50 added the backport:version and v8.19.0 labels and removed the backport:skip label Apr 30, 2025

denar50 force-pushed the security-team-12387-expose-frozen-indices-stats-on-rule-execution-summary-endpoint branch 3 times, most recently from ed21961 to 6543310 Compare May 5, 2025 16:13

jkelas requested review from jkelas and removed request for nikitaindik May 7, 2025 14:27

Expose frozen indices information on the rule health endpoint

f7610d9

denar50 force-pushed the security-team-12387-expose-frozen-indices-stats-on-rule-execution-summary-endpoint branch from 6543310 to f7610d9 Compare May 8, 2025 10:56

jkelas approved these changes May 8, 2025


denar50 added the v9.1.0 label May 8, 2025

denar50 merged commit 0544125 into main May 8, 2025
11 checks passed

denar50 deleted the security-team-12387-expose-frozen-indices-stats-on-rule-execution-summary-endpoint branch May 8, 2025 15:11

denar50 mentioned this pull request May 8, 2025

[8.19] Expose frozen indices information on the rule health endpoint (#219703) #220540

Merged

kibanamachine mentioned this pull request May 8, 2025

[DOCS] Update CrowdStrike and SentinelOne connectors #219887

Merged
