another test branch by NickDris · Pull Request #4 · NickDris/rally-tracks

NickDris · 2025-08-28T09:00:57Z

[8.15] Add three searches that fetch _source as separate challenges to elastic/logs track (Add three searches that fetch _source as separate challenges to elastic/logs track elastic/rally-tracks#622) ([8.15] Add three searches that fetch _source as separate challenges to elastic/logs track elastic/rally-tracks#625)
Update elastic/logs integrations to 8.13.3 (Update integrations to 8.13.3 elastic/rally-tracks#613) (Update elastic/logs integrations to 8.13.3 (#613) elastic/rally-tracks#628)
Introduce an index_mode param into the elastic/security Rally track (Introduce an index_mode param into the elastic/security Rally track elastic/rally-tracks#640) (Introduce an index_mode param into the elastic/security Rally track elastic/rally-tracks#641)
fix: allow skipping delete of fleet component template (Allow skipping delete of fleet component template elastic/rally-tracks#642) (Allow skipping delete of fleet component template elastic/rally-tracks#643)
fix: skip deleting some required component templates for security (Skip deleting some required component templates for security elastic/rally-tracks#645)
Disallow usage of ilm lifecycle for serverless (Disallow usage of ilm lifecycle for serverless elastic/rally-tracks#648)
fix: prevent usage of ilm lifecycle in serverless (Prevent usage of ilm lifecycle in serverless elastic/rally-tracks#651)
Synthetic source does not still support copy_to (Synthetic source does not still support copy_to elastic/rally-tracks#653)
[8.15] Keep array source in logsdb mode ([8.15] Keep array source in logsdb mode elastic/rally-tracks#656)
Update to use JDK 21 for build (Update to use JDK 21 for build elastic/rally-tracks#660) (Backport #660 to 8.15 elastic/rally-tracks#663)
Backport Paramaterise timeout elastic/rally-tracks#661 (Backport #661 elastic/rally-tracks#662)
Include ability to force merge segments in elastic/security (Include ability to force merge segments in elastic/security elastic/rally-tracks#657) (Include ability to force merge segments in elastic/security elastic/rally-tracks#659)
Modify the default vector index type to int8_hnsw (Modify the default vector index type to int8_hnsw elastic/rally-tracks#658) (Modify the default vector index type to int8_hnsw (#658) elastic/rally-tracks#664)
Re-enable copy_to into elastic/security (Re-enable copy_to into elastic/security elastic/rally-tracks#667) (Re-enable copy_to into elastic/security (#667) elastic/rally-tracks#669)
Enable logsdb index mode in security track (Enable logsdb index mode in security track elastic/rally-tracks#670) (Enable logsdb index mode in security track (#670) elastic/rally-tracks#674)
[8.15] Add logsdb support to http_logs track (Add logsdb support to http_logs track elastic/rally-tracks#672) ([8.15] Add logsdb support to http_logs track elastic/rally-tracks#676)
host.name is empty we need to use host.hostname (host.name is empty we need to use host.hostname elastic/rally-tracks#678) ([8.15] host.name is empty we need to use host.hostname (#678) elastic/rally-tracks#679)
Add backport action (Add backport action elastic/rally-tracks#599) ([8.15] Add backport action (#599) elastic/rally-tracks#680)
Include a variable to control synthetic_source_keep parameter (Include a variable to control synthetic_source_keep parameter elastic/rally-tracks#682) (Backport synthetic source keep param elastic/rally-tracks#684)
[8.15] Add synthetic_source_keep param to tsdb and http_logs tracks (Add synthetic_source_keep param to tsdb and http_logs tracks elastic/rally-tracks#683) ([8.15] Add synthetic_source_keep param to tsdb and http_logs tracks elastic/rally-tracks#685)
Ignore VSCode files (Ignore VSCode files elastic/rally-tracks#686) ([8.15] Ignore VSCode files (#686) elastic/rally-tracks#688)
Skip Fleet component templates in serverless (Skip Fleet component templates in serverless elastic/rally-tracks#697) ([8.15] Skip Fleet component templates in serverless (#697) elastic/rally-tracks#700)
host.id has lower cardinality (Use host.id for sorting security data elastic/rally-tracks#687) ([8.15] host.id has lower cardinality (#687) elastic/rally-tracks#689)
Add support for source mode to various tracks ([8.15] Add support for source mode to various tracks (#692) elastic/rally-tracks#699)
Backport/8.15/pr 675 (Backport/8.15/pr 675 elastic/rally-tracks#695)
Use polling mode for force merge in tsdb track (Use polling mode for force merge in tsdb track elastic/rally-tracks#707). ([8.15] Use polling mode for force merge in tsdb track (#707) elastic/rally-tracks#709)
[8.15] Add recall and NDCG operations in msmarco-v2-vector ([8.15] Add recall and NDCG operations in msmarco-v2-vector elastic/rally-tracks#710)
Bump versions after 3.8 deprecation (Bump versions after 3.8 deprecation elastic/rally-tracks#729) ([8.15] Bump versions after 3.8 deprecation (#729) elastic/rally-tracks#736)
Deep terms agg (Deep terms agg elastic/rally-tracks#733) ([8.15] Deep terms agg (#733) elastic/rally-tracks#735)
Bump action versions (Bump action versions elastic/rally-tracks#724) (Bump action versions (#724) elastic/rally-tracks#763)
Update backport.yml (Update backport.yml elastic/rally-tracks#758) (Update backport.yml (#758) elastic/rally-tracks#762)
[8.15] Pin distribution version in ITs
Revert "[8.15] Pin distribution version in ITs"
Switch serverless ITs to vector-optimized serverless project (Switch serverless ITs to vector-optimized serverless projects elastic/rally-tracks#795) ([8.15] Switch serverless ITs to vector-optimized serverless project (#795) elastic/rally-tracks#810)
Add extra cleanup fixture to ITs (Add extra cleanup fixture to ITs elastic/rally-tracks#794) ([8.15] Add extra cleanup fixture to ITs (#794) elastic/rally-tracks#808)
Properly map the rally.doc_size and rally.message_size fields in elastic/logs track (Properly map the rally.doc_size and rally.message_size fields in elastic/logs track elastic/rally-tracks#716) ([8.15] Properly map the rally.doc_size and rally.message_size fields in elastic/logs track (#716) elastic/rally-tracks#717)
Switch to dedicated backport token (Switch to dedicated backport token elastic/rally-tracks#811) ([8.15] Switch to dedicated backport token (#811) elastic/rally-tracks#817)
Limit CI workflow scope for pushes (Limit CI workflow scope for pushes elastic/rally-tracks#818) ([8.15] Limit CI workflow scope for pushes (#818) elastic/rally-tracks#819)
[8.15] Use distribution version in ITs ([8.15] Use distribution version in ITs elastic/rally-tracks#812)
Trim unnecessary ITs (Trim unnecessary ITs elastic/rally-tracks#809) ([8.15] Trim unnecessary ITs (#809) elastic/rally-tracks#820)
Remove unnecessary dependency (Remove unnecessary dependency elastic/rally-tracks#821) ([8.15] Remove unnecessary dependency (#821) elastic/rally-tracks#822)
Update README.md (Update README.md elastic/rally-tracks#604) ([8.15] Update README.md (#604) elastic/rally-tracks#836)
Just a comment
Just another test

…o elastic/logs track (elastic#622) (elastic#625) Backporting elastic#622 to the 8.15 branch. Otherwise rally nightly and esbench will not pick the change up. The version of Elasticsearch main is 8.15.0-SNAPSHOT and therefor rally nightly / esbench will use the rally track's 8.15 branch. These search are based from searches that are executed by search/discovery search challenge that fetch top documents. The queries are without query and fetch 100, 500 and 1000 documents. The source is fetches using the field fetch feature, like in the searches that execute as part of search/discovery workflow. The problem is that we see combined latency and service time for search/discovery challenge and not for each search that is executed as part of this challenge (it is composite operation). By adding these searches we can get latency and service time of searches that specifically fetch _source. This information is useful for the logsdb effort. Synthetic source makes fetching _source more expensive, but currently we can't introspect at a closer level what the impact is (since search/discovery search challenge report latency / service time for multiple operations).

Update integrations for elastic/logs to 8.13.3

…ck (elastic#640) (elastic#641) The index_mode parameter will be used to run Rally benchmarks comparing indexing using standard and logsdb mode for the elastic/security track. Enabling LogsDB is done by means of a component template which is added and later used if the index_mode is provided. In case it is missing no index mode will be used which will default to standard.

…lastic#643) The fleet component template is used when we try to delete it. Here we introduce a parameter that allows us to skip deletion of the component template. The default value is false, which means normally we attempt to delete it. Setting it explicitly to true we avoid deleting it. This prevents errors happening if we try to delete it and it is in use.

…astic#645)

Serverless deployments miss ILM. As a result component templates should not use the lifecycle setting. Here we introduce a setting which allows us to exclude the lifecycle setting either using `lifecycle` parameter or a `build_flavor` parameter. This mimics what we do already for the elastic/logs track.

Backport to 8.15: - Keep array source in logsdb mode (655)

* Update to use JDK 21 for build

* Paramaterise timeout * Update README.md --------- Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

…c#657) (elastic#659)

…tic#664) From ES v8.14 the default index type for dense_vectors is int8_hnsw. This modifies our rally tracks to refect it.

`copy_to` is used to copy from `kubernetes.event.message` to `message`. Now it is supported in Elasticsearch 8.15 and we can benchmark the security track including it. We also remove a parameter which was used to run a modified workflow, which was using `kubernetes.event.message` instead of `message`.

This PR changes the security track so that we can enable LogsDB in index templates. Note that the failure store is only available in serverless so we gate its usage excluding it in case the deployment is not serverless. For LogsDB testing we rely on Kibana to install all other component/composable templates. This is to make sure we need limited changes to the Rally track. While testing this new configuration we discovered that installation of (component) templates done by Kibana is Serverless only happens when a user interacts with it. This means (component) templates are not installed and the `elastic/security` track execution fails as a result of using (component) templates that do not exist.

This back ports elastic#672 to 8.15 branch. * `enable_logsdb` (default: false) Determines whether the logsdb index mode gets used. If set then index sorting is configured to only use `@timestamp` field and the `source_enabled` parameter will have no effect. * `force_merge_max_num_segments` (default: unset): An integer specifying the max amount of segments the force-merge operation should use.

…astic#679) If the `host.name` field does not exists, indices created as backing indices of a data stream are injected with empty values of `host.name`. Sorting on `host.name` and `@timestamp` results in sorting just on `@timestamp`. Looking at some mappings I see a `host.hostname` exists. Also a cardinality aggregation results in hundreds of distinct values which suggests the filed is not empty. We would like to test using a meaningful combination of fields to sort on. Ideally we expect better benchmark results despite being possible that other, more effective, combinations of fields might exist. We are interested, anyway, in changes over time **given a valid set of fields to sort on**. (cherry picked from commit 0ca00a0)

(cherry picked from commit 3ae3304) Co-authored-by: Gareth Ellis <gareth.ellis@elastic.co>

…tic#682) (elastic#684) This PR introduces a new track parameter, `synthetic_source_keep` which is used to control the behaviour of synthetic source for all field types. It can have values `none`, `arrays` or `all` (`all` not usable when set at index level). See elastic/elasticsearch#112706 to understand the effect of each value. Later on we will use this to change the behaviour in our nightlies and run benchmarks on both `elastic/logs` and `elastic/security` using value `arrays`.

…lastic#683) (elastic#685) Backporting elastic#683 to 8.15 branch. The addition of the index.mapping.synthetic_source_keep to tsdb is new. To http_logs is not and before the index.mapping.synthetic_source_keep setting was hard coded to arrays. I will open a separate PR that adds the source_keep track param to nightly configs. Having the source_keep makes comparing benchmark results between the different source keep options easier.

(cherry picked from commit 4493616) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

Skips Fleet component templates in elastic/logs when running with serverless (cherry picked from commit 4cd9d4a) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

host.hostname has cardinality 100 while host.id has cardinality 50. This happen because in the dataset there is a host.if per each couple ho hostnames, like a single host.id and for each of them two hostnames like 'dustin.windows' and 'dustin.linux'. This is probably an artifact of the data generation script. Lower cardinality fields might: * reduce sorting overhead due to less comparisons * improve compression due to more data clustering together This change should at least allow us if there is any benefit in choosing a lower cardinality field. (cherry picked from commit e2ca95e) Co-authored-by: Salvatore Campagna <93581129+salvatore-campagna@users.noreply.github.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

Add support for source mode to elastic/logs, elastic/security and http_logs. Backport of elastic#692 to 8.15 branch. (cherry picked from commit ae63824)

…c#709) (cherry picked from commit 18c88a9) Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>

* Add recall and NDCG operations in msmarco-v2-vector (elastic#610) This change adds an operation called knn-recall that computes the following metrics: * Recall * NDCG * Avg number of nodes visited during search Given the size of the corpus, the true top N values used for recall operations have been approximated offline for each query as follows: ``` { "knn": { "field": "emb", "query_vector": query['emb'], "k": 10000, "num_candidates": 10000 }, "rescore": { "window_size": 10000, "query": { "query_weight": 0, "rescore_query": { "script_score": { "query": { "match_all": {} }, "script": { "source": "double value = dotProduct(params.query_vector, 'emb'); return sigmoid(1, Math.E, -value);", "params": { "query_vector": vec } } } } } } } ``` This means that the computed recall is measured against the system's best possible approximate neighbor run rather than the actual top N. For the relevance metrics, the `qrels.tsv` file contains annotations for all the queries listed in `queries.json`. This file is generated from the original training data available at [ir_datasets/msmarco_passage_v2](https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/train). (cherry picked from commit b6f3535) * Exclude msmarco from IT tests (elastic#708) --------- Co-authored-by: Jim Ferenczi <jim.ferenczi@elastic.co>

(cherry picked from commit 17cc202)

(cherry picked from commit 23f6712) Co-authored-by: Nik Everett <nik9000@gmail.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

(cherry picked from commit 68fb7d4)

Pinning action to a full length commit SHA [see](https://docs.github.com/en/actions/security-for-github-actions/security-guides/security-hardening-for-github-actions#using-third-party-actions) (cherry picked from commit 50e9ddb) # Conflicts: # .github/workflows/backport.yml Co-authored-by: Paul McCann <paul.mccann@elastic.co> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

This reverts commit 6532f10.

…#795) (elastic#810) AD (anomaly detection) and DFA (data frame analytics) ML endpoints are no longer available in general purpose ES projects in QA, so this is switching IT tests to vector-optimized projects. This is a temporary workaround. Ultimately we need to differentiate serverless project type depending on Rally track. (cherry picked from commit 530092b) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

(cherry picked from commit b3a16c2) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

…tic/logs track (elastic#716) (elastic#717) (cherry picked from commit 76072a1) Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com> Co-authored-by: Oleksandr Kolomiiets <olkolomiiets@gmail.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

(cherry picked from commit 7250bb3) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

(cherry picked from commit 571300c) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

(cherry picked from commit 581478b) # Conflicts: # catalog-info.yaml

(cherry picked from commit 37a20a8) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

The parse tool requires the compressed file to be passed to it. (cherry picked from commit 3f72278) Co-authored-by: Jesse Bacon <dread.roberts@gmail.com>

martijnvg and others added 30 commits June 28, 2024 16:58

Update elastic/logs integrations to 8.13.3 (elastic#613) (elastic#628)

536056b

Update integrations for elastic/logs to 8.13.3

fix: skip deleting some required component templates for security (el…

8d6076f

…astic#645)

fix: prevent usage of ilm lifecycle in serverless (elastic#651)

fd21135

Synthetic source does not still support copy_to (elastic#653)

4bdaf2c

[8.15] Keep array source in logsdb mode (elastic#656)

2355ed9

Backport to 8.15: - Keep array source in logsdb mode (655)

Update to use JDK 21 for build (elastic#660) (elastic#663)

822bf5c

* Update to use JDK 21 for build

Backport elastic#661 (elastic#662)

fec60cc

* Paramaterise timeout * Update README.md --------- Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

Include ability to force merge segments in elastic/security (elasti…

7884032

…c#657) (elastic#659)

Modify the default vector index type to int8_hnsw (elastic#658) (elas…

0a8df44

…tic#664) From ES v8.14 the default index type for dense_vectors is int8_hnsw. This modifies our rally tracks to refect it.

Add backport action (elastic#599) (elastic#680)

bd4905f

(cherry picked from commit 3ae3304) Co-authored-by: Gareth Ellis <gareth.ellis@elastic.co>

Ignore VSCode files (elastic#686) (elastic#688)

f89dfeb

(cherry picked from commit 4493616) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

Skip Fleet component templates in serverless (elastic#697) (elastic#700)

5dfd9f5

Skips Fleet component templates in elastic/logs when running with serverless (cherry picked from commit 4cd9d4a) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

Add support for source mode to various tracks (elastic#699)

6b46254

Add support for source mode to elastic/logs, elastic/security and http_logs. Backport of elastic#692 to 8.15 branch. (cherry picked from commit ae63824)

Backport/8.15/pr 675 (elastic#695)

9452aed

Use polling mode for force merge in tsdb track (elastic#707). (elasti…

41b1dbc

…c#709) (cherry picked from commit 18c88a9) Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>

Bump versions after 3.8 deprecation (elastic#729) (elastic#736)

b27efd0

(cherry picked from commit 17cc202)

Deep terms agg (elastic#733) (elastic#735)

cc6c511

(cherry picked from commit 23f6712) Co-authored-by: Nik Everett <nik9000@gmail.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

Bump action versions (elastic#724) (elastic#763)

1ce2eba

(cherry picked from commit 68fb7d4)

gareth-ellis and others added 14 commits March 31, 2025 14:45

[8.15] Pin distribution version in ITs

6532f10

Revert "[8.15] Pin distribution version in ITs"

6c83a43

This reverts commit 6532f10.

Add extra cleanup fixture to ITs (elastic#794) (elastic#808)

c442fba

(cherry picked from commit b3a16c2) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

Switch to dedicated backport token (elastic#811) (elastic#817)

69c8609

(cherry picked from commit 7250bb3) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

Limit CI workflow scope for pushes (elastic#818) (elastic#819)

85e6735

(cherry picked from commit 571300c) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

[8.15] Use distribution version in ITs (elastic#812)

6d759d2

Trim unnecessary ITs (elastic#809) (elastic#820)

9c7cb69

(cherry picked from commit 581478b) # Conflicts: # catalog-info.yaml

Remove unnecessary dependency (elastic#821) (elastic#822)

689d524

(cherry picked from commit 37a20a8) Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

Update README.md (elastic#604) (elastic#836)

b32eb3f

The parse tool requires the compressed file to be passed to it. (cherry picked from commit 3f72278) Co-authored-by: Jesse Bacon <dread.roberts@gmail.com>

Just a comment

d1483c2

Just another test

76391e7

NickDris closed this Aug 28, 2025

NickDris deleted the another_test_branch branch September 3, 2025 09:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

another test branch#4

another test branch#4
NickDris wants to merge 44 commits intomasterfrom
another_test_branch

NickDris commented Aug 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Comments

Conversation

NickDris commented Aug 28, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Comments