
dris test branch #2

Closed

NickDris wants to merge 145 commits into 8.15 from dris-test-branch

Conversation

@NickDris

ebadyano and others added 30 commits June 7, 2024 20:54
Update integrations for elastic/logs to 8.13.3
This change adds an operation called knn-recall that computes the following metrics:
  * Recall
  * NDCG
  * Avg number of nodes visited during search

Given the size of the corpus, the true top N values used for recall operations have been approximated offline for each query as follows:
```
{
    "knn": {
        "field": "emb",
        "query_vector": query['emb'],
        "k": 10000,
        "num_candidates": 10000
    },
    "rescore": {
        "window_size": 10000,
        "query": {
            "query_weight": 0,
            "rescore_query": {
                "script_score": {
                    "query": {
                        "match_all": {}
                    },
                    "script": {
                        "source": "double value = dotProduct(params.query_vector, 'emb'); return sigmoid(1, Math.E, -value);",
                        "params": {
                            "query_vector": query['emb']
                        }
                    }
                }
            }
        }
    }
}
```
This means that the computed recall is measured against the system's best possible approximate neighbor run rather than the actual top N.

For the relevance metrics, the `qrels.tsv` file contains annotations for all the queries listed in `queries.json`. This file is generated from the original training data available at [ir_datasets/msmarco_passage_v2](https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/train).
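The recall and NDCG computations described above can be sketched as follows. This is an illustrative implementation only, not the track's actual code; the function names and signatures are hypothetical:

```python
import math

def recall_at_k(true_ids, retrieved_ids, k):
    # Fraction of the (approximated) true top-k neighbors found in the
    # retrieved top-k. Because the ground truth is approximated offline,
    # this measures recall against the best known approximate run.
    return len(set(true_ids[:k]) & set(retrieved_ids[:k])) / k

def ndcg_at_k(gains, k):
    # gains[i] is the graded relevance of the document retrieved at rank i.
    dcg = sum(g / math.log2(i + 2) for i, g in enumerate(gains[:k]))
    ideal = sorted(gains, reverse=True)
    idcg = sum(g / math.log2(i + 2) for i, g in enumerate(ideal[:k]))
    return dcg / idcg if idcg > 0 else 0.0
```

The average number of nodes visited, by contrast, comes from the search profile rather than from the result lists.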
* Make exclude conditional on not serverless

* Typo
…ic/logs track (elastic#622)

These searches are based on searches executed by the search/discovery search challenge that fetch top documents. The queries have no query clause and fetch 100, 500 and 1000 documents. The source is fetched using the field fetch feature, like in the searches that execute as part of the search/discovery workflow.

The problem is that we see combined latency and service time for the search/discovery challenge and not for each search that is executed as part of this challenge (it is a composite operation).

By adding these searches we can get the latency and service time of searches that specifically fetch _source. This information is useful for the logsdb effort. Synthetic source makes fetching _source more expensive, but currently we can't introspect at a closer level what the impact is (since the search/discovery search challenge reports latency / service time for multiple operations).
…ic/logs track (elastic#623)

Backporting elastic#622 to the 8.15 branch. Otherwise rally nightly and esbench will not pick the change up. The version of Elasticsearch main is 8.15.0-SNAPSHOT and therefore rally nightly / esbench will use the rally track's 8.15 branch.

These searches are based on searches executed by the search/discovery search challenge that fetch top documents. The queries have no query clause and fetch 100, 500 and 1000 documents. The source is fetched using the field fetch feature, like in the searches that execute as part of the search/discovery workflow.

The problem is that we see combined latency and service time for the search/discovery challenge and not for each search that is executed as part of this challenge (it is a composite operation).

By adding these searches we can get the latency and service time of searches that specifically fetch _source. This information is useful for the logsdb effort. Synthetic source makes fetching _source more expensive, but currently we can't introspect at a closer level what the impact is (since the search/discovery search challenge reports latency / service time for multiple operations).
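As a rough sketch, the added operations might issue bodies like the following. The exact query bodies are assumptions, not the track's literal definitions; only the sizes (100, 500, 1000) come from the commit message:

```python
def top_docs_search_body(size):
    # No query clause beyond match_all; the point is fetching _source
    # for the top `size` hits, which synthetic source makes costlier.
    return {
        "query": {"match_all": {}},
        "size": size,
        "_source": True,
    }

bodies = [top_docs_search_body(n) for n in (100, 500, 1000)]
```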
Most of these are optimized in main, except two, the ESQL versions of sorting by distance, which we've set to one iteration for now to save on benchmarking time before we do performance optimizations on those.
* Add query rule tests to the wikipedia track

* Update rule query to 8.15.0 syntax

* Add pinned

* Revert back to 8.15 syntax after local testing is completed
…weighted_tokens (elastic#633)

* Update msmarco-passage-ranking track to use sparse_vector instead of weighted_tokens

* Update field type to sparse_vector
* Add exclude to query ruleset benchmarks

* Linting

* Make pinned ID selection random

* Update wikipedia/track.py

Co-authored-by: Quentin Pradet <quentin.pradet@gmail.com>

---------

Co-authored-by: Quentin Pradet <quentin.pradet@gmail.com>
…ck (elastic#640)

The index_mode parameter will be used to run Rally benchmarks comparing
indexing using standard and logsdb mode for the elastic/security track.

Enabling LogsDB is done by means of a component template, which is added and later used if index_mode is provided. If it is missing, no index mode is set and Elasticsearch defaults to standard.
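A minimal sketch of the conditional, assuming a hypothetical helper that builds the component template body:

```python
def security_component_template(index_mode=None):
    # Hypothetical helper: only set index.mode when the track parameter
    # is provided; otherwise Elasticsearch falls back to standard mode.
    settings = {}
    if index_mode is not None:
        settings["index.mode"] = index_mode  # e.g. "logsdb"
    return {"template": {"settings": settings}}
```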
* Add a retriever test with optional rerank

* Linting

* Fix retriever query

* Fix retriever

* Linting
The fleet component template may still be in use when we try to delete it. Here we introduce a parameter that allows us to skip deletion of the component template. The default value is false, which means we normally attempt to delete it. Setting it explicitly to true avoids the deletion, preventing errors when we try to delete a template that is in use.
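The guarded deletion could be sketched like this; the wrapper and its parameter name are hypothetical, and `delete_fn` stands in for the actual client call:

```python
def delete_component_template(delete_fn, name, skip_delete=False):
    # Honor the new skip parameter (default: false) so a template that
    # is still in use (e.g. fleet's) is not deleted, avoiding errors.
    if skip_delete:
        return False
    delete_fn(name)
    return True
```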
Serverless deployments lack ILM. As a result, component templates should not use the lifecycle setting. Here we introduce a setting that allows us to exclude the lifecycle setting via either the `lifecycle` parameter or the `build_flavor` parameter. This mimics what we already do for the elastic/logs track.
This causes the elastic/security track to fail execution when index_mode is set to logsdb. This happens because LogsDB uses synthetic source, which in turn does not support copy_to. Support for copy_to is expected in Elasticsearch 8.16. In the meantime we simply exclude the copy_to setting from the mapping to avoid triggering the error.
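One way to exclude the setting, sketched here as a hypothetical helper that walks the mappings dict (the track may instead do this in its templates):

```python
def strip_copy_to(node):
    # Recursively drop copy_to entries from a mappings dict so that
    # synthetic source (implied by LogsDB) does not reject the mapping.
    if isinstance(node, dict):
        return {k: strip_copy_to(v) for k, v in node.items() if k != "copy_to"}
    if isinstance(node, list):
        return [strip_copy_to(v) for v in node]
    return node
```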
Test this change in nightlies, to decide whether to enable it by default.

Related to elastic/elasticsearch#112354
* Update to use JDK 21 for build
* Parameterise timeout

* Update README.md
From ES v8.14 the default index type for dense_vector fields is int8_hnsw. This modifies our rally tracks to reflect it.
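For illustration, a mapping that makes the quantized index type explicit (the dims value and helper are placeholders; `index_options.type` follows the Elasticsearch dense_vector mapping API):

```python
def dense_vector_mapping(dims, index_type="int8_hnsw"):
    # Since 8.14, int8_hnsw is the default; stating it explicitly keeps
    # benchmark configurations comparable across versions.
    return {
        "type": "dense_vector",
        "dims": dims,
        "index": True,
        "index_options": {"type": index_type},
    }
```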
`copy_to` is used to copy from `kubernetes.event.message` to `message`. It is now supported in Elasticsearch 8.15, so we can benchmark the security track including it. We also remove a parameter that was used to run a modified workflow using `kubernetes.event.message` instead of `message`.
This PR changes the security track so that we can enable LogsDB
in index templates. Note that the failure store is only available in serverless
so we gate its usage excluding it in case the deployment is not serverless.

For LogsDB testing we rely on Kibana to install all other component/composable
templates. This is to make sure we need limited changes to the Rally track.

While testing this new configuration we discovered that installation of (component) templates by Kibana in Serverless only happens when a user interacts with it. This means (component) templates are not installed, and the `elastic/security` track execution fails as a result of using (component) templates that do not exist.
* `enable_logsdb` (default: false): Determines whether the logsdb index mode is used. If set, index sorting is configured to use only the `@timestamp` field and the `source_enabled` parameter has no effect.
* `force_merge_max_num_segments` (default: unset): An integer specifying the maximum number of segments the force-merge operation should use.
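The interaction between `enable_logsdb` and index sorting could be sketched like this; the setting keys follow the Elasticsearch index settings API, while the helper itself is hypothetical:

```python
def logsdb_settings(enable_logsdb=False):
    # With logsdb enabled, sort only on @timestamp as described above;
    # source_enabled is ignored because logsdb implies synthetic source.
    if not enable_logsdb:
        return {}
    return {
        "index.mode": "logsdb",
        "index.sort.field": ["@timestamp"],
        "index.sort.order": ["desc"],
    }
```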
If the `host.name` field does not exist, indices created as backing indices of a data stream are injected with empty values of `host.name`. Sorting on `host.name` and `@timestamp` then effectively results in sorting just on `@timestamp`. Looking at some mappings, a `host.hostname` field exists, and a cardinality aggregation returns hundreds of distinct values, which suggests the field is not empty.

We would like to test using a meaningful combination of fields to sort on. Ideally we expect better benchmark results, even though other, more effective combinations of fields might exist. In any case, we are interested in changes over time **given a valid set of fields to sort on**.
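The cardinality check mentioned above corresponds to an aggregation body like the following (the aggregation name `distinct` is arbitrary):

```python
def distinct_values_agg(field):
    # size 0: we only want the aggregation result, not the hits
    return {
        "size": 0,
        "aggs": {"distinct": {"cardinality": {"field": field}}},
    }
```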
pmpailis and others added 25 commits July 2, 2025 16:48
…and (elastic#801)

Also add a `mapping` track parameter which controls whether all fields are mapped or almost all fields are unmapped. This allows benchmarking elastic/logs in an unmapped context with the experimental INSIST_🐔 esql command.
This unblocks queries using `pragma` when used with non-snapshot Elasticsearch builds.
* streams challenge

* dummy change

* add to test_logs

* review comments

* fix

* fix 2

* clean up

* remove newlines

* fix another thing

* control replicas

* fix and cleanup

* delete accidentally committed files

* reset properly

* add reroute back

* Update elastic/logs/pipelines/logs@stream.processing.json

Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>

* fixes

* remove pipeline

---------

Co-authored-by: Grzegorz Banasiak <grzegorz.banasiak@elastic.co>
Make sure the `logs` stream works with the `mapped = "unmapped"`
parameter. This isn't the case right now because all other data streams
are captured by the default `logs-*-*` mappings, but `logs` is not.

The "unmapped" parameter requires tests with dedicated ES cluster
because it operates on a different set of templates, so it cannot clean
templates created by earlier races. For this reason, elastic/logs tests
are not split in 2 modules.
…RoundTo transformation (elastic#824)

* add more date_histogram esql queries with different date range and intervals
Sort patterned-text elastic/logs track by host.name, message.template_id, timestamp.
* Adding request-timeout for recall metrics for so_vector

* adjusting request timeout logic
Now that #131907 is merged, remove all the source exclusions from our tracks, since this is handled by default.
This change adds a parameter called `paragraph_size` that determines how many random vectors are indexed per document. When the value is > 1, the track switches to the nested field automatically.
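A sketch of the switch, assuming a hypothetical document builder (the field names `paragraphs` and `emb` are made up for illustration):

```python
def build_doc(vectors, paragraph_size=1):
    # With paragraph_size > 1 each document carries several vectors, so
    # the track switches to a nested field, one vector per paragraph.
    if paragraph_size > 1:
        return {"paragraphs": [{"emb": v} for v in vectors[:paragraph_size]]}
    return {"emb": vectors[0]}
```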
@NickDris NickDris closed this Aug 28, 2025
@NickDris NickDris deleted the dris-test-branch branch August 28, 2025 08:16
NickDris added a commit that referenced this pull request Dec 10, 2025
NickDris added a commit that referenced this pull request Dec 11, 2025
NickDris added a commit that referenced this pull request Dec 11, 2025