Add recall and NDCG operations in msmarco-v2-vector #610

Merged
jimczi merged 27 commits into elastic:master from jimczi:jim/msmarco-v2-vector
Jun 10, 2024
Conversation

@jimczi
Contributor

@jimczi jimczi commented May 16, 2024

This change adds an operation called knn-recall that computes the following metrics:

  • Recall
  • NDCG
  • Avg number of nodes visited during search

The new queries-recall.json file contains all the queries (76 in total) from the test set, along with their embeddings and the top 1000 ids computed by brute force over the entire corpus.
For the relevance metrics, the qrels.tsv file contains annotations for all the queries listed in queries-recall.json. This file is generated from the original training data available at ir_datasets/msmarco_passage_v2.
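For reference, recall@k and NDCG@k can be computed from a ranked list of ids and the qrels annotations along these lines; this is a minimal sketch, not the track's actual implementation, and the function and variable names are illustrative:

```python
import math

def recall_at_k(retrieved, true_neighbors, k):
    """Fraction of the true top-k neighbors found in the first k results."""
    if not true_neighbors:
        return 0.0
    hits = len(set(retrieved[:k]) & set(true_neighbors[:k]))
    return hits / min(k, len(true_neighbors))

def ndcg_at_k(retrieved, qrels, k):
    """Normalized discounted cumulative gain over graded relevance judgments.

    qrels maps doc id -> relevance grade; unjudged docs count as 0.
    """
    dcg = sum(qrels.get(doc, 0) / math.log2(rank + 2)
              for rank, doc in enumerate(retrieved[:k]))
    ideal = sorted(qrels.values(), reverse=True)[:k]
    idcg = sum(rel / math.log2(rank + 2) for rank, rel in enumerate(ideal))
    return dcg / idcg if idcg > 0 else 0.0
```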

jimczi added 12 commits May 16, 2024 14:04
This change adds an operation called knn-recall that computes the following metrics:
  * Recall
  * NDCG
  * Avg number of nodes visited during search

Given the size of the corpus, the true top N values used for recall operations have been approximated offline for each query as follows:
```
{
    "knn": {
        "field": "emb",
        "query_vector": query['emb'],
        "k": 10000,
        "num_candidates": 10000
    },
    "rescore": {
        "window_size": 10000,
        "query": {
            "query_weight": 0,
            "rescore_query": {
                "script_score": {
                    "query": {
                        "match_all": {}
                    },
                    "script": {
                        "source": "double value = dotProduct(params.query_vector, 'emb'); return sigmoid(1, Math.E, -value);",
                        "params": {
                            "query_vector": vec
                        }
                    }
                }
            }
        }
    }
}
```
This means that the computed recall is measured against the system's best possible approximate nearest-neighbor run rather than the exact top N.
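In Painless, `sigmoid(value, k, a)` evaluates `value^a / (k^a + value^a)`, so `sigmoid(1, Math.E, -value)` reduces to the logistic function `1 / (1 + e^(-value))`, which maps the raw dot product into (0, 1) and keeps the rescore scores positive. A quick Python sketch of that equivalence, assuming the Painless definition above:

```python
import math

def painless_sigmoid(value, k, a):
    # Painless scoring helper: sigmoid(value, k, a) = value^a / (k^a + value^a)
    return value ** a / (k ** a + value ** a)

def rescore_score(dot_product):
    # sigmoid(1, Math.E, -value) == 1 / (1 + e^(-value)), the logistic function
    return painless_sigmoid(1.0, math.e, -dot_product)
```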

For the relevance metrics, the `qrels.tsv` file contains annotations for all the queries listed in `queries.json`. This file is generated from the original training data available at [ir_datasets/msmarco_passage_v2](https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/train).
@jimczi jimczi requested a review from afoucret May 17, 2024 12:29
@jimczi jimczi requested a review from 1stvamp May 17, 2024 13:14
"dynamic": false,
"_source": {
"enabled": false,
"mode": "synthetic"
Should this be a param?

Contributor

@afoucret afoucret left a comment


A few comments, but nothing that would prevent you from merging the PR.

```
for query in dataset.queries_iter():
    emb = await retrieve_embed_for_query(co, query[1])
    resp = await es.search(
        index="msmarco-v2", query=get_brute_force_query(emb), size=1000, _source=["_none_"], fields=["docid"]
    )
```
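The docid fields from each response could then be folded into the queries-recall.json entries; a hypothetical sketch (the field names are illustrative, not necessarily the track's actual schema):

```python
def to_recall_entry(query_id, query_text, embedding, response):
    """Build one queries-recall.json entry from an Elasticsearch response body (a dict)."""
    # Each hit carries the requested "docid" field as a single-element list
    ids = [hit["fields"]["docid"][0] for hit in response["hits"]["hits"]]
    return {"id": query_id, "text": query_text, "emb": embedding, "ids": ids}
```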

Maybe a param?

@jimczi jimczi merged commit b6f3535 into elastic:master Jun 10, 2024
@jimczi jimczi deleted the jim/msmarco-v2-vector branch June 10, 2024 13:32
@gareth-ellis
Member

@jimczi should this be backported to 8.15? I tried to backport #708, due to it adding about 50 minutes to IT tests, but it seems that this PR was never backported to 8.15, so the changes are only in master. (By default, Rally chooses the 8.15 branch when benchmarking against 8.x where the version being tested is 8.15 or later; serverless always runs from master.)

@jimczi
Contributor Author

jimczi commented Dec 4, 2024

Sorry for the delay here @gareth-ellis.
Are you trying to add this challenge somewhere? I can do the backport, but I'm not sure that's what you're suggesting here.

gareth-ellis pushed a commit to gareth-ellis/rally-tracks that referenced this pull request Dec 6, 2024
(cherry picked from commit b6f3535)
gareth-ellis added a commit that referenced this pull request Dec 16, 2024
* Add recall and NDCG operations in msmarco-v2-vector (#610)

(cherry picked from commit b6f3535)

* Exclude msmarco from IT tests (#708)

---------

Co-authored-by: Jim Ferenczi <jim.ferenczi@elastic.co>
