Adding recall testing to openAI track by benwtrent · Pull Request #702 · elastic/rally-tracks

benwtrent · 2024-10-29T19:15:01Z

This adds recall and greatly adjusts the OpenAI track.

I thinking we should do something similar with ALL our vector tracks, even for the dense vector track as recalculating the true knn, even for dense vector is wasted compute time.

This is a draft as its a hack for some local testing, if we really want to move forward with this, I can clean this up and we can commit it.

jimczi

LGTM

openai_vector/index-vectors-only-mapping-with-docid-mapping.json

gareth-ellis

LGTM, with the exception that it adds 11 minutes to the system tests - this is because of the way the runner initialises, even in test mode.

I don't have a good solution right now - we have another track that does the same, we got around it by skipping in the system tests - like here #708

I dont think this will run for serverless, so just needs adding to the skip list for standard it.

One possibility, that I need to try, is to create a second file that contains many less queries, and use that based on a parameter being set. Then we could add a variant where we run e.g msmarco and openai setting this special parameter. I will add an issue so we don't forget about this tech debt...

Issue here: #754

openai_vector/challenges/default.json

Co-authored-by: Gareth Ellis <gareth.ellis@elastic.co>

benwtrent · 2025-03-13T09:23:18Z

@gareth-ellis ah, I could add two "test mode" files? I have seen that done in other rally tracks.

benwtrent · 2025-03-13T12:45:45Z

Let me know if I need to make any changes @gareth-ellis.

But, I do think once we get this finished, we can then make progress of moving our nightlies to this benchmark with different parameters.

gareth-ellis · 2025-03-13T12:54:17Z

When you wrote earlier, I started looking at how we could do this - i've got it semi working in msmarco track, if i get it working 100% by later today, i'll try and apply the same to yours, otherwise will temporarily add your new track to the skip list and then we should be good

benwtrent added 2 commits October 29, 2024 15:11

Adding recall testing to openAI track

7caee9f

iter

6b00b0c

benwtrent marked this pull request as ready for review December 19, 2024 14:50

jimczi approved these changes Dec 19, 2024

View reviewed changes

openai_vector/index-vectors-only-mapping-with-docid-mapping.json Outdated Show resolved Hide resolved

benwtrent added 4 commits March 4, 2025 14:27

Merge remote-tracking branch 'upstream/master' into openai-recall-test

8782f5c

fixing formatting

8a598d9

fixing

6fd41fe

fixing params

26ba5a9

gareth-ellis requested a review from a team March 12, 2025 10:27

gareth-ellis requested changes Mar 12, 2025

View reviewed changes

openai_vector/challenges/default.json Outdated Show resolved Hide resolved

Update openai_vector/challenges/default.json

3863c4e

Co-authored-by: Gareth Ellis <gareth.ellis@elastic.co>

Exclude track from system test temporarily

825eadc

gareth-ellis self-requested a review March 13, 2025 13:52

gareth-ellis approved these changes Mar 13, 2025

View reviewed changes

benwtrent merged commit 000eeb7 into elastic:master Mar 24, 2025
13 checks passed

benwtrent deleted the openai-recall-test branch March 24, 2025 13:07

NickDris mentioned this pull request Aug 28, 2025

dris test branch NickDris/rally-tracks#2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding recall testing to openAI track#702

Adding recall testing to openAI track#702
benwtrent merged 8 commits intoelastic:masterfrom
benwtrent:openai-recall-test

benwtrent commented Oct 29, 2024

Uh oh!

jimczi left a comment

Uh oh!

Uh oh!

gareth-ellis left a comment •

edited

Loading

Uh oh!

Uh oh!

benwtrent commented Mar 13, 2025

Uh oh!

benwtrent commented Mar 13, 2025

Uh oh!

gareth-ellis commented Mar 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

benwtrent commented Oct 29, 2024

Uh oh!

jimczi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gareth-ellis left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

benwtrent commented Mar 13, 2025

Uh oh!

benwtrent commented Mar 13, 2025

Uh oh!

gareth-ellis commented Mar 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

gareth-ellis left a comment •

edited

Loading