[mieb] Any2TextMultipleChoice Abstask&Evaluator & four tasks in CV-bench #1287

gowitheflow-1998 · 2024-10-10T18:03:18Z

Adding:

Any2Text Multiple Choice Abstask and Evaluator
four tasks from CV-bench that uses above (count, relation, depth, distance).

the logic is first grabbing unique candidate choices from the task (e.g., the 788 questions in the count task only has 17 unique candidates), encode them, store in a {candidate: embedding} dict. In scoring time, we query the dict and do similarity matching. This gets rid of the need to repetitive encoding, and makes it easier for queries with different number of candidates that can't be supported by AbsTaskImageTextPairClassification atm.

Checklist

Run tests locally to make sure nothing is broken using make test.
Run the formatter to format the code using make lint.

…ty evaluation datasets

…nto ImageTextPairCls

…hmark/mteb into ImageTextPairCls

gowitheflow-1998 · 2024-10-10T18:17:24Z

also, this and Any2AnyRetrieval can be extended to audio and video for moeb at some point I think!

isaac-chung

Great work with the AbsTasks! Mostly small non-blocking comments.

mteb/tasks/Image/Any2TextMultipleChoice/eng/CVBench.py

mteb/evaluation/evaluators/Image/Any2TextMultipleChoiceEvaluator.py

gowitheflow-1998 and others added 8 commits September 23, 2024 17:04

fix ImageTextPair dataloading for large datasets; more compositionali…

91ad565

…ty evaluation datasets

Merge branch 'mieb' of https://github.com/embeddings-benchmark/mteb i…

e12ff71

…nto ImageTextPairCls

fix meta data

e42e868

fix validate points

6b76812

Merge branch 'mieb' of https://github.com/embeddings-benchmark/mteb i…

e0b530d

…nto ImageTextPairCls

CV-Bench

aabf0d4

Merge branch 'ImageTextPairCls' of https://github.com/embeddings-benc…

310fc92

…hmark/mteb into ImageTextPairCls

evaluator args comment

d951d25

gowitheflow-1998 requested review from KennethEnevoldsen and isaac-chung October 10, 2024 18:33

isaac-chung approved these changes Oct 10, 2024

View reviewed changes

fix

e817e3f

isaac-chung merged commit b0bc4e2 into mieb Oct 11, 2024
9 checks passed

isaac-chung deleted the ImageTextPairCls branch October 11, 2024 07:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[mieb] Any2TextMultipleChoice Abstask&Evaluator & four tasks in CV-bench #1287

[mieb] Any2TextMultipleChoice Abstask&Evaluator & four tasks in CV-bench #1287

gowitheflow-1998 commented Oct 10, 2024

gowitheflow-1998 commented Oct 10, 2024

isaac-chung left a comment

[mieb] Any2TextMultipleChoice Abstask&Evaluator & four tasks in CV-bench #1287

[mieb] Any2TextMultipleChoice Abstask&Evaluator & four tasks in CV-bench #1287

Conversation

gowitheflow-1998 commented Oct 10, 2024

Checklist

gowitheflow-1998 commented Oct 10, 2024

isaac-chung left a comment

Choose a reason for hiding this comment