Make Pooling Parameterizable #11

janheinrichmerker · 2024-11-13T21:02:04Z

Currently, there are some hard-coded configurations and other issues in the pooling code:

The Elasticsearch connection and index to get passage IDs by document ID are hard-coded
Some paths are hard-coded.
The set of retrieval models and re-rankers is hard-coded.

I believe we should move the configuration to the CLI options and ideally not directly rely on Elasticsearch at all (e.g., by directly retrieving from the segmented corpus).

janheinrichmerker · 2024-11-13T21:04:24Z

I addressed some of the issues in commits:

mam10eks · 2024-11-14T07:07:51Z

Awesome, yes, makes sense.

I will modify this to be non-hard coded for the next iteration that I will create on friday

mam10eks self-assigned this Nov 14, 2024

mam10eks changed the title ~~Fix pooling~~ Make Pooling Parameterizable Nov 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make Pooling Parameterizable #11

Make Pooling Parameterizable #11

janheinrichmerker commented Nov 13, 2024

janheinrichmerker commented Nov 13, 2024

mam10eks commented Nov 14, 2024

Make Pooling Parameterizable #11

Make Pooling Parameterizable #11

Comments

janheinrichmerker commented Nov 13, 2024

janheinrichmerker commented Nov 13, 2024

mam10eks commented Nov 14, 2024