Add response passthrough to ranking endpoints #65

jueri · 2024-09-20T08:47:41Z

Add response passthrough to ranking endpoints

Previously the STELLA infrastructure demanded a fixed response schema for rankings. The ranking systems were expected to return the documents or items in a certain format and the STELLA app would pass the results after the interleaving als in a certain format. This was not flexible and all content needed to be loaded afterwards from external sources based on the returned ID.

Improving on that, the STELLA App now supports a passthrough mode for the ranking endpoint. This means that the ranking systems can return the documents in any format they like and the STELLA App will return the same format after interleaving. This allows to return the full content of the documents.

To make use of this feature, the experimental systems need additional configurations to tell the STELLA App the JSON Path to the document ranking in the response and the key of the document ID. This can be configured through the SYSTEMS_CONFIG environment variable in the docker compose file.

Example:

SYSTEMS_CONFIG: |
        {
            "ranker_base": {"type": "ranker", "base": true, "docid": "id", "hits_path": "$.hits.hits"},
            "ranker_exp": {"type": "ranker", "docid": "id", "hits_path": "$.hits.hits"}
        }

The results are still saved to the database in the base schema of the STELLA app and the original response will not be saved to the database. This is to ensure fast responses and minimize latency. However therefore a new caching mechanism was needed. Therefore, Flask-Caching is used. By default, FileSystemCache is used, but this can be changed in the config.py file.

This branch further includes the system configs as JSON feature:

Allow passing the systems config in the docker compose environment variables as a JSON string. This is cleaner and clearer and will allow the configuration of additional system parameters necessary for future updates.

Before:

RECSYS_LIST: gesis_rec_pyterrier gesis_rec_pyserini
RECSYS_BASE: gesis_rec_pyterrier
RANKSYS_LIST: gesis_rank_pyserini_base gesis_rank_pyserini
RANKSYS_BASE: gesis_rank_pyserini_base

After:

SYSTEMS_CONFIG: |
          [
            {"name": "gesis_rec_pyterrier", "type": "recommender", "base": true},
            {"name": "gesis_rec_pyserini", "type": "recommender"},
            {"name": "gesis_rank_pyserini_base", "type": "ranker", "base": true},
            {"name": "gesis_rank_pyserini", "type": "recommender"}
          ]

…riables as a JSON string

Merge new config branch to supply additional system specific configurations (docid and hits_path) for the response passthrough.

…ystems

Merge in new config version

…ng Docker instance if run in CI environment

mdenizturkmen

Well done :)

rohitharavinder

Looks good and makes sense.

jueri added 22 commits August 28, 2024 14:24

sart work on response passthrough feature

7c6eac4

Allow passing the systems config in the docker compose environment va…

b5adb13

…riables as a JSON string

Merge branch 'feature/rework-config' into feature/response-passthrough

6069d96

Merge new config branch to supply additional system specific configurations (docid and hits_path) for the response passthrough.

start work on results passthrough

1a647b5

update JSON config format slightly and added more tests

bb9a643

Merge branch 'feature/rework-config' into feature/response-passthrough

55b8d31

implement passthrough

534b4cb

unify test structure to differentiate between base and experimental s…

5c76254

…ystems

update passthrough

c2e5bf1

update json env string processing

02670c5

update docker versinon in mock connection

e4f4527

Merge branch 'feature/rework-config' into feature/response-passthrough

17b8c70

Merge in new config version

remove logging to file

3971998

keep original response local only

a336c3b

minor imporvements to the profiling service

0434fe4

fix caching for rankings with session ID

51e9fa3

merge fix for caching in ranking endpoint.

1969aa7

cleanup

06d0edd

add description of changes

afd682e

update docker version in mock

ea33bec

update tests and GitHub workflow to ignore tests that rely on a runni…

7623b1b

…ng Docker instance if run in CI environment

update tests and GitHub workflow to ignore tests that rely on a runni…

2cc97f9

…ng Docker instance if run in CI environment

jueri added the enhancement New feature or request label Sep 20, 2024

jueri requested review from peterpanama and rohitharavinder September 20, 2024 08:47

jueri self-assigned this Sep 20, 2024

jueri requested review from mdenizturkmen and removed request for peterpanama October 21, 2024 05:19

mdenizturkmen approved these changes Oct 21, 2024

View reviewed changes

rohitharavinder approved these changes Oct 28, 2024

View reviewed changes

move parsing of the systems hit JSON path to the config

f8bdf62

jueri merged commit 44cfb37 into main Nov 11, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add response passthrough to ranking endpoints #65

Add response passthrough to ranking endpoints #65

jueri commented Sep 20, 2024

mdenizturkmen left a comment

rohitharavinder left a comment

Add response passthrough to ranking endpoints #65

Add response passthrough to ranking endpoints #65

Conversation

jueri commented Sep 20, 2024