Added search quality testing pipeline #1774

hagen-danswer · 2024-07-03T21:52:31Z

No description provided.

vercel · 2024-07-03T21:52:34Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
internal-search	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Jul 6, 2024 6:50pm

yuhongsun96

Generally looks good, didn't scrutinize the scripts that much but seems reasonable.

By the way, there are a ton of errors in the automated checks. I know this is more of a [WIP] PR but let's definitely fix those up before the final PR

backend/danswer/configs/app_configs.py

backend/danswer/db/connector_credential_pair.py

backend/danswer/db/engine.py

backend/danswer/document_index/vespa/index.py

backend/tests/regression/answer_quality/cli_utils.py

backend/tests/regression/answer_quality/relari.py

backend/tests/regression/answer_quality/search_test_config.yaml

yuhongsun96 · 2024-07-04T05:42:54Z

deployment/docker_compose/docker-compose.search-testing.yml

@@ -0,0 +1,406 @@
+version: '3'


I'm quite against having yet another docker compose yaml. Every time we add a new env based config we have to currently copy it to 3 docker compose yamls, kubernetes deployment, helm deployment. Let's see if we can just use the existing yamls

backend/danswer/db/engine.py

backend/tests/regression/answer_quality/README.md

backend/tests/regression/answer_quality/cli_utils.py

backend/tests/regression/answer_quality/file_uploader.py

backend/tests/regression/answer_quality/search_test_config.yaml

deployment/docker_compose/docker-compose.search-testing.yml

backend/danswer/document_index/vespa/index.py

yuhongsun96 · 2024-07-05T18:18:14Z

deployment/docker_compose/docker-compose.search-testing.yml

+    ports:
+      - "8080"
+    environment:
+      # Auth Settings


this can rely on an .env file explicitly. The goal is to not have to duplicate the env vars so that when we introduce a new one it has to be changed in 20 places

yuhongsun96 · 2024-07-05T18:18:58Z

deployment/docker_compose/docker-compose.search-testing.yml

+      o: bind
+      device: ${DANSWER_VESPA_DATA_DIR:-./vespa_data}
+  model_cache_huggingface:
+  #   driver: local


general cleanup here (also other commented things around the PR). Be sure to give the PR a read on GitHub before the final review <3 thanks!

yuhongsun96 · 2024-07-05T18:19:11Z

deployment/docker_compose/docker-compose.search-testing.yml

+#     driver_opts:
+#       type: none
+#       o: bind
+#       device: ${DANSWER_MODEL_CACHE_DIR:-./model_cache}


NEWLINE REEEE

yuhongsun96 · 2024-07-05T18:19:48Z

deployment/docker_compose/docker-compose.search-testing.yml

+      - LANGUAGE_CHAT_NAMING_HINT=${LANGUAGE_CHAT_NAMING_HINT:-}
+      - QA_PROMPT_OVERRIDE=${QA_PROMPT_OVERRIDE:-}
+      # Other Services
+      - POSTGRES_HOST=relational_db


Some of the env things do still need to stay as it points to a service in the docker compose deployment which is not default

yuhongsun96

Super amazing work! Thanks!

Added search quality testing pipeline

1f77f87

vercel bot deployed to Preview July 3, 2024 21:54 View deployment

fix

d8be570

vercel bot deployed to Preview July 3, 2024 21:56 View deployment

yuhongsun96 reviewed Jul 4, 2024

View reviewed changes

added readme

a2aa2ea

vercel bot deployed to Preview July 4, 2024 19:04 View deployment

reverted vespa port changes

50fcd17

vercel bot deployed to Preview July 5, 2024 00:50 View deployment

anotha fix

ba3adc3

vercel bot deployed to Preview July 5, 2024 00:52 View deployment

switched to an api call architecture

dac5628

vercel bot deployed to Preview July 5, 2024 00:55 View deployment

cleaned up print statements

8364fac

vercel bot deployed to Preview July 5, 2024 01:03 View deployment

refactored api calls and added test reruns

72f9d88

vercel bot deployed to Preview July 5, 2024 18:03 View deployment

yuhongsun96 reviewed Jul 5, 2024

View reviewed changes

backend/danswer/document_index/vespa/index.py Outdated Show resolved Hide resolved

yuhongsun96 reviewed Jul 5, 2024

View reviewed changes

reverted backend changes

5f1462a

vercel bot deployed to Preview July 5, 2024 23:33 View deployment

qol improvements

480d620

vercel bot deployed to Preview July 6, 2024 18:23 View deployment

added yaml comments

1d03b77

vercel bot deployed to Preview July 6, 2024 18:50 View deployment

yuhongsun96 approved these changes Jul 6, 2024

View reviewed changes

yuhongsun96 merged commit ac14369 into main Jul 6, 2024
8 checks passed

yuhongsun96 deleted the build-search-testing-pipeline branch July 6, 2024 18:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added search quality testing pipeline #1774

Added search quality testing pipeline #1774

hagen-danswer commented Jul 3, 2024

vercel bot commented Jul 3, 2024 •

edited

Loading

yuhongsun96 left a comment •

edited

Loading

yuhongsun96 Jul 4, 2024

yuhongsun96 Jul 5, 2024

yuhongsun96 Jul 5, 2024

yuhongsun96 Jul 5, 2024

yuhongsun96 Jul 5, 2024

yuhongsun96 left a comment

Added search quality testing pipeline #1774

Added search quality testing pipeline #1774

Conversation

hagen-danswer commented Jul 3, 2024

vercel bot commented Jul 3, 2024 • edited Loading

yuhongsun96 left a comment • edited Loading

Choose a reason for hiding this comment

yuhongsun96 Jul 4, 2024

Choose a reason for hiding this comment

yuhongsun96 Jul 5, 2024

Choose a reason for hiding this comment

yuhongsun96 Jul 5, 2024

Choose a reason for hiding this comment

yuhongsun96 Jul 5, 2024

Choose a reason for hiding this comment

yuhongsun96 Jul 5, 2024

Choose a reason for hiding this comment

yuhongsun96 left a comment

Choose a reason for hiding this comment

vercel bot commented Jul 3, 2024 •

edited

Loading

yuhongsun96 left a comment •

edited

Loading