Sphinteract

Overview

This repository contains the code, data, and resources for our research on NL2SQL with the Sphinteract framework.

zeroshot_experiments.ipynb - Source code for running zero-shot experiments on KaggleDBQA and BIRD.
FewShotAmbSQL.ipynb - Source code for running few-shot experiments on KaggleDBQA and BIRD.

We have made our experiment logs and user study data available for public access. You can find them on our Google Drive:

batchRequests.zip - This folder contains the round-0 requests and answers, using the same prompt format as DAIL-SQL (code representation with rule). To reduce experimental costs, we used the OpenAI Batch API to generate the initial SQL queries.
experiment_logs.zip - This folder contains the experiment logs of Sphinteract using 0, 1, 3, and 5 demonstrations with gpt-3.5-turbo and gpt-4-turbo. Each pkl file contains a list of dictionaries. The keys of each dictionary are sql_log, cq_log, feedback_log, and num_cq_asked.
- sql_log - Contains a list of (order, sql_generation_prompt, predicted_sql, pscore). The order field is an int that tracks the ordering between SQL queries, clarification questions, and feedback. The sql_generation_prompt follows the same format as DAIL-SQL (code representation with rule).
- cq_log - Contains a list of (order, SRA_prompt, clarification question, pscore).
- feedback_log - Contains a list of (order, feedback_prompt, feedback, pscore).
kaggle_userstudy.jsonl - This file contains the 64 KaggleDBQA questions and the users' responses. This user study shows that, for the same question, users may have completely different expectations of the SQL answer.
sphinteract_user_study.zip - This folder contains our Sphinteract user study web pages. Open frontpage.html to start the user study. Before starting, fill in the apiKey field in study.html.
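
The log layout described for experiment_logs.zip can be read with a few lines of Python. The following is a minimal sketch, not code from this repository: the helper names (load_log, summarize_log, interleave) are hypothetical, it assumes num_cq_asked is an integer, and it assumes every log entry is a 4-tuple in the order listed above. Note that pickle.load can execute arbitrary code, so only open pkl files you trust.

```python
import pickle
from pathlib import Path


def load_log(path):
    """Load one experiment log: a list of per-question dictionaries."""
    with Path(path).open("rb") as f:
        return pickle.load(f)


def summarize_log(records):
    """Count questions, clarification questions asked, and SQL generations."""
    return {
        "questions": len(records),
        # Assumption: num_cq_asked is stored as an int per question.
        "clarification_questions": sum(r["num_cq_asked"] for r in records),
        "sql_generations": sum(len(r["sql_log"]) for r in records),
    }


def interleave(record):
    """Merge the three logs of one record into one list sorted by `order`."""
    events = []
    for kind in ("sql_log", "cq_log", "feedback_log"):
        # Each entry is (order, prompt, text, pscore), per the description above.
        for order, _prompt, text, _pscore in record[kind]:
            events.append((order, kind, text))
    return sorted(events)
```

For example, `summarize_log(load_log("some_log.pkl"))` would report the totals for one file (the file name here is a placeholder), and `interleave(records[0])` reconstructs the sequence of SQL queries, clarification questions, and feedback for the first question.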
