Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation

Authors: Ziniu Zhang, Zhenshuo Zhang, Dongyue Li, Lu Wang, Jennifer Dy and Hongyang R. Zhang.

Usage

Here are the procedure to implement our work:

Enviroment reconstruction

conda create -n gradsel python=3.8
conda activate gradsel
# pip install the correct torch version here
pip install git+https://github.com/huggingface/transformers.git@c37573806ab3526dd805c49cbe2489ad4d68a9d7
pip install -U scikit-learn

Also, we provide an environment file including the python package versions we used in our experiments.

We need to first generate data features:

python ./utils/get_feature.py --task {task} --model {model key}

Commands to run inferences.

# GradRE
python main.py --dataset {dataset} --model {model key} --method direct --do_zeroshot --test_batch_size 4 --use_demonstrations  --seed 0 --k {number} --num_anchors {number} --estim
# GradFS
python main.py --dataset {dataset} --model {model key} --method direct --do_zeroshot --test_batch_size 4 --use_demonstrations  --seed 0 --k {number} --forsel
# GradCE
python main.py --dataset {dataset} --model {model key} --method direct --do_zeroshot --test_batch_size 4 --use_demonstrations  --seed 0 --k {number} --gradestim

Examples.

For the linear-regression task, we provide the usage file as an example.

Then, we provide an example of running GradRE on SST-2 dataset.

python main.py --dataset sst2 --model deepseek-ai/deepseek-llm-7b-chat --method direct --do_zeroshot --test_batch_size 4 --use_demonstrations  --seed 0 --k 8 --ranens

The output is the F1-score of our method.

Reference

If you find this repository useful or happen to use it in a research paper, please cite our work with the following Bib information.

@article{zhang2025linear,
  title={Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation},
  author={Zhang, Ziniu and Zhang, Zhenshuo and Li, Dongyue and Wang, Lu and Dy, Jennifer and Zhang, Hongyang R.},
  booktitle={Empirical Methods in Natural Language Processing (EMNLP)},
  year={2025},
}

Name		Name	Last commit message	Last commit date
Latest commit History 351 Commits
examples		examples
gradsel		gradsel
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
gradsel_pipeline.png		gradsel_pipeline.png
main.py		main.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation

Usage

Reference

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

VirtuosoResearch/ICL-GradSel

Folders and files

Latest commit

History

Repository files navigation

Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation

Usage

Reference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages