Glimpse

This code is for our ICLR 2025 paper "Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection", where we borrow some code from Fast-DetectGPT.

Paper | Demo | OpenReview

We are working on the demo and will update the link soon.

Brief Intro

Method	ChatGPT	GPT-4	Claude-3 Sonnet	Claude-3 Opus	Gemini-1.5 Pro	Avg.
Fast-DetectGPT (Open-Source: gpt-neo-2.7b)	0.9487	0.8999	0.9260	0.9468	0.8072	0.9057
Glimpse (Fast-DetectGPT) (Proprietary: gpt-3.5)	0.9766 (↑54%)	0.9411 (↑41%)	0.9576 (↑43%)	0.9689 (↑42%)	0.9244 (↑61%)	0.9537 (↑51%)

The table shows detection accuracy (measured in AUROC) across five source LLMs, where the methods are evaluated on a diverse dataset Mix3 (a mixture of XSum, Writing, and PubMed) produced by each source model. The baseline Fast-DetectGPT uses an open-source gpt-neo-2.7b model but our Glimpse (Fast-DetectGPT) uses a proprietary gpt-3.5 model. The notion "↑" indicates the improvement relative to the remaining space, calculated by "(new - old) / (1.0 - old)".

Environment

Python3.12
PyTorch2.3.1
Setup the environment: pip install -r requirements.txt

(Notes: the baseline methods are run on 1 GPU of Tesla A100 with 80G memory, while Glimpse is run on a CPU environment.)

Workspace

Following folders are created for our experiments:

./exp_main -> experiments with five latest LLMs as the source model (main.sh).
./exp_langs -> experiments on six languages (langs.sh).

(Notes: we share the data and results for convenient reproduction.)

Citation

If you find this work useful, you can cite it with the following BibTex entry:

@articles{bao2024glimpse,
  title={Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection},
  author={Bao, Guangsheng and Zhao, Yanbin and He, Juncai and Zhang, Yue},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
exp_langs		exp_langs
exp_main		exp_main
scripts		scripts
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
ablation_prompt.sh		ablation_prompt.sh
ablation_ranksize.sh		ablation_ranksize.sh
ablation_topk.sh		ablation_topk.sh
attack.sh		attack.sh
baselines_closellm.sh		baselines_closellm.sh
baselines_langs.sh		baselines_langs.sh
baselines_openllm.sh		baselines_openllm.sh
data_claude.sh		data_claude.sh
data_gemini.sh		data_gemini.sh
langs.sh		langs.sh
main.sh		main.sh
main_openllm.sh		main_openllm.sh
requirements.txt		requirements.txt
requirements_dipper.txt		requirements_dipper.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Glimpse

Brief Intro

Environment

Workspace

Citation

About

Releases

Packages

Languages

License

baoguangsheng/glimpse

Folders and files

Latest commit

History

Repository files navigation

Glimpse

Brief Intro

Environment

Workspace

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages