-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to reproduce Counterfactual Robustness result with ChatGPT #6
Comments
Hello, you can use |
Cool. I reran both lines with
So I guess Also where do I get ED* and correct_rate? Thanks. |
Hello, |
I see. With evaluate on ChatGPT I got:
Seems like GPT-3.5 produces unstable result? |
Yes, you can set by lower temperature like |
Here's what I did:
Step1:
python evalue.py --dataset zh --noise_rate 0.0 --modelname chatgpt
Step2:
python fact_evalue.py --dataset zh --modelname chatgpt
I got file
prediction_zh_chatgpt_temp0.7_noise0.0_passage5_correct0.0_result
with content:And file
prediction_zh_chatgpt_temp0.7_noise0.0_passage5_correct0.0_chatgptresult.json
with content:I failed to see how this matches the results in the paper:
Any ideas?
The text was updated successfully, but these errors were encountered: