Correctly validating new real-world OpenAI API Key format, relaxing negative tests #3330

nstielau · 2024-08-08T20:47:34Z

Why are these changes needed?

Using a real-world OpenAI API key, which successfully works with the OpenAI hosted models, I encountered a warning message that the API key was not valid. This red-herring lead to slow debugging, as the issue was in fact a space character in an agent name rather than anything to do with the API key.

Approach

I've removed two negative tests. Writing a validation implementation that passes these tests is possible, but in my opinion it is preferable to have a simple validation function that doesn't cover all possibilities. It seems like 99.9% of the time, a user would copy/paste their API Key, and the updated function should catch common positive and negative tests.

Additionally, in my opinion, false positives that log the warning message incorrectly are as bad as false negatives which would not indicate the API Key when it is not working.

Reviewers

@wrfly and @AaronWard PTAL TY!

Related issue number

n/a
See #3078 for last changes

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://microsoft.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

…egative tests.

gitguardian · 2024-08-08T20:48:10Z

⚠️ GitGuardian has uncovered 6 secrets following the scan of your pull request.

Please consider investigating the findings and remediating the incidents. Failure to do so may lead to compromising the associated services or software components.

Since your pull request originates from a forked repository, GitGuardian is not able to associate the secrets uncovered with secret incidents on your GitGuardian dashboard.
Skipping this check run and merging your pull request will create secret incidents on your GitGuardian dashboard.

🔎 Detected hardcoded secrets in your pull request

GitGuardian id	GitGuardian status	Secret	Commit	Filename
12853598	Triggered	Generic High Entropy Secret	`f5522a9`	test/oai/test_utils.py	View secret
-	-	Generic High Entropy Secret	`f5522a9`	test/oai/test_utils.py	View secret
-	-	Generic High Entropy Secret	`f5522a9`	test/oai/test_utils.py	View secret
12853600	Triggered	Generic High Entropy Secret	`f5522a9`	test/oai/test_utils.py	View secret
12853601	Triggered	Generic High Entropy Secret	`f5522a9`	test/oai/test_utils.py	View secret
12853602	Triggered	Generic High Entropy Secret	`f5522a9`	test/oai/test_utils.py	View secret

🛠 Guidelines to remediate hardcoded secrets

Understand the implications of revoking this secret by investigating where it is used in your code.
Replace and store your secrets safely. Learn here the best practices.
Revoke and rotate these secrets.
If possible, rewrite git history. Rewriting git history is not a trivial act. You might completely break other contributing developers' workflow and you risk accidentally deleting legitimate data.

To avoid such incidents in the future consider

following these best practices for managing and storing secrets including API keys and other credentials
install secret detection on pre-commit to catch secret before it leaves your machine and ease remediation.

^{🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request.}

nstielau · 2024-08-08T20:54:21Z

@microsoft-github-policy-service agree

AcePeaX · 2024-08-16T13:30:26Z

This PR would normally fix this issue #3345. I'm facing the same problem and can't wait for this PR to be merged 👍 .

nstielau · 2024-08-16T13:39:40Z

Thanks for connecting to that issue @AcePeed.

AaronWard · 2024-08-20T14:42:27Z

@sonichi Can you assign me as a reviewer?

rseymour · 2024-08-21T23:53:06Z

Also facing same issue a valid API key format is out of the hands of this library and OpenAI returns an error when one is valid.

buddycat · 2024-08-27T14:47:04Z

The updated regex still rejects my valid OpenAI api_key. It would need to be adjusted to allow for multiple underscores, possibly like ^sk-(?!.--)([a-zA-Z0-9]+(?:-[a-zA-Z0-9_]+))$

nstielau · 2024-08-27T19:13:22Z

@buddycat can you give a test-case assertion for your key?

nstielau · 2024-08-27T19:15:08Z

Haha, FYI, I tried to get autogen agent to fix this. I tweaked the code execution to track changes in git. You can view the iterations here: https://github.githistory.xyz/nstielau/autogen_iterations/blob/main/ai_regex_quiz__temp_0.95.py

It couldn't solve for the current test cases with my new test case :/

buddycat · 2024-08-28T12:26:05Z

Here's a test-case that should be true:

sk-proj-111111111122222aaaaaaaa_7777hhhhh_111111111122222aaaaaaa_99900000fggfffhgg

jackgerrits · 2024-09-25T15:57:28Z

Superseded by #3569

Correctly validating new real-world OpenAI API Key format, relaxing n…

f5522a9

…egative tests.

Merge branch 'main' into open_ai_api_key_regex_fix

bdfbcd6

Merge branch 'main' into open_ai_api_key_regex_fix

1f7f7c3

gagb requested review from AaronWard and jackgerrits September 9, 2024 05:30

jackgerrits mentioned this pull request Sep 25, 2024

Remove api key validation #3569

Merged

3 tasks

jackgerrits closed this Sep 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Correctly validating new real-world OpenAI API Key format, relaxing negative tests #3330

Correctly validating new real-world OpenAI API Key format, relaxing negative tests #3330

nstielau commented Aug 8, 2024

gitguardian bot commented Aug 8, 2024 •

edited

Loading

nstielau commented Aug 8, 2024

AcePeaX commented Aug 16, 2024

nstielau commented Aug 16, 2024

AaronWard commented Aug 20, 2024

rseymour commented Aug 21, 2024

buddycat commented Aug 27, 2024

nstielau commented Aug 27, 2024

nstielau commented Aug 27, 2024

buddycat commented Aug 28, 2024

jackgerrits commented Sep 25, 2024

Correctly validating new real-world OpenAI API Key format, relaxing negative tests #3330

Correctly validating new real-world OpenAI API Key format, relaxing negative tests #3330

Conversation

nstielau commented Aug 8, 2024

Why are these changes needed?

Approach

Reviewers

Related issue number

Checks

gitguardian bot commented Aug 8, 2024 • edited Loading

⚠️ GitGuardian has uncovered 6 secrets following the scan of your pull request.

nstielau commented Aug 8, 2024

AcePeaX commented Aug 16, 2024

nstielau commented Aug 16, 2024

AaronWard commented Aug 20, 2024

rseymour commented Aug 21, 2024

buddycat commented Aug 27, 2024

nstielau commented Aug 27, 2024

nstielau commented Aug 27, 2024

buddycat commented Aug 28, 2024

jackgerrits commented Sep 25, 2024

gitguardian bot commented Aug 8, 2024 •

edited

Loading