feat(linter): implement noSecrets #3823

SaadBazaz · 2024-09-08T11:28:29Z

Summary

Possibly closes #3822. I need this rule in some projects, and its absence from Biome is blocking adoption. The rule is useful for security and code hygiene.

The rule searches for potential secrets/keys in code.

Inspired by no-secrets/no-secrets in eslint.

I've dropped TODOs in the code for further improvement.

⚠️ NOTE: This is not a replacement for using your 🧠 when building. While this rule is helpful, it's not infallible. Always review your code carefully and consider additional security measures, such as pre-commit hooks or automated secret scanning in your CI/CD pipeline (for example, GitGuardian or GitHub's own protections).

Test Plan

I've tested locally using:

cargo t quick_test

and

just test-lintrule noSecrets

codspeed-hq · 2024-09-08T12:17:33Z

CodSpeed Performance Report

Merging #3823 will not alter performance

Comparing SaadBazaz:feat/no-secrets (efbb8fd) with main (4bc409d)

Summary

✅ 107 untouched benchmarks

SaadBazaz · 2024-09-08T17:26:54Z

I need some help to resolve the merge conflicts.

dyc3 · 2024-09-08T17:43:32Z

The files with conflicts are all generated, so IIRC you should be able to rebase on main and then run just gen-all to redo all the codegen.

Conaclos · 2024-09-08T18:16:51Z

The impact on the benchmark seems to be significant. I think the use of so many regexes is one of the causes. We should try to find a way to reduce this overhead.
One idea: we could skip every string shorter than X characters, where X is the minimum number of characters required to match at least one of the regexes. This could exclude many small strings.
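
A minimal sketch of the minimum-length quick return described above, using only the standard library. The table name, pattern names, and lengths are hypothetical and are not taken from the rule's actual pattern list.

```rust
// Hypothetical table of (pattern name, shortest possible match length in bytes).
// The real rule's patterns and lengths may differ.
const PATTERN_MIN_LENGTHS: &[(&str, usize)] = &[
    ("example-short-token", 14),
    ("example-long-key", 40),
];

/// The "X" from the comment above: the smallest length any pattern can match.
fn global_min_len() -> usize {
    PATTERN_MIN_LENGTHS
        .iter()
        .map(|&(_, len)| len)
        .min()
        .unwrap_or(0)
}

/// True when no pattern can possibly match, so the regex pass can be skipped.
fn too_short_to_be_secret(text: &str) -> bool {
    text.len() < global_min_len()
}
```

A caller would check `too_short_to_be_secret` first and only run the regexes when it returns `false`. Because the check is a single length comparison, it costs almost nothing for the many short string literals in typical code.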

SaadBazaz · 2024-09-08T18:18:50Z

> The impact on the benchmark seems to be significant. I think the use of so many regexes is one of the causes. We should try to find a way to reduce this overhead. One idea: we could skip every string shorter than X characters, where X is the minimum number of characters required to match at least one of the regexes. This could exclude many small strings.

Great idea! Although I suspect the benchmark results are stale: the last run was ~3 hours ago, and the code has been heavily refactored since then.

SaadBazaz · 2024-09-09T17:50:03Z

> The more valuable things here are quick-return heuristics. Excluding strings with spaces seems to be a good one, but it might cause some false negatives (e.g. Google service accounts are JSON, which may not be minified).

I agree... Eventually (in further passes) this would be implemented on comments as well... So we definitely need better, smarter heuristics :D
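
The whitespace heuristic discussed above could look roughly like the sketch below. The function names are hypothetical, and the false-negative caveat from the thread still applies: a secret inside pretty-printed JSON would be skipped.

```rust
/// Quick-return heuristic: treat any string containing whitespace as prose.
/// Caveat from the thread: this can miss secrets in non-minified JSON.
fn looks_like_prose(text: &str) -> bool {
    text.chars().any(char::is_whitespace)
}

/// Hypothetical gate evaluated before any regex matching.
fn should_scan(text: &str) -> bool {
    !looks_like_prose(text)
}
```

Whether the skipped work outweighs the missed detections is exactly the trade-off the reviewers raise, so this is best read as an optional fast path rather than a settled design.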

zohairhadi · 2024-09-09T18:15:07Z

would love to have this sooner

dyc3

Looks good to me. Would you mind adding a changelog entry for this? (And make sure to fix any CI failures.)

SaadBazaz · 2024-09-09T18:48:39Z

> Looks good to me. Would you mind adding a changelog entry for this? (And make sure to fix any CI failures.)

Fixed, formatted, linted and changelog updated. Ready for merge. 🦾

SaadBazaz · 2024-09-09T19:25:38Z

> (and make sure to fix any CI failures)

I'm getting a performance failure. The difference seems minor, but it's on the React benchmark, which seems like a popular one. What should the plan of action be here? @dyc3

Conaclos · 2024-09-09T19:31:01Z

Still have some perf issues...

The rule (like any other new rule) will be in the nursery group.
A small regression is OK. We can still improve it in a future PR.

Do you think we should check for spaces in a string, and ignore those strings? Those would most likely be sentences anyway.

I am unsure if it could make a big difference. We could still try it in the future.

Some other ideas:

We could create an enum with a variant for each secret type.
A function could return a slice of potential secrets based on the length of the string.
This would avoid iterating over all secrets and checking the minimum length of each.
Some secrets could be checked by a hand-written implementation, avoiding regexes completely.
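
One way to read the enum-plus-slice idea is sketched below. The variant names and minimum lengths are invented for illustration; the thread does not show the rule's real secret types.

```rust
/// Hypothetical secret kinds; the real rule's list is not shown in this thread.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum SecretKind {
    ShortToken,
    MediumKey,
    LongKey,
}

impl SecretKind {
    /// Illustrative minimum lengths in bytes, not taken from the actual rule.
    const fn min_len(self) -> usize {
        match self {
            SecretKind::ShortToken => 14,
            SecretKind::MediumKey => 32,
            SecretKind::LongKey => 64,
        }
    }
}

// Sorted by ascending `min_len`, so the candidates for any length form a prefix.
static KINDS_BY_MIN_LEN: [SecretKind; 3] = [
    SecretKind::ShortToken,
    SecretKind::MediumKey,
    SecretKind::LongKey,
];

/// Returns the slice of secret kinds that a string of `len` bytes could match.
fn candidates_for_len(len: usize) -> &'static [SecretKind] {
    // Binary search for the first kind whose minimum length exceeds `len`.
    let end = KINDS_BY_MIN_LEN.partition_point(|kind| kind.min_len() <= len);
    &KINDS_BY_MIN_LEN[..end]
}
```

Keeping the table sorted means a string that is too short for every kind gets an empty slice without touching any matcher, and longer strings only pay for the kinds they could actually match.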

ematipico · 2024-09-09T19:50:48Z

> > (and make sure to fix any CI failures)
>
> I'm getting a performance failure. The difference seems minor, but it's on the React benchmark, which seems like a popular one. What should the plan of action be here? @dyc3

That's a flaky test, and it belongs to the parser, not the analyzer. The last run didn't show any regression in the analyzer.

SaadBazaz · 2024-09-09T20:09:21Z

> Still have some perf issues...
>
> The rule (like any other new rule) will be in the nursery group. A small regression is OK. We can still improve it in a future PR.
>
> Do you think we should check for spaces in a string, and ignore those strings? Those would most likely be sentences anyway.
>
> I am unsure if it could make a big difference. We could still try it in the future.
>
> Some other ideas:
>
> We could create an enum with a variant for each secret type.
> A function could return a slice of potential secrets based on the length of the string.
> This would avoid iterating over all secrets and checking the minimum length of each.
>
> Some secrets could be checked by a hand-written implementation, avoiding regexes completely.

Another idea to add on:

StartWith and EndWith enums, for faster string checks (e.g. in the case of RSA Private Keys, etc).

I think we can take a lot of inspiration from gitleaks.

Could even parse just this file: https://github.com/gitleaks/gitleaks/blob/master/config/gitleaks.toml
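
The StartWith/EndWith idea above can be sketched as a small enum of literal checks. The type and constant names are hypothetical; the only fixed fact used is the standard PEM header line for RSA private keys.

```rust
/// Hypothetical literal matcher, following the StartWith/EndWith idea above.
enum LiteralCheck {
    StartsWith(&'static str),
    EndsWith(&'static str),
}

impl LiteralCheck {
    /// Plain string comparison: no regex engine involved.
    fn matches(&self, text: &str) -> bool {
        match self {
            LiteralCheck::StartsWith(prefix) => text.starts_with(*prefix),
            LiteralCheck::EndsWith(suffix) => text.ends_with(*suffix),
        }
    }
}

/// Example check: the PEM header that opens an RSA private key block.
const RSA_PRIVATE_KEY_HEADER: LiteralCheck =
    LiteralCheck::StartsWith("-----BEGIN RSA PRIVATE KEY-----");
```

For secrets with a fixed header or trailer, a check like this is a byte comparison bounded by the literal's length, which is the kind of hand-written fast path the earlier comment suggests.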

SaadBazaz · 2024-09-10T05:09:34Z

It seems we could speed up the regexes by switching to a nightly build with SIMD enabled. However, I don't know how that would affect other modules.
rust-lang/regex#350

With faster regex, we can eventually support many more useful regexes and make biome a possible alternative to gitleaks.

feat(linter): implement noSecrets initial

44dc010

github-actions bot added A-Project Area: project A-Linter Area: linter L-JavaScript Language: JavaScript and super languages A-Diagnostic Area: diagnostocis labels Sep 8, 2024

dyc3 reviewed Sep 8, 2024

chore: fix todo syntax, add more invalid test cases, add snaps

ab30aa2

SaadBazaz requested a review from dyc3 September 8, 2024 12:05

dyc3 requested changes Sep 8, 2024

chore: update shannon_entropy string

6d343f2

minht11 reviewed Sep 8, 2024

SaadBazaz added 3 commits September 8, 2024 18:48

refactor: fix tests, and actually get code to work

81db935

docs: remove eslint inspired line

227422f

refactor: use lazylock for caching regexes

35d4ac9

ematipico reviewed Sep 8, 2024

chore: lint according to project req

b2a56be

github-actions bot added the A-CLI Area: CLI label Sep 8, 2024

refactor: turn sensitive patterns into a tuple, make regexes at compile time, add multithreading loop

b3dfca7

SaadBazaz requested review from ematipico, dyc3 and minht11 September 8, 2024 17:22

SaadBazaz changed the title from "feat(linter): implement noSecrets initial" to "feat(linter): implement noSecrets" Sep 8, 2024

fix: merge conflict, regen

fa1addb

dyc3 requested changes Sep 8, 2024

docs: update doc comments, show better error to user

dde2dc9

renovate bot and others added 8 commits September 9, 2024 22:19

chore(deps): update rust crate bpaf to 0.9.13 (biomejs#3834)

725853d

Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>

chore(deps): update codspeedhq/action action to v2.4.5 (biomejs#3831)

0956994

Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>

chore(deps): update rust:1.80.1 docker digest to d22d893 (biomejs#3829)

6116ac0

Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>

chore(deps): update rust crate anyhow to 1.0.87 (biomejs#3832)

29b7cf0

Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>

fix(linter): only emit diagnostics for grid area properties (biomejs#3838)

4c47714

fix(linter): allow SVG elements with role="img" (biomejs#3837)

e342b99

chore: remove unnecessary string conversion

3f8dfe1

chore: remove unused string conversion

2616e9e

github-actions bot added the L-CSS Language: CSS label Sep 9, 2024

Merge branch 'main' into feat/no-secrets

ae88cb5

github-actions bot removed the L-CSS Language: CSS label Sep 9, 2024

dyc3 approved these changes Sep 9, 2024

SaadBazaz added 3 commits September 9, 2024 23:37

chore: update docstring to use const instead of var

a6a72d9

docs: update changelog, add disclaimer for users

3642ad7

docs: update changelog

296d2ed

github-actions bot added the A-Changelog Area: changelog label Sep 9, 2024

Conaclos reviewed Sep 9, 2024

refactor: check minlength as a test, create consts

2274dcd

Conaclos approved these changes Sep 9, 2024

docs: update changelog with rorrect link

efbb8fd

dyc3 merged commit a66e450 into biomejs:main Sep 9, 2024
12 checks passed

SaadBazaz mentioned this pull request Oct 3, 2024

feat(noSecrets): refine the entropy computation to avoid some false positives #4118

Merged

14 tasks
