Add permissions on list datasets for env group + cosmetic S3 Datasets #1718

dlpzx · 2024-11-22T13:12:55Z

Feature or Bugfix

Feature

Detail

For the listS3DatasetsOwnedByEnvGroup API call this PR introduces a permission check to evaluate if the user has LIST_ENVIRONMENT_DATASETS in the environment and on top of that it checks that the input groupUri is one of the groups of the user performing the call.

some cosmetic changes: internal functions prefixed with _

Relates

Security

Please answer the questions below briefly where applicable, or write N/A. Based on
OWASP 10.

Does this PR introduce or modify any input fields or queries - this includes
fetching data from storage outside the application (e.g. a database, an S3 bucket)?
- Is the input sanitized?
- What precautions are you taking before deserializing the data you consume?
- Is injection prevented by parametrizing queries?
- Have you ensured no eval or similar functions are used?
Does this PR introduce any functionality or component that requires authorization?
- How have you ensured it respects the existing AuthN/AuthZ mechanisms?
- Are you logging failed auth attempts?
Are you using or adding any cryptographic features?
- Do you use a standard proven implementations?
- Are the used keys controlled by the customer? Where are they stored?
Are you introducing any new policies/roles/users?
- Have you used the least-privilege principle? How?

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

noah-paige · 2024-11-22T15:04:47Z

backend/dataall/modules/s3_datasets/api/dataset/resolvers.py

@@ -153,7 +153,7 @@ def list_datasets_owned_by_env_group(
 ):
    if not filter:
        filter = {}
-    return DatasetService.list_datasets_owned_by_env_group(environmentUri, groupUri, filter)
+    return DatasetService.list_datasets_owned_by_env_group(uri=environmentUri, group_uri=groupUri, filter=filter)


list_datasets_owned_by_env_group args are uri: str, group_uri: str, data: dict but using kwarg filter here

Changed now!

#1720) ### Feature or Bugfix - Bugfix - integration tests ### Detail Update integration tests with additional checks included in #1718 ### Relates - #1718 ### Security Please answer the questions below briefly where applicable, or write `N/A`. Based on [OWASP 10](https://owasp.org/Top10/en/). - Does this PR introduce or modify any input fields or queries - this includes fetching data from storage outside the application (e.g. a database, an S3 bucket)? - Is the input sanitized? - What precautions are you taking before deserializing the data you consume? - Is injection prevented by parametrizing queries? - Have you ensured no `eval` or similar functions are used? - Does this PR introduce any functionality or component that requires authorization? - How have you ensured it respects the existing AuthN/AuthZ mechanisms? - Are you logging failed auth attempts? - Are you using or adding any cryptographic features? - Do you use a standard proven implementations? - Are the used keys controlled by the customer? Where are they stored? - Are you introducing any new policies/roles/users? - Have you used the least-privilege principle? How? By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

…tic S3 Datasets (#1718) - Feature For the `listS3DatasetsOwnedByEnvGroup` API call this PR introduces a permission check to evaluate if the user has `LIST_ENVIRONMENT_DATASETS` in the environment and on top of that it checks that the input groupUri is one of the groups of the user performing the call. + some cosmetic changes: internal functions prefixed with `_` Please answer the questions below briefly where applicable, or write `N/A`. Based on [OWASP 10](https://owasp.org/Top10/en/). - Does this PR introduce or modify any input fields or queries - this includes fetching data from storage outside the application (e.g. a database, an S3 bucket)? - Is the input sanitized? - What precautions are you taking before deserializing the data you consume? - Is injection prevented by parametrizing queries? - Have you ensured no `eval` or similar functions are used? - Does this PR introduce any functionality or component that requires authorization? - How have you ensured it respects the existing AuthN/AuthZ mechanisms? - Are you logging failed auth attempts? - Are you using or adding any cryptographic features? - Do you use a standard proven implementations? - Are the used keys controlled by the customer? Where are they stored? - Are you introducing any new policies/roles/users? - Have you used the least-privilege principle? How? By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@noah-paige

### Feature or Bugfix - Security ### Detail ### 🔐 Security * Update sanitization technique for terms filtering by @noah-paige in #1692 and in #1693 * Move access logging to a separate environment logging bucket by @noah-paige in #1695 * Add explicit token duration config for both JWTs by @noah-paige in #1698 * Disable GraphQL introspection if prod sizing by @noah-paige in #1704 * Add snyk workflow on schedule by @noah-paige in #1705, #1708, #1713, #1745 and in in #1746 * Unify Logger Config for Tasks by @noah-paige in #1709 * Updating overly permissive policies tagged by checkov for environment role using least privilege principles by @mourya-33 in #1632 Data.all permission model has been reviewed to ensure all Mutations and Queries have proper permissions: * Add MANAGE_SHARES permissions by @dlpzx in #1702 * Add permission check - is tenant to update SSM parameters API by @dlpzx in #1714 * Add GET_SHARE_OBJECT permissions to get data filters API by @dlpzx in #1717 * Add permissions on list datasets for env group + cosmetic S3 Datasets by @dlpzx in #1718 * Add GET_WORKSHEET permission in RUN_SQL_QUERY by @dlpzx in #1716 * Add permissions to Quicksight monitoring service layer by @dlpzx in #1715 * Add LIST_ENVIRONMENT_DATASETS permission for listing shared datasets and cleanup unused code by @dlpzx in #1719 * Add is_owner permissions to Glossary mutations + add new integration tests by @dlpzx in #1721 * Refactor env permissions + modify getTrustAccount by @dlpzx in #1712 * Add Feed consistent permissions by @dlpzx in #1722 * Add Votes consistent permissions by @dlpzx in #1724 * Consistent get_<DATA_ASSET> permissions - Dashboards by @dlpzx in #1729 ### 🧪 Test improvements Integration tests are in sync with `main` without 2.7 planned features. In this PR all core modules, optional modules and submodules are tested. That includes: tenant-permissions, omics, mlstudio, votes, notifications and backwards compatiblity of s3 shares. by @SofiaSazonova, @noah-paige , @petrkalos and @dlpzx In addition, the following PR adds functional tests that ensure the permission model of data.all is not corrupted. * ⭐ Add resource permission checks by @petrkalos in #1711 ### Dependencies * Update FastAPI by @petrkalos in #1577 * update fastapi dependency by @noah-paige in #1699 * Upgrade "cross-spawn" to "7.0.5" by @dlpzx in #1701 * Bump python runtime to bump cdk klayers cryptography version by @noah-paige in #1707 ### Relates - List above ### Security Please answer the questions below briefly where applicable, or write `N/A`. Based on [OWASP 10](https://owasp.org/Top10/en/). - Does this PR introduce or modify any input fields or queries - this includes fetching data from storage outside the application (e.g. a database, an S3 bucket)? - Is the input sanitized? - What precautions are you taking before deserializing the data you consume? - Is injection prevented by parametrizing queries? - Have you ensured no `eval` or similar functions are used? - Does this PR introduce any functionality or component that requires authorization? - How have you ensured it respects the existing AuthN/AuthZ mechanisms? - Are you logging failed auth attempts? - Are you using or adding any cryptographic features? - Do you use a standard proven implementations? - Are the used keys controlled by the customer? Where are they stored? - Are you introducing any new policies/roles/users? - Have you used the least-privilege principle? How? By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license. --------- Co-authored-by: mourya-33 <[email protected]> Co-authored-by: Mourya Darivemula <[email protected]> Co-authored-by: Noah Paige <[email protected]> Co-authored-by: Petros Kalos <[email protected]> Co-authored-by: Sofia Sazonova <[email protected]> Co-authored-by: Sofia Sazonova <[email protected]>

dlpzx requested review from noah-paige and SofiaSazonova November 22, 2024 13:13

noah-paige reviewed Nov 22, 2024

View reviewed changes

Add permissions on list datasets for env group + cosmetic S3 Datasets

2c3208f

dlpzx force-pushed the fix/clean-code-6 branch from 0bd997b to 2c3208f Compare November 22, 2024 15:14

dlpzx requested a review from noah-paige November 22, 2024 15:28

SofiaSazonova approved these changes Nov 22, 2024

View reviewed changes

noah-paige approved these changes Nov 22, 2024

View reviewed changes

dlpzx merged commit c52a620 into main Nov 25, 2024
9 checks passed

dlpzx mentioned this pull request Nov 25, 2024

Fix integration tests for list_environment_datasets unauthorized cases #1720

Merged

dlpzx mentioned this pull request Dec 4, 2024

Security features v2.6.2 #1731

Closed

dlpzx mentioned this pull request Dec 9, 2024

2.6.2 Security features #1737

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add permissions on list datasets for env group + cosmetic S3 Datasets #1718

Add permissions on list datasets for env group + cosmetic S3 Datasets #1718

dlpzx commented Nov 22, 2024

noah-paige Nov 22, 2024

dlpzx Nov 22, 2024

Add permissions on list datasets for env group + cosmetic S3 Datasets #1718

Add permissions on list datasets for env group + cosmetic S3 Datasets #1718

Conversation

dlpzx commented Nov 22, 2024

Feature or Bugfix

Detail

Relates

Security

noah-paige Nov 22, 2024

Choose a reason for hiding this comment

dlpzx Nov 22, 2024

Choose a reason for hiding this comment