CUMULUS-2688: Update bulk operations to fetch granules by unique columns #3000

npauzenga · 2022-06-17T13:40:59Z

Summary:

This is PR 2/3 for the above ticket. This PR address the changes to bulk operations. When a bulk operation operates on granules it now needs to accept and fetch granules by granuleId + collectionId, not just granuleId. This is because the unique identifiers changed in the Postgres switchover.

Changes

Updates BULK_GRANULE, BULK_GRANULE_DELETE, and BULK_GRANULE_REINGEST operations in bulk-operations to support collectionId
Updates integration tests to use unique identifiers

PR Checklist

Update CHANGELOG
Unit tests
Ad-hoc testing - Deploy changes and test manually
Integration tests

…on cumulus ID

…ule-endpoint

…/CUMULUS-2688-bulk-operation

npauzenga · 2022-06-22T18:33:23Z

CHANGELOG.md

@@ -6,6 +6,12 @@ The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/).

 ## Unreleased Phase 3

+### Breaking Changes


Wondering if there should be migration instructions for this... 🤔.

Definitely requires an update to https://github.com/nasa/cumulus-api/

Yeah, good call. I'll update that repo once all three tickets for 2688 are in as they all have changes that'll need to be noted. It looks like we might need a new rds-phase-3 branch there that gets released at the same time as the main cumulus phase 3 release too 🤔

nasa/cumulus-api#321

npauzenga · 2022-06-22T18:35:58Z

packages/api/lambdas/bulk-operation.js

 const { reingestGranule, applyWorkflow } = require('../lib/ingest');

 const log = new Logger({ sender: '@cumulus/bulk-operation' });

 async function applyWorkflowToGranules({
-  granuleIds,
+  granules,


These aren't really granules. They're { collectionId, granuleId}. If this is confusing I can rename.

Yeah, I think it should be something better across the stack, esp at the endpoint level if we're renaming there anyway. I just can't decide on a good name. 🤔

Yeah that's where I'm getting stuck. It's really granuleUniqueIdsButNotCumulusIds 😄. Though the API doesn't return cumulus_id at all so maybe it would only be slightly confusing on the dev side.

We could do something like granuleAndCollectionIds but that could also be ambiguous because "ID" can mean a couple things in our schema. It highlights the problem of not exposing cumulus_id to these endpoints/users.

The more I think about it the more I'm leaning towards keeping granules. My reasoning (subject to change) is that you could pass an entire API Granule object and it would work. It's just that we really only need granuleId and collectionId.

I can't find a convincing reason to disagree.

nemreid

All the changes look fine, but I'd really like to avoid overloading granules even more in this way. Let's discuss regarding another term when you're available.

…ration

npauzenga added 15 commits June 6, 2022 16:14

Add endpoint configuration to get granule by collection and granule ID

cbe0e44

add test for getGranuleByUniqueColumns

c6a12dd

add docs, export new granules function, fix test

1b519a0

switch to collectionId instead of collection_cumulus_id for granule GET

a0d78db

Fix merge error

120ced7

Update granule get to throw correct errors

3be9fb1

update api-client configuration with collectionId instead of collecti…

2588d6c

…on cumulus ID

fix lint errors

cea0306

refactor api-client function and deprecate lib function

15415c3

Update CHANGELOG.md

54734da

update docs

720805b

Merge branch 'feature/rds-phase-3' into feature/CUMULUS-2688-new-gran…

1fdde4b

…ule-endpoint

update tests for bulk operation granule lookups using collectionId

4804dff

Merge branch 'feature/CUMULUS-2688-new-granule-endpoint' into feature…

3af1c65

…/CUMULUS-2688-bulk-operation

fix remaining bulk operation tests

ee8e1f3

npauzenga added the work in progress label Jun 17, 2022

npauzenga added 9 commits June 17, 2022 13:34

Refactor tests for less duplication

f3d62d5

fix lint errors

2006805

update bulk delete test for granule lookups

5a89f58

Update CHANGELOG.md

20c8242

fix lint errors

c15136a

update endpoint logic for getting granules by unique keys

393d153

fix lint errors

709a18f

Update integration test to fetch granule by unique columns

886a739

update integration tests to fetch granules by unique columns

aff0ca6

npauzenga commented Jun 22, 2022

View reviewed changes

npauzenga changed the title ~~WIP: Feature/cumulus 2688 bulk operation~~ CUMULUS-2688: Update bulk operations to fetch granules by unique columns Jun 22, 2022

npauzenga added needs review and removed work in progress labels Jun 22, 2022

npauzenga mentioned this pull request Jun 23, 2022

CUMULUS-2688: New Granule endpoint GET /:collectionId/:granuleId #2978

Merged

4 tasks

nemreid added in review and removed needs review labels Jun 28, 2022

nemreid suggested changes Jul 9, 2022

View reviewed changes

nemreid added pr feedback and removed in review labels Jul 9, 2022

Base automatically changed from feature/CUMULUS-2688-new-granule-endpoint to feature/rds-phase-3 July 19, 2022 21:00

npauzenga mentioned this pull request Jul 22, 2022

CUMULUS-2688: Granule doc updates nasa/cumulus-api#321

Merged

npauzenga added needs review and removed pr feedback labels Jul 22, 2022

Merge branch 'feature/rds-phase-3' into feature/CUMULUS-2688-bulk-ope…

1c28013

…ration

nemreid approved these changes Jul 25, 2022

View reviewed changes

Jkovarik added in review and removed needs review in review labels Jul 25, 2022

npauzenga added 2 commits July 25, 2022 15:08

fix lint and merge error

cc0c468

Merge branch 'feature/rds-phase-3' into feature/CUMULUS-2688-bulk-ope…

3e445aa

…ration

npauzenga merged commit fee1280 into feature/rds-phase-3 Jul 26, 2022

npauzenga deleted the feature/CUMULUS-2688-bulk-operation branch July 26, 2022 16:36

Jkovarik mentioned this pull request Jun 1, 2023

Jk/cumulus 3135 fix integration tests #3403

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUMULUS-2688: Update bulk operations to fetch granules by unique columns #3000

CUMULUS-2688: Update bulk operations to fetch granules by unique columns #3000

npauzenga commented Jun 17, 2022 •

edited

Loading

npauzenga Jun 22, 2022

nemreid Jul 8, 2022

npauzenga Jul 20, 2022

npauzenga Jul 22, 2022

npauzenga Jun 22, 2022

nemreid Jul 6, 2022

npauzenga Jul 20, 2022 •

edited

Loading

nemreid Jul 25, 2022

nemreid left a comment

		@@ -6,6 +6,12 @@ The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/).

		## Unreleased Phase 3

		### Breaking Changes

CUMULUS-2688: Update bulk operations to fetch granules by unique columns #3000

CUMULUS-2688: Update bulk operations to fetch granules by unique columns #3000

Conversation

npauzenga commented Jun 17, 2022 • edited Loading

Changes

PR Checklist

npauzenga Jun 22, 2022

Choose a reason for hiding this comment

nemreid Jul 8, 2022

Choose a reason for hiding this comment

npauzenga Jul 20, 2022

Choose a reason for hiding this comment

npauzenga Jul 22, 2022

Choose a reason for hiding this comment

npauzenga Jun 22, 2022

Choose a reason for hiding this comment

nemreid Jul 6, 2022

Choose a reason for hiding this comment

npauzenga Jul 20, 2022 • edited Loading

Choose a reason for hiding this comment

nemreid Jul 25, 2022

Choose a reason for hiding this comment

nemreid left a comment

Choose a reason for hiding this comment

npauzenga commented Jun 17, 2022 •

edited

Loading

npauzenga Jul 20, 2022 •

edited

Loading