Skip to content

Git Repository organization — Glue Job location best practices #123

Answered by cnfait
ystoneman asked this question in Q&A
Discussion options

You must be logged in to vote

I would advise against storing the Glue job code in the datalakeLibrary repository. The content of this repository is built into a Lambda layer intended to be used by Lambda functions part of Stage A and Stage B. The code of the Glue job itself is of no use to them and will only make the layer bigger, which can be an issue.

What usually happens is storing the Glue job code with the Glue job's CloudFormation template inside a repository distinct from all the SDLF repositories. This repository can be called sdlf-transforms for example. Stage B (using the datalakeLibrary) runs a Glue job using a specific name so that's the only thing you need to be careful about.

There are alternatives to th…

Replies: 2 comments 6 replies

Comment options

You must be logged in to vote
6 replies
@tommycheek
Comment options

@cnfait
Comment options

@cnfait
Comment options

@tommycheek
Comment options

@cnfait
Comment options

Answer selected by ystoneman
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants