Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

automation improvement - workflow templates and github action definitions #4234

Open
baywet opened this issue Nov 28, 2024 · 4 comments
Open
Labels
script Pull requests that update Bash or JavaScript code

Comments

@baywet
Copy link
Contributor

baywet commented Nov 28, 2024

During today's TDC call #4220 @handrews brought up the fact that we might have misalignment and duplication of workflows and scripts across repositories.
It of course make sense to rationalize that to improve reliability and reduce maintenance costs.
I promised during the call I'd recap my suggestions on an issue, here are two ways we could rationalize that:

Duplicated scripts should really be GitHub actions

GitHub actions can be defined to wrap up scripts and their runtime environment. Not only that enables easy reusability across multiple repos, it also makes consuming workflows simpler since they don't need to setup the runtime for the script (node, powershell, python, etc...)

Here is an example of a custom action that comments on pull requests based on an input file it parses. Note that it contains multiple things:

  • a dockerfile to setup what it needs.
  • an action.yaml that gives metadata about the action itself, inputs, outputs
  • the PowerShell script in this case (could really be anything)
  • (not present) a readme to document how to consume the action itself.

This action can now be consumed like any other in a workflow example

You can have multiple action definitions or a single one per repo, it's really up to you. And you can rely on workflows to test your action logic, making it more reliable.
Additionally, actions are versioned using git tags, and upgrades can be automated via dependabot.
Another benefit of this approach is to be able to run the script locally by just running the image, no additional requirements beyond docker.
 
One downside of using dockerfile is that now the container needs to be built for every execution of the consuming workflow. This is easily fixed by adding another workflow where the action definition builds the image and publishes it to GitHub artifacts, and updating the "image" field in the actions.yaml.

more information

Reusable workflows

I think this documentation page is great as an introduction.
The goal would be to have a central repo for all the reusable workflows, and refer to them in target repositories.

Additional solutions

I hope this provides good context for discussions.
Let me know if you have any additional comments or questions.

@handrews handrews added the script Pull requests that update Bash or JavaScript code label Dec 2, 2024
@ralfhandl
Copy link
Contributor

Looking at the three incarnations of the respec workflow in the Arazzo-, OpenAPI-, and Overlay-Specification repos, the common structure is

  1. Check out current branch
  2. Check out deployment target branch into subfolder
  3. Run HTML build script to modify subfolder
  4. Create pull request for deployment target branch from modified subfolder

Would we start with an action for step 3 to get rid of the three slightly different copies of the HTML build scripts?

Or would we want to have a (composite?) action that performs all four steps so that the three reduced workflows would only differ in the value of a "spec" action parameter?

The same could be used for the schema-publish workflow in the OpenAPI- and Overlay-Specification repos.

The schema-tests and validate-markdown workflows are trivial and I don't see the need for an action there.

@baywet
Copy link
Contributor Author

baywet commented Dec 13, 2024

I'd start with making 3 an action, it's going to be atomic and easier to test. You can always make a composite one after you've done so.

@ralfhandl
Copy link
Contributor

Step 3 looks to be the same in all three repos on the surface:

    - name: run main script
      run: scripts/md2html/build.sh

Inside the repo-specific build.sh the main difference is how the list of editors is constructed per specification version, with the most complicated flavor in the OAS repo:

cat > history/MAINTAINERS_v2.0.md <<EOF
## Active
* Jeremy Whitlock [@whitlockjc](https://github.com/whitlockjc)
* Marsh Gardiner [@earth2marsh](https://github.com/earth2marsh)
* Ron Ratovsky [@webron](https://github.com/webron)
* Tony Tam [@fehguy](https://github.com/fehguy)
EOF
cat > history/MAINTAINERS_v3.0.0.md <<EOF
## Active
* Jeremy Whitlock [@whitlockjc](https://github.com/whitlockjc)
* Marsh Gardiner [@earth2marsh](https://github.com/earth2marsh)
* Ron Ratovsky [@webron](https://github.com/webron)
* Tony Tam [@fehguy](https://github.com/fehguy)
## Emeritus
* Jason Harmon [@jharmn](https://github.com/jharmn)
EOF
git show c740e95:MAINTAINERS.md > history/MAINTAINERS_v3.0.1.md
git show 3140640:MAINTAINERS.md > history/MAINTAINERS_v3.0.2.md
cp history/MAINTAINERS_v3.0.2.md history/MAINTAINERS_v3.0.3.md
cp history/MAINTAINERS_v3.0.2.md history/MAINTAINERS_v3.1.0.md
#TODO: adjust commit for 3.0.4, 3.1.1
git show c3b88ed:EDITORS.md > history/MAINTAINERS_v3.0.4.md
cp history/MAINTAINERS_v3.0.4.md history/MAINTAINERS_v3.1.1.md
# add lines for 3.2.0, ...

This could be harmonized by materializing the editors lists as additional files versions/3.1.1-editors.md etc.

Then the build script can become repo-agnostic.

@OAI/tsc: any concerns?

@handrews
Copy link
Member

@ralfhandl makes sense to me. The whole thing with MAINTAINERS and EDITORS has become very convoluted. Just saving the correct snapshot per version seems much easier to understand, and then we don't have multiple overlapping files because the editors and TSC aren't quite the same set of people.

@ralfhandl ralfhandl moved this to In Progress in Automation & Infrastructure Dec 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
script Pull requests that update Bash or JavaScript code
Projects
Status: In Progress
Development

No branches or pull requests

3 participants