Take advantage of larger job runners in CI build tests #921

mhucka · 2025-12-08T05:18:34Z

Thanks to the help of Google's ML Velocity team, the TensorFlow Quantum project has access to larger job runners. We can use them for the really time-consuming jobs in the CI build checks workflow, which are the library build and the tutorial tests.

In addition, this PR makes a few other small adjustments:

The tutorial-tests job does not need to depend on wheel-build to finish. (Maybe it did in the past?) Consequently, we can take out the dependency and have all 3 jobs run in parallel, which speeds up on the overall workflow.
In two places where ./configure.sh was invoked, that step was immediately followed by a step to run ./scripts/build_pip_package_test.sh, which also runs ./configure.sh. We can take out the redundant invocations of configure.sh in this workflow.

Note: I didn't change the wheel-build job to use the new runners because it's not a bottleneck – the most time consuming job in here is the tutorials tests – the wheel-build job is not the bottleneck. The larger runners are more expensive ($/per minute) to run, so if we can't benefit from them, it doesn't make sense to use them.

Here is an example of changed workflow run-times. First, a sample of what it is before changes:

And now with the workflow changes:

A typical run of the build tests has gone from ~22 minutes to ~7 min (approx 1/3 of what it used to be); that speedup is due to the use of the new ML team runners. The overall time has gone down from ~24 min to ~16 min, or about 2/3 of what it used to be. The bottleneck is the tutorial tests. The time for doing the tutorial tests has barely improved because the tutorial test script does not take advantage of parallelism. (Something to be improved in the future.)

In two places where `./configure.sh` was invoked, the step was immediately followed by running `./scripts/build_pip_package_test.sh`, which also runs `./configure.sh`.

Maybe there was a dependency between them in the past, but there isn't one now. Removing the `needs:` property lets all 3 jobs run in parallel, speeding up the whole CI checks workflow.

Thanks to the help of Google's ML Velocity team, the TensorFlow Quantum project has access to larger job runners. We can use them for the really time-consuming jobs in our workflows. It took a fair amount of trial-and-error testing to resolve some odd differences in the runner environments, but eventually I got it down to just a couple of additional commands. Note: I didn't change the `wheel-build` job to use the new runners because the most time-consuming job in here is the tutorials tests and the wheel-build job is not the bottleneck. The larger runners are more expensive, so if we can't benefit from them, it doesn't make sense to use them.

mhucka added 3 commits December 8, 2025 04:56

Take out redundant invocations of configure.sh

36819e2

In two places where `./configure.sh` was invoked, the step was immediately followed by running `./scripts/build_pip_package_test.sh`, which also runs `./configure.sh`.

tutorial-tests doesn't need to depend on wheel-build

9cbe265

Maybe there was a dependency between them in the past, but there isn't one now. Removing the `needs:` property lets all 3 jobs run in parallel, speeding up the whole CI checks workflow.

mhucka added the area/devops Involves build systems, Make files, Bazel files, continuous integration, and/or other DevOps topics label Dec 8, 2025

mhucka marked this pull request as ready for review December 8, 2025 05:36

MichaelBroughton approved these changes Dec 8, 2025

View reviewed changes

Merge branch 'master' into mh-use-ml-runners

6e0b2b3

mhucka enabled auto-merge (squash) December 8, 2025 18:00

mhucka merged commit ca6e113 into tensorflow:master Dec 8, 2025
10 checks passed

mhucka deleted the mh-use-ml-runners branch December 8, 2025 18:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Take advantage of larger job runners in CI build tests #921

Take advantage of larger job runners in CI build tests #921

mhucka commented Dec 8, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Take advantage of larger job runners in CI build tests #921

Take advantage of larger job runners in CI build tests #921

Conversation

mhucka commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mhucka commented Dec 8, 2025 •

edited

Loading