Add critical path scheduler to improve build times #2019

peterbell10 · 2021-08-25T22:41:14Z

This is based on #1333 and fixes #376.

I'm interesting in seeing #1333 merged, but unfortunately it seems to have languished. When I tried it, it was completely broken. This maintains the same principal, but reworks it to: address review comments, fix bugs, and improve time complexity of operations.

For each edge depended on by the targets being built, we calculate its priority as the longest path through the weighted directed graph of dependencies. Where weights come from the build log's last execution time. Ready build tasks are then always executed highest priority first.

#1333 maintained a std::list of all edges in priority order. This PR instead uses a std::priority_queue of only the ready edges (wrapped in the EdgeQueue class). This method has log time push, and pop so avoids the O(n) traversal we had with the list.

The existing algorithm doesn't work because it strictly requires that all outputs are visited before updating an edge. So any task downstream from a task with multiple out-edges may get ignored. The fix is to always propagate your critical time to the next input node, and only place it in the queue if you offer a higher critical time.

src/build.cc

peterbell10 · 2021-08-25T22:56:10Z

src/build.cc

+    active_edges.erase(e);
+
+    for (std::vector<Node*>::iterator it = e->inputs_.begin(),
+                                      end = e->inputs_.end();


I assumed C++11 isn't required yet from the code style used elsewhere. Is that correct?

Also means no std::chrono::duration<int64_t, std::milli> :-(

peterbell10 · 2021-08-25T22:56:46Z

src/build.cc

@@ -75,6 +75,16 @@ bool DryRunCommandRunner::WaitForCommand(Result* result) {

 }  // namespace

+
+bool EdgeQueue::EdgePriorityCompare::operator()(const Edge* e1, const Edge* e2) const {


graph.h which defines Edge isn't included in the header.

You could template it. (And if you really mean for it to be just Edge*, you can static_assert that the template is a forward-declared type. As in

class Edge; // Fwd declare ... template <typename EdgeType> [[nodiscard]] bool operator()(const EdgeType* e1, const EdgeType* e2) const { static_assert(std::is_same_v<EdgeType, Edge>); // Avoiding including `graph.h`. ...

src/graph.h

src/build.cc

src/build.h

src/build.cc

peterbell10 · 2021-08-27T20:26:11Z

BTW, here is an example of how the scheduler improves build times. The project I'm working on takes 20 minutes to compile normally, but goes down to 15 minutes with this PR and a primed .ninja_log file.

Before

After

AddTarget cannot add edges to the ready queue before the critical time has been computed.

kinke · 2022-03-23T02:25:10Z

Something worth noting about clean builds is it should still be a win.

In my case, the initial build took the same time as with ninja master (I've rebased this PR). What I very much like is that I can manually tweak the ninja cmdline and add known long-running targets there; adding such an object file reduced the initial build time from 33 seconds to 26 seconds. Subsequent builds with .ninja_log are done in 24 seconds. So I absolutely love it, thanks dude!

FWIW, some info about my very imbalanced build (few long-running processes) with -j24:

    Longest build steps:
           0.4 weighted s to build .reggae/objs/sil.objs/__dub__/core/source/kaleidic/sil/std/core/c... (8.4 s elapsed time)
           0.4 weighted s to build .reggae/objs/sil.objs/__dub__/core-datetime/source/kaleidic/sil/s... (9.3 s elapsed time)
           0.4 weighted s to build .reggae/objs/sil.objs/__dub__/extra/source/kaleidic/sil/std/extra... (10.2 s elapsed time)
           0.4 weighted s to build .reggae/objs/sil.objs/__dub__/extra-xlsx/source/kaleidic/sil/std/... (10.7 s elapsed time)
           0.5 weighted s to build .reggae/objs/sil.objs/__dub__/extra/source/kaleidic/sil/std/extra... (11.4 s elapsed time)
           0.5 weighted s to build .reggae/objs/sil.objs/__dub__/extra-technical/source/kaleidic/sil... (13.2 s elapsed time)
           0.6 weighted s to build .reggae/objs/sil.objs/__dub__/extra-kaleidic/source/kaleidic/sil/... (13.8 s elapsed time)
           0.8 weighted s to build .reggae/objs/sil.objs/__dub__/symmetry-imap/source/imap_auth.o (18.7 s elapsed time)
           0.8 weighted s to build .reggae/objs/sil.objs/__dub__/core/source/kaleidic/sil/std/core_a... (18.9 s elapsed time)
           4.1 weighted s to build sil (4.1 s elapsed time)
    Time by build-step type:
           4.1 s weighted time to generate 1 (no extension found) files (4.1 s elapsed time sum)
          19.5 s weighted time to generate 269 .o files (466.2 s elapsed time sum)
    23.5 s weighted time (470.3 s elapsed time sum, 20.0x parallelism)
    270 build steps completed, average of 11.48/s

ninja-build/ninja#2019, rebased onto latest master.

kinke · 2022-03-23T15:46:43Z

I should add that for primed builds with a .ninja_log, the peak memory requirements can increase significantly in case the target runtimes correlate with memory consumption - all heavy targets being scheduled first at the same time.

digit-google · 2022-04-06T18:35:25Z

Just to say that the current patch doesn't change the build time for Fuchsia builds (it may even slow it down by a few percent but it' s hard to be certain), however we rely heavily on pools, which may be limiting the benefit of the new algorithm.

ninja-build/ninja#2019, rebased onto latest master (including ninja-build/ninja#1866 for a deterministic and predictable build order, which hasn't landed in any official release yet).

hadrielk · 2022-06-03T16:34:25Z

@peterbell10 where did you get such a beautiful chart?

I've been staring at tables, waterfall charts and flame graphs, but I'd much prefer to see build-times your way.

def- · 2022-06-03T21:39:44Z

That's ninjatracing: https://github.com/nico/ninjatracing

To v1.11.0 + ninja-build/ninja#2019.

jhasse · 2022-08-22T21:35:50Z

Closing in favor of #2177.

Including ninja-build/ninja#2019 for adaptive scheduling.

kinke · 2023-07-30T10:55:59Z

FWIW, I've published a drop-in ninja release including this PR as main change, making the historical-build-times usage opt-in via --cp: https://github.com/symmetryinvestments/ninja/releases/tag/v1.11.1-sym1 (and https://github.com/symmetryinvestments/gha-setup-ninja along with it)

travisdowns · 2023-07-30T22:39:50Z

@kinke - awesome, I'm going to try this soon.

Just to clarify (since you can't create issues on forks), without --cp this release would work the same as upstream ninja 1.11.1?

kinke · 2023-07-31T01:41:28Z

Just to clarify (since you can't create issues on forks), without --cp this release would work the same as upstream ninja 1.11.1?

Nope; there's still the default prioritization by explicit cmdline order / node depth. --cp really only affects whether the previous .ninja_log timings are to be used.

nico and others added 5 commits August 25, 2021 11:48

support explicit build order

4af9fc5

Use explicit std:: style and remove debug print statements

12b5b7c

Change priority_list_ into a std::priority_queue of ready edges

8e23200

clang-format diff

c5d355c

peterbell10 force-pushed the cpsched branch from 9ad680a to c5d355c Compare August 25, 2021 22:44