Native MarkSweep: bad load balancing with single-threaded workloads

Currently, we parallelize the sweeping work by making one work packet for the global pool and one packet for each mutator.  It is OK for multi-threaded work loads, but when there is only one mutator, it hits a pathological case where the Release stage is dominated by a single long-running `ReleaseMutator` work packet.  Here is a timeline captured using eBPF when executing the Liquid benchmark using the Ruby binding (a single mutator, but multiple GC workers)

![image](https://github.com/mmtk/mmtk-core/assets/370317/b4a3baca-8af0-44ce-b787-15938b833fc9)

In comparison, here is the timeline for the lusearch benchmark in the DaCapo Chopin benchmark suite (with eager-sweeping force-enabled).  The parallel sweeping of mutators is better, but the `Release` work packet is not parallelized with `ReleaseMutator`

![image](https://github.com/mmtk/mmtk-core/assets/370317/cef45d0f-4b01-4870-9da7-d3fe32447764)

We should parallelize it by making work packets, each releasing a reasonable amount blocks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Native MarkSweep: bad load balancing with single-threaded workloads #1146

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Native MarkSweep: bad load balancing with single-threaded workloads #1146

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions