
Specify compute resources using process labels #219

Merged: 13 commits merged into develop from 154-Update_clusterOptions on Sep 28, 2020

Conversation

@cflerin (Member) commented Aug 21, 2020

Specifying all computing resources in clusterOptions is specific to grid computing systems and won't work with other executors (Google Pipelines, Kubernetes, AWS, etc.). Using the standard Nextflow directives (cpus, memory, etc.) is more portable and works on systems outside the VSC.
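
To make the change concrete, here is a minimal before/after sketch (the resource values and account name are illustrative, not the repo's actual defaults):

```groovy
// Before: everything packed into a grid-only option string (illustrative values)
process {
    clusterOptions = '-l nodes=1:ppn=2 -l pmem=8gb -l walltime=01:00:00 -A cluster_account'
}

// After: portable Nextflow directives; only the account stays grid-specific
process {
    cpus = 2
    memory = '8 GB'
    time = '1h'
    clusterOptions = '-A cluster_account'
}
```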

Major changes:

  • Process labels are defined in the main repo's conf/compute_resources.config (see the config sketch after this list).
    • Categories: default, high memory, high cpu, minimal (others could be added if needed).
    • clusterOptions is still available for grid-specific options (the cluster account -A parameter would go here).
  • Every process gets a "compute_resources__[...]" label (using the default profile unless something specific is needed).
  • The label definitions are copied into the config file with nextflow config ... and can be edited by the user (instead of being hard-coded into the processes).
  • Tools can use the top-level labels or define their own (e.g. cellranger, cellranger-atac, and scenic have tool-specific profiles).
  • The executor (local/pbs/other) is defined globally in the config and applies to all processes, but tool-specific configs can override it (to mix local and pbs processes).

(#154)
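
A rough sketch of what the label-based setup described above could look like (the label names follow the compute_resources__ pattern; the resource values, account, and tool override are illustrative placeholders):

```groovy
// conf/compute_resources.config (sketch; values are placeholders)
process {
    executor = 'pbs'   // global executor, applies to all processes

    withLabel: 'compute_resources__default' {
        cpus = 2
        memory = '20 GB'
        time = '1h'
        clusterOptions = '-A cluster_account'   // grid-specific options stay here
    }
    withLabel: 'compute_resources__mem' {       // high memory
        memory = '160 GB'
    }
    withLabel: 'compute_resources__cpu' {       // high cpu
        cpus = 20
    }
    withLabel: 'compute_resources__minimal' {
        cpus = 1
        memory = '1 GB'
    }

    // Tool-specific override: run these processes locally instead of via PBS
    withLabel: 'compute_resources__cellranger' {
        executor = 'local'
    }
}
```

Each process then just declares `label 'compute_resources__default'` (or a more specific label), and users can materialize and edit these defaults with something like `nextflow config <pipeline repo> > my.config` rather than editing the processes themselves.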


Submodule update progress:

  • cellranger
  • cellranger-atac
  • channels (no processes in this submodule)
  • directs
  • dropletutils
  • dropseqtools
  • edirect
  • fastp
  • flybaser
  • harmony
  • pcacv
  • picard
  • popscle
  • scanpy
  • scenic
  • scrublet
  • sratoolkit
  • star
  • utils

@dweemx (Contributor) left a comment


I think in general it's a nice idea because it cleans up the config quite nicely.
I'm wondering, however, how far we should go with the labels. For SCENIC, they go down to the process level, but for Cell Ranger they don't. Is there a reason for that? Usually mkfastq takes much less time than count, for instance.

conf/compute_resources.config (review thread, resolved)
@cflerin (Member, Author) commented Aug 21, 2020

It's a good point about label granularity. But looking at the existing code, and resource usage from previous runs, these seem to be more or less sufficient. There were only a few unique values for clusterOptions, and these are fully covered by the main default, mem, and minimal labels. The general workflows (single_sample or bbknn) are pretty lightweight. Still, it could be useful to have a few more general categories.

Scenic is the most cpu/memory intensive by far and so needs specific labels for each of the three main processes. The cellrangers have one label per tool and I just took these from the original clusterOptions, so this is how it has always been. But we could easily add one to set mkfastq to a 1h queue, for instance.
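
Something like this could do it, reusing the same mechanism (hypothetical label name, illustrative values):

```groovy
process {
    withLabel: 'compute_resources__cellranger_mkfastq' {
        cpus = 4
        memory = '40 GB'
        time = '1h'   // maps to walltime on PBS, which the scheduler can route to a shorter queue
    }
}
```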

@cflerin (Member, Author) commented Sep 25, 2020

Ok, I think the bulk of the PR is essentially done. I've added labels to all processes from all submodules, some with submodule-specific labels. The last thing to do is add some documentation...

@cflerin cflerin marked this pull request as ready for review September 25, 2020 14:23
@cflerin cflerin requested review from dweemx and KrisDavie September 25, 2020 14:24
@dweemx (Contributor) left a comment


Nice cleanup of the config & the code :)

@cflerin cflerin merged commit 2be35d8 into develop Sep 28, 2020
@cflerin cflerin deleted the 154-Update_clusterOptions branch September 28, 2020 10:29
@cflerin cflerin mentioned this pull request Sep 29, 2020