- Workflow Description Language
- Language Specification
- Namespaces
- Scope
- Optional Parameters & Type Constraints
- Scatter / Gather
- Variable Resolution
- Computing Inputs
- Type Coercion
- Standard Library
- File stdout()
- File stderr()
- Array[String] read_lines(String|File)
- Array[Array[String]] read_tsv(String|File)
- Map[String, String] read_map(String|File)
- Object read_object(String|File)
- Array[Object] read_objects(String|File)
- mixed read_json(String|File)
- Int read_int(String|File)
- String read_string(String|File)
- Float read_float(String|File)
- Boolean read_boolean(String|File)
- File write_lines(Array[String])
- File write_tsv(Array[Array[String]])
- File write_map(Map[String, String])
- File write_object(Object)
- File write_objects(Array[Object])
- File write_json(mixed)
- Float size(File, [String])
- String sub(String, String, String)
- Array[Int] range(Int)
- Array[Array[X]] transpose(Array[Array[X]])
- Array[Pair(X,Y)] zip(Array[X], Array[Y])
- Array[Pair(X,Y)] cross(Array[X], Array[Y])
- Integer length(Array[X])
- Array[String] prefix(String, Array[X])
- X select_first(Array[X?])
- Array[X] flatten(Array[Array[X]])
- Array[X] select_all(Array[X?])
- Boolean defined(X?)
- String basename(String)
- Int floor(Float), Int ceil(Float) and Int round(Float)
- Data Types & Serialization
- Serialization of Task Inputs
- De-serialization of Task Outputs
WDL is meant to be a human readable and writable way to express tasks and workflows. The "Hello World" tool in WDL would look like this:
task hello {
String pattern
File in
command {
egrep '${pattern}' '${in}'
}
runtime {
docker: "broadinstitute/my_image"
}
output {
Array[String] matches = read_lines(stdout())
}
}
workflow wf {
call hello
}
This describes a task, called 'hello', which has two parameters (String pattern and File in). A task definition is a way of encapsulating a UNIX command and environment and presenting them as functions. Tasks have both inputs and outputs. Inputs are declared as declarations at the top of the task definition, while outputs are defined in the output section.
The user must provide a value for these two parameters in order for this task to be runnable. Implementations of WDL should accept their inputs in JSON format. For example, the above task needs values for its two parameters, String pattern and File in:
Variable | Value |
---|---|
wf.hello.pattern | ^[a-z]+$ |
wf.hello.in | /file.txt |
Or, in JSON format:
{
"wf.hello.pattern": "^[a-z]+$",
"wf.hello.in": "/file.txt"
}
Running the wf
workflow with these parameters would yield a command line from the call hello
:
egrep '^[a-z]+$' '/file.txt'
A simple workflow that runs this task in parallel would look like this:
workflow example {
Array[File] files
scatter(path in files) {
call hello {input: in=path}
}
}
The inputs to this workflow would be example.files
and example.hello.pattern
.
17 August 2015
- Added concept of fully-qualified-name as well as namespace identifier.
- Changed task definitions to have all inputs as declarations.
- Changed command parameters (${...}) to accept expressions and fewer "declarative" elements; command parameters are also required to evaluate to primitive types
- Added an output section to workflows
- Added a lot of functions to the standard library for serializing/deserializing WDL values
- Specified scope, namespace, and variable resolution semantics
These are common among many of the following sections
$ws = (0x20 | 0x9 | 0xD | 0xA)+
$identifier = [a-zA-Z][a-zA-Z0-9_]+
$string = "([^\\\"\n]|\\[\\"\'nrbtfav\?]|\\[0-7]{1,3}|\\x[0-9a-fA-F]+|\\[uU]([0-9a-fA-F]{4})([0-9a-fA-F]{4})?)*"
$string = '([^\\\'\n]|\\[\\"\'nrbtfav\?]|\\[0-7]{1,3}|\\x[0-9a-fA-F]+|\\[uU]([0-9a-fA-F]{4})([0-9a-fA-F]{4})?)*'
$boolean = 'true' | 'false'
$integer = [1-9][0-9]*|0[xX][0-9a-fA-F]+|0[0-7]*
$float = (([0-9]+)?\.([0-9]+)|[0-9]+\.|[0-9]+)([eE][-+]?[0-9]+)?
$string can accept the following between single or double-quotes:
- Any character not in the set: \\, " (or ' for single-quoted strings), \n
- An escape sequence starting with \\, followed by one of the following characters: \\, ", ', [nrbtfav], ?
- An escape sequence starting with \\, followed by 1 to 3 digits of value 0 through 7 inclusive. This specifies an octal escape code.
- An escape sequence starting with \\x, followed by hexadecimal characters 0-9a-fA-F. This specifies a hexadecimal escape code.
- An escape sequence starting with \\u or \\U followed by either 4 or 8 hexadecimal characters 0-9a-fA-F. This specifies a unicode code point.
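For illustration, all of the following literals match the rules above (the declarations themselves are arbitrary examples, not part of any task in this document):
String s1 = "hello\tworld\n"
String s2 = 'single-quoted with a \x41 escape'
Int i1 = 42
Int i2 = 0x2A
Int i3 = 052
Float f1 = 6.02e23
Boolean b1 = true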
All inputs and outputs must be typed.
$type = ($primitive_type | $array_type | $map_type | $object_type) $type_postfix_quantifier?
$primitive_type = ('Boolean' | 'Int' | 'Float' | 'File' | 'String')
$array_type = 'Array' '[' ($primitive_type | $object_type | $array_type) ']'
$object_type = 'Object'
$map_type = 'Map' '[' $primitive_type ',' ($primitive_type | $array_type | $map_type | $object_type) ']'
$type_postfix_quantifier = '?' | '+'
Some examples of types:
File
Array[File]
Map[String, String]
Object
Types can also have a $type_postfix_quantifier (either ? or +):
- ? means that the value is optional. Any expressions that fail to evaluate because this value is missing will evaluate to the empty string.
- + can only be applied to Array types, and it signifies that the array is required to have one or more values in it.
For more details on the $type_postfix_quantifier, see the section on Optional Parameters & Type Constraints.
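For illustration, a few declarations using these quantifiers (the identifiers are arbitrary):
String? maybe_name        # optional; may be left unset
Array[File]+ input_bams   # must contain at least one element
Array[Int]? maybe_ints    # an optional array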
For more information on types and how they are used to construct commands and define outputs of tasks, see the Data Types & Serialization section.
$fully_qualified_name = $identifier ('.' $identifier)*
$namespaced_identifier = $identifier ('.' $identifier)*
A fully qualified name is the unique identifier of any particular call
or call input or output. For example:
other.wdl
task foobar {
File in
command {
sh setup.sh ${in}
}
output {
File results = stdout()
}
}
main.wdl
import "other.wdl" as other
task test {
String my_var
command {
./script ${my_var}
}
output {
File results = stdout()
}
}
workflow wf {
Array[String] arr = ["a", "b", "c"]
call test
call test as test2
call other.foobar
output {
test.results
foobar.results
}
scatter(x in arr) {
call test as scattered_test {
input: my_var=x
}
}
}
The following fully-qualified names would exist within workflow wf in main.wdl:
- wf - References the top-level workflow
- wf.test - References the first call to task test
- wf.test2 - References the second call to task test (aliased as test2)
- wf.test.my_var - References the String input of the first call to task test
- wf.test.results - References the File output of the first call to task test
- wf.test2.my_var - References the String input of the second call to task test
- wf.test2.results - References the File output of the second call to task test
- wf.foobar.results - References the File output of the call to other.foobar
- wf.foobar.in - References the File input of the call to other.foobar
- wf.arr - References the Array[String] declaration on the workflow
- wf.scattered_test - References the scattered version of call test
- wf.scattered_test.my_var - References an Array[String] of the values used as my_var when running the scattered version of call test
- wf.scattered_test.results - References an Array[File] containing the accumulated results from scattering call test
- wf.scattered_test.1.results - References a File from the second invocation (0-indexed) of call test within the scatter block. This particular invocation used value "b" for my_var
A namespaced identifier has the same syntax as a fully-qualified name. It is interpreted as the left-hand side being the name of a namespace and then the right-hand side being the name of a workflow, task, or namespace within that namespace. Consider this workflow:
import "other.wdl" as ns
workflow wf {
call ns.ns2.task
}
Here, ns.ns2.task
is a namespaced identifier (see the Call Statement section for more details). Namespaced identifiers, like fully-qualified names, are left-associative, which means ns.ns2.task
is interpreted as ((ns.ns2).task)
, which means ns.ns2
would have to resolve to a namespace so that .task
could be applied. If ns2
was a task definition within ns
, then this namespaced identifier would be invalid.
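As a sketch of a layout under which a two-level namespaced identifier would resolve (the file names and the task name mytask here are hypothetical):
more_tasks.wdl
task mytask {
command { ./script.sh }
}
other.wdl
import "more_tasks.wdl" as ns2
main.wdl
import "other.wdl" as ns
workflow wf {
call ns.ns2.mytask
}
Here ns resolves to the namespace created by importing other.wdl, ns.ns2 resolves to the namespace that other.wdl creates by importing more_tasks.wdl, and only then is .mytask applied.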
$declaration = $type $identifier ('=' $expression)?
Declarations appear at the top of any scope.
In a task definition, declarations are interpreted as inputs to the task that are not part of the command line itself.
If a declaration does not have an initialization, then the value is expected to be provided by the user before the workflow or task is run.
Some examples of declarations:
File x
String y = "abc"
Float pi = 3 + .14
Map[String, String] m
A declaration may also refer to elements that are outputs of tasks. For example:
task test {
String var
command {
./script ${var}
}
output {
String value = read_string(stdout())
}
}
task test2 {
Array[String] array
command {
./script ${write_lines(array)}
}
output {
Int value = read_int(stdout())
}
}
workflow wf {
call test as x {input: var="x"}
call test as y {input: var="y"}
Array[String] strs = [x.value, y.value]
call test2 as z {input: array=strs}
}
strs
in this case would not be defined until both call test as x
and call test as y
have successfully completed. Before that's the case, strs
is undefined. If any of the two tasks fail, then evaluation of strs
should return an error to indicate that the call test2 as z
operation should be skipped.
$expression = '(' $expression ')'
$expression = $expression '.' $expression
$expression = $expression '[' $expression ']'
$expression = $expression '(' ($expression (',' $expression)*)? ')'
$expression = '!' $expression
$expression = '+' $expression
$expression = '-' $expression
$expression = if $expression then $expression else $expression
$expression = $expression '*' $expression
$expression = $expression '%' $expression
$expression = $expression '/' $expression
$expression = $expression '+' $expression
$expression = $expression '-' $expression
$expression = $expression '<' $expression
$expression = $expression '<=' $expression
$expression = $expression '>' $expression
$expression = $expression '>=' $expression
$expression = $expression '==' $expression
$expression = $expression '!=' $expression
$expression = $expression '&&' $expression
$expression = $expression '||' $expression
$expression = '{' ($expression ':' $expression)* '}'
$expression = '[' $expression* ']'
$expression = $string | $integer | $float | $boolean | $identifier
Below are the valid results for operators on types. Any combination not in the list will result in an error.
LHS Type | Operators | RHS Type | Result | Semantics |
---|---|---|---|---|
Boolean | == | Boolean | Boolean | |
Boolean | != | Boolean | Boolean | |
Boolean | > | Boolean | Boolean | |
Boolean | >= | Boolean | Boolean | |
Boolean | < | Boolean | Boolean | |
Boolean | <= | Boolean | Boolean | |
Boolean | \|\| | Boolean | Boolean | |
Boolean | && | Boolean | Boolean | |
File | + | File | File | Append file paths |
File | == | File | Boolean | |
File | != | File | Boolean | |
File | + | String | File | |
File | == | String | Boolean | |
File | != | String | Boolean | |
Float | + | Float | Float | |
Float | - | Float | Float | |
Float | * | Float | Float | |
Float | / | Float | Float | |
Float | % | Float | Float | |
Float | == | Float | Boolean | |
Float | != | Float | Boolean | |
Float | > | Float | Boolean | |
Float | >= | Float | Boolean | |
Float | < | Float | Boolean | |
Float | <= | Float | Boolean | |
Float | + | Int | Float | |
Float | - | Int | Float | |
Float | * | Int | Float | |
Float | / | Int | Float | |
Float | % | Int | Float | |
Float | == | Int | Boolean | |
Float | != | Int | Boolean | |
Float | > | Int | Boolean | |
Float | >= | Int | Boolean | |
Float | < | Int | Boolean | |
Float | <= | Int | Boolean | |
Float | + | String | String | |
Int | + | Float | Float | |
Int | - | Float | Float | |
Int | * | Float | Float | |
Int | / | Float | Float | |
Int | % | Float | Float | |
Int | == | Float | Boolean | |
Int | != | Float | Boolean | |
Int | > | Float | Boolean | |
Int | >= | Float | Boolean | |
Int | < | Float | Boolean | |
Int | <= | Float | Boolean | |
Int | + | Int | Int | |
Int | - | Int | Int | |
Int | * | Int | Int | |
Int | / | Int | Int | Integer division |
Int | % | Int | Int | Integer division, return remainder |
Int | == | Int | Boolean | |
Int | != | Int | Boolean | |
Int | > | Int | Boolean | |
Int | >= | Int | Boolean | |
Int | < | Int | Boolean | |
Int | <= | Int | Boolean | |
Int | + | String | String | |
String | + | Float | String | |
String | + | Int | String | |
String | + | String | String | |
String | == | String | Boolean | |
String | != | String | Boolean | |
String | > | String | Boolean | |
String | >= | String | Boolean | |
String | < | String | Boolean | |
String | <= | String | Boolean | |
| | - | Float | Float | |
| | + | Float | Float | |
| | - | Int | Int | |
| | + | Int | Int | |
| | ! | Boolean | Boolean | |
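A few declarations illustrating entries from the table above (the identifiers are arbitrary):
Int i = 7 / 2              # Int / Int -> Int, integer division (3)
Int r = 7 % 2              # Int % Int -> Int, remainder (1)
Float f = 7 / 2.0          # Int / Float -> Float (3.5)
String s = "n=" + 7        # String + Int -> String ("n=7")
Boolean b = 2.0 == 2       # Float == Int -> Boolean (true)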
This is an operator that takes three arguments: a condition expression, an if-true expression, and an if-false expression. The condition is always evaluated. If the condition is true, the if-true value is evaluated and returned. If the condition is false, the if-false expression is evaluated and returned. Both sides of the if-then-else should have the same type, regardless of which side is evaluated, or runtime problems might occur.
Examples:
- Choose whether to say "good morning" or "good afternoon":
Boolean morning = ...
String greeting = "good " + if morning then "morning" else "afternoon"
- Choose how much memory to use for a task:
Int array_length = length(array)
runtime {
memory: if array_length > 100 then "16GB" else "8GB"
}
Precedence | Operator type | Associativity | Example |
---|---|---|---|
12 | Grouping | n/a | (x) |
11 | Member Access | left-to-right | x.y |
10 | Index | left-to-right | x[y] |
9 | Function Call | left-to-right | x(y,z,...) |
8 | Logical NOT | right-to-left | !x |
8 | Unary Plus | right-to-left | +x |
8 | Unary Negation | right-to-left | -x |
7 | Multiplication | left-to-right | x*y |
7 | Division | left-to-right | x/y |
7 | Remainder | left-to-right | x%y |
6 | Addition | left-to-right | x+y |
6 | Subtraction | left-to-right | x-y |
5 | Less Than | left-to-right | x<y |
5 | Less Than Or Equal | left-to-right | x<=y |
5 | Greater Than | left-to-right | x>y |
5 | Greater Than Or Equal | left-to-right | x>=y |
4 | Equality | left-to-right | x==y |
4 | Inequality | left-to-right | x!=y |
3 | Logical AND | left-to-right | x&&y |
2 | Logical OR | left-to-right | x\|\|y |
1 | Assignment | right-to-left | x=y |
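For illustration (hypothetical declarations), these precedence rules mean the following expressions evaluate as annotated:
Int a = 2 + 3 * 4           # 14: '*' (precedence 7) binds tighter than '+' (precedence 6)
Boolean b = 1 + 1 == 2      # true: addition is evaluated before '==' (precedence 4)
Boolean c = !true == false  # true: '!' applies to 'true' first, then '==' compares the results
Int d = 2 * (3 + 4)         # 14: grouping overrides the default order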
The syntax x.y
refers to member access. x
must be an object or task in a workflow. A Task can be thought of as an object where the attributes are the outputs of the task.
workflow wf {
Object obj
Object foo
# This would cause a syntax error,
# because foo is defined twice in the same namespace.
call foo {
input: var=obj.attr # Object attribute
}
call foo as foo2 {
input: var=foo.out # Task output
}
}
The syntax x[y]
is for indexing maps and arrays. If x
is an array, then y
must evaluate to an integer. If x
is a map, then y
must evaluate to a key in that map.
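For example (hypothetical declarations):
Array[String] strs = ["a", "b", "c"]
Map[String, Int] counts = {"foo": 1, "bar": 2}
String first = strs[0]        # "a"
Int bar_count = counts["bar"] # 2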
Given a Pair x
, the left and right elements of that type can be accessed using the syntax x.left
and x.right
.
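For example (hypothetical declarations):
Pair[Int, String] p = (1, "one")
Int left_val = p.left       # 1
String right_val = p.right  # "one"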
Function calls, in the form of func(p1, p2, p3, ...)
, are either standard library functions or engine-defined functions.
In this current iteration of the spec, users cannot define their own functions.
Array values can be specified using Python-like syntax, as follows:
Array[String] a = ["a", "b", "c"]
Array[Int] b = [0,1,2]
Map values can be specified using a similar Python-like syntax:
Map[Int, Int] m1 = {1: 10, 2: 11}
Map[String, Int] m2 = {"a": 1, "b": 2}
Pair values can be specified inside of a WDL using another Python-like syntax, as follows:
Pair[Int, String] twenty_threes = (23, "twenty-three")
Pair values can also be specified within the workflow inputs JSON with a Left
and Right
value specified using JSON style syntax. For example, given a workflow wf_hello
and workflow-level variable twenty_threes
, it could be declared in the workflow inputs JSON as follows:
{
"wf_hello.twenty_threes": { "Left": 23, "Right": "twenty-three" }
}
$document = ($import | $task | $workflow)+
$document
is the root of the parse tree and it consists of one or more import statements, task definitions, or workflow definitions.
🐖 Coming soon in Cromwell
A WDL file may contain import statements to include WDL code from other sources
$import = 'import' $ws+ $string ($ws+ 'as' $ws+ $identifier)?
The import statement specifies a $string which is to be interpreted as a URI pointing to a WDL file. The engine is responsible for resolving the URI and downloading the contents. The contents of the document at each URI must be WDL source code.
Every imported WDL file requires a namespace which can be specified using an identifier (via the as $identifier
syntax). If you do not explicitly specify a namespace identifier then the default namespace is the filename of the imported WDL, minus the .wdl extension.
For all imported WDL files, the tasks and workflows imported from that file will only be accessible through that assigned namespace.
import "http://example.com/lib/analysis_tasks" as analysis
import "http://example.com/lib/stdlib"
workflow wf {
File bam_file
# file_size is from "http://example.com/lib/stdlib"
call stdlib.file_size {
input: file=bam_file
}
call analysis.my_analysis_task {
input: size=file_size.bytes, file=bam_file
}
}
Engines should at the very least support the following protocols for import URIs:
- http:// and https://
- file://
- no protocol (which should be interpreted as file://)
A task is a declarative construct with a focus on constructing a command from a template. The command specification is interpreted in an engine specific way, though a typical case is that a command is a UNIX command line which would be run in a Docker image.
Tasks also define their outputs, which is essential for building dependencies between tasks. Any other data specified in the task definition (e.g. runtime information and meta-data) is optional.
$task = 'task' $ws+ $identifier $ws* '{' $ws* $declaration* $task_sections $ws* '}'
For example, task name { ... }
. The task's sections are defined inside the curly braces.
The task has one or more sections:
$task_sections = ($command | $runtime | $task_output | $parameter_meta | $meta)+
Additional requirement: Exactly one
$command
section needs to be defined, preferably as the first section.
$command = 'command' $ws* '{' (0xA | 0xD)* $command_part+ $ws+ '}'
$command = 'command' $ws* '<<<' (0xA | 0xD)* $command_part+ $ws+ '>>>'
A command is a task section that starts with the keyword 'command', and is enclosed in curly braces or <<<
>>>
. The body of the command specifies the literal command line to run with placeholders ($command_part_var
) for the parts of the command line that need to be filled in.
$command_part = $command_part_string | $command_part_var
$command_part_string = ^'${'+
$command_part_var = '${' $var_option* $expression '}'
The parser should read characters from the command line until it reaches a ${ character sequence; everything read up to that point is interpreted as a literal string ($command_part_string).
The parser should interpret any variable enclosed in ${
...}
as a $command_part_var
.
The $expression
usually references declarations at the task level. For example:
task test {
String flags
command {
ps ${flags}
}
}
In this case flags
within the ${
...}
is an expression. The $expression
can also be more complex, like a function call: write_lines(some_array_value)
NOTE: the $expression in this context can only evaluate to a primitive type (e.g. not Array, Map, or Object). The only exception to this rule is when sep is specified as one of the $var_option fields.
As another example, consider how the parser would parse the following command:
grep '${start}...${end}' ${input}
This command would be parsed as:
- grep ' - command_part_string
- ${start} - command_part_var
- ... - command_part_string
- ${end} - command_part_var
- ' - command_part_string
- ${input} - command_part_var
$var_option = $var_option_key $ws* '=' $ws* $var_option_value
$var_option_key = 'sep' | 'true' | 'false' | 'quote' | 'default'
$var_option_value = $expression
The $var_option
is a set of key-value pairs for any additional and less-used options that need to be set on a parameter.
'sep' is interpreted as the separator string used to join multiple parameters together. sep
is only valid if the expression evaluates to an Array
.
For example, if there were a declaration Array[Int] ints = [1,2,3], the command python script.py ${sep=',' ints} would yield the command line:
python script.py 1,2,3
Alternatively, if the command were python script.py ${sep=' ' ints} it would parse to:
python script.py 1 2 3
Additional Requirements:
- sep MUST accept only a string as its value
'true' and 'false' are only used for type Boolean and they specify what the parameter returns when the Boolean is true or false, respectively.
For example, ${true='--enable-foo', false='--disable-foo' Boolean yes_or_no}
would evaluate to either --enable-foo
or --disable-foo
based on the value of yes_or_no.
If either value is left out, then it's equivalent to specifying the empty string. If the parameter is ${true='--enable-foo' Boolean yes_or_no}
, and a value of false is specified for this parameter, then the parameter will evaluate to the empty string.
Additional Requirement: true and false values MUST be strings. true and false are only allowed if the type is Boolean.
This specifies the default value if no other value is specified for this parameter.
task default_test {
String? s
command {
./my_cmd ${default="foobar" s}
}
}
This task takes an optional String
parameter and if a value is not specified, then the value of foobar
will be used instead.
Additional Requirements:
- The type of the expression must match the type of the parameter
- If 'default' is specified, the $type_postfix_quantifier for the variable's type MUST be ?
Sometimes a command is long enough, or uses enough { characters, that a different set of delimiters would make it clearer. In this case, enclose the command in <<< ... >>>, as follows:
task heredoc {
File in
command<<<
python <<CODE
with open("${in}") as fp:
for line in fp:
if not line.startswith('#'):
print(line.strip())
CODE
>>>
}
Parsing of this command should be the same as the prior section describes.
Any text inside of the command section, after being instantiated, should have all common leading whitespace removed. In the task heredoc
example in the previous section, if the user specifies a value of /path/to/file
as the value for File in
, then the command should be:
python <<CODE
with open("/path/to/file") as fp:
for line in fp:
if not line.startswith('#'):
print(line.strip())
CODE
The two spaces of leading whitespace that were common to each line were removed.
If the user mixes tabs and spaces, the behavior is undefined. A warning is suggested, along with a convention of treating a tab as 4 spaces; other implementations might instead return an error in this case.
The outputs section defines which of the files and values should be exported after a successful run of this tool.
$task_output = 'output' $ws* '{' ($ws* $task_output_kv $ws*)* '}'
$task_output_kv = $type $identifier $ws* '=' $ws* $string
The outputs section contains typed variable definitions and a binding to the variable that they export.
The left-hand side of the equality defines the type and name of the output.
The right-hand side is an expression, often a file path or a call to a standard-library read_* function, that produces the output's value.
For example, if a task's output section looks like this:
output {
Int threshold = read_int("threshold.txt")
}
Then the task is expecting a file called "threshold.txt" in the current working directory where the task was executed. Inside of that file must be one line that contains only an integer and whitespace. See the Data Types & Serialization section for more details.
The filename strings may also contain variable references themselves (see the String Interpolation section below for more details):
output {
Array[String] quality_scores = read_lines("${sample_id}.scores.txt")
}
If this is the case, then sample_id
is considered an input to the task.
As with inputs, the outputs can reference previous outputs in the same block. The only requirement is that the output being referenced must be specified before the output which uses it.
output {
String a = "a"
String ab = a + "b"
}
Globs can be used to define outputs which contain many files. The glob function generates an array of File outputs:
output {
Array[File] output_bams = glob("*.bam")
}
Within tasks, any string literal can use string interpolation to access the value of any of the task's inputs. The most obvious example of this is being able to define an output file which is named as a function of its input. For example:
task example {
String prefix
File bam
command {
python analysis.py --prefix=${prefix} ${bam}
}
output {
File analyzed = "${prefix}.out"
File bam_sibling = "${bam}.suffix"
}
}
Any ${identifier}
inside of a string literal must be replaced with the value of the identifier. If prefix were specified as foobar
, then "${prefix}.out"
would be evaluated to "foobar.out"
.
$runtime = 'runtime' $ws* '{' ($ws* $runtime_kv $ws*)* '}'
$runtime_kv = $identifier $ws* '=' $ws* $expression
The runtime section defines key/value pairs for runtime information needed for this task. Individual backends will define which keys they will inspect so a key/value pair may or may not actually be honored depending on how the task is run.
Values can be any expression and it is up to the engine to reject keys and/or values that do not make sense in that context. For example, consider the following WDL:
task test {
command {
python script.py
}
runtime {
docker: ["ubuntu:latest", "broadinstitute/scala-baseimage"]
}
}
The value for the docker
runtime attribute in this case is an array of values. The parser should accept this. Some engines might interpret it as an "either this image or that image" or could reject it outright.
Since values are expressions, they can also reference variables in the task:
task test {
String ubuntu_version
command {
python script.py
}
runtime {
docker: "ubuntu:" + ubuntu_version
}
}
Most key/value pairs are arbitrary. However, the following keys have recommended conventions:
docker: Location of a Docker image in which this task ought to be run. This can have a format like ubuntu:latest or broadinstitute/scala-baseimage, in which case it should be interpreted as an image on DockerHub (i.e. it is valid to use in a docker pull command).
task docker_test {
String arg
command {
python process.py ${arg}
}
runtime {
docker: "ubuntu:latest"
}
}
memory: Memory requirements for this task. Two kinds of values are supported for this attribute:
- Int - Interpreted as bytes
- String - This should be a decimal value with suffixes like B, KB, MB or binary suffixes KiB, MiB. For example: 6.2 GB, 5MB, 2GiB.
task memory_test {
String arg
command {
python process.py ${arg}
}
runtime {
memory: "2GB"
}
}
$parameter_meta = 'parameter_meta' $ws* '{' ($ws* $parameter_meta_kv $ws*)* '}'
$parameter_meta_kv = $identifier $ws* '=' $ws* $string
This purely optional section contains key/value pairs where the keys are names of parameters and the values are string descriptions for those parameters.
Additional requirement: Any key in this section MUST correspond to a parameter in the command line
$meta = 'meta' $ws* '{' ($ws* $meta_kv $ws*)* '}'
$meta_kv = $identifier $ws* '=' $ws* $string
This purely optional section contains key/value pairs for any additional meta data that should be stored with the task. For example, perhaps author or contact email.
task hello_world {
command {echo hello world}
}
task one_and_one {
String pattern
File infile
command {
grep ${pattern} ${infile}
}
output {
File filtered = stdout()
}
}
task runtime_meta {
String memory_mb
String sample_id
String param
command {
java -Xmx${memory_mb}M -jar task.jar -id ${sample_id} -param ${param} -out ${sample_id}.out
}
output {
File results = "${sample_id}.out"
}
runtime {
docker: "broadinstitute/baseimg"
}
parameter_meta {
memory_mb: "Amount of memory to allocate to the JVM"
param: "Some arbitrary parameter"
sample_id: "The ID of the sample in format foo_bar_baz"
}
meta {
author: "Joe Somebody"
email: "[email protected]"
}
}
task bwa_mem_tool {
Int threads
Int min_seed_length
Int min_std_max_min
File reference
File reads
command {
bwa mem -t ${threads} \
-k ${min_seed_length} \
-I ${sep=',' min_std_max_min+} \
${reference} \
${sep=' ' reads+} > output.sam
}
output {
File sam = "output.sam"
}
runtime {
docker: "broadinstitute/baseimg"
}
}
A notable piece in this example is ${sep=',' min_std_max_min+}, which specifies that min_std_max_min can be one or more integers (the + after the variable name indicates that it can be one or more). If an Array[Int] is passed into this parameter, then it's flattened by combining the elements with the separator character (sep=',').
This task also defines that it exports one file, called 'sam', which is the stdout of the execution of bwa mem.
The 'docker' portion of this task definition specifies that this task must only be run on the specified Docker image.
task wc2_tool {
File file1
command {
wc ${file1}
}
output {
Int count = read_int(stdout())
}
}
workflow count_lines4_wf {
Array[File] files
scatter(f in files) {
call wc2_tool {
input: file1=f
}
}
output {
wc2_tool.count
}
}
In this example, it's all pretty boilerplate, declarative code, except for some language-like features, like firstline(stdout) and append(list_of_count, wc2_tool.count). These both can be implemented fairly easily if we allow for custom function definitions. Parsing them is no problem. Implementation would be fairly simple and new functions would not be hard to add. Alternatively, this could be something like JavaScript or Python snippets that we run.
The next task example should produce a command line like this:
tmap mapall \
stage1 map1 --min-seq-length 20 \
map2 --min-seq-length 20 \
stage2 map1 --max-seq-length 20 --min-seq-length 10 --seed-length 16 \
map2 --max-seed-hits -1 --max-seq-length 20 --min-seq-length 10
Task definition would look like this:
task tmap_tool {
Array[String] stages
File reads
command {
tmap mapall ${sep=' ' stages} < ${reads} > output.sam
}
output {
File sam = "output.sam"
}
}
For this particular case, where the command line is itself a mini DSL, the best option is to allow the user to type in the rest of the command line, which is what ${sep=' ' stages} is for. This allows the user to specify an array of strings as the value for stages, which are then concatenated together with a space character.
Variable | Value |
---|---|
reads | /path/to/fastq |
stages | ["stage1 map1 --min-seq-length 20 map2 --min-seq-length 20", "stage2 map1 --max-seq-length 20 --min-seq-length 10 --seed-length 16 map2 --max-seed-hits -1 --max-seq-length 20 --min-seq-length 10"] |
$workflow = 'workflow' $ws* '{' $ws* $workflow_element* $ws* '}'
$workflow_element = $call | $loop | $conditional | $declaration | $scatter | $parameter_meta | $meta
A workflow is defined as the keyword workflow
and the body being in curly braces.
An example of a workflow that runs one task (not defined here) would be:
workflow wf {
Array[File] files
Int threshold
Map[String, String] my_map
call analysis_job {
input: search_paths=files, threshold=threshold, gender_lookup=my_map
}
}
$call = 'call' $ws* $namespaced_identifier $ws+ ('as' $identifier)? $ws* $call_body?
$call_body = '{' $ws* $inputs? $ws* '}'
$inputs = 'input' $ws* ':' $ws* $variable_mappings
$variable_mappings = $variable_mapping_kv (',' $variable_mapping_kv)*
$variable_mapping_kv = $identifier $ws* '=' $ws* $expression
A workflow may call other tasks/workflows via the call
keyword. The $namespaced_identifier
is the reference to which task to run. Most commonly, it's simply the name of a task (see examples below), but it can also use .
as a namespace resolver.
See the section on Fully Qualified Names & Namespaced Identifiers for details about how the $namespaced_identifier
ought to be interpreted
All call
statements must be uniquely identifiable. By default, the call's unique identifier is the task name (e.g. call foo
would be referenced by name foo
). However, if one were to call foo
twice in a workflow, each subsequent call
statement will need to alias itself to a unique name using the as
clause: call foo as bar
.
A call
statement may reference a workflow too (e.g. call other_workflow
). In this case, the $inputs
section specifies a subset of the workflow's inputs and must specify fully qualified names.
import "lib.wdl" as lib
workflow wf {
call my_task
call my_task as my_task_alias
call my_task as my_task_alias2 {
input: threshold=2
}
call lib.other_task
}
The $call_body is optional and is meant to specify how to satisfy a subset of the task or workflow's input parameters, as well as a way to map task outputs to variables defined in the visible scopes.
A $variable_mapping
in the $inputs
section maps parameters in the task to expressions. These expressions usually reference outputs of other tasks, but they can be arbitrary expressions.
As an example, here is a workflow in which the second task requires an output from the first task:
task task1 {
command {
python do_stuff.py
}
output {
File results = stdout()
}
}
task task2 {
File foobar
command {
python do_stuff2.py ${foobar}
}
output {
File results = stdout()
}
}
workflow wf {
call task1
call task2 {
input: foobar=task1.results
}
}
Workflows can also be called inside of workflows.
main.wdl
import "sub_wdl.wdl" as sub
workflow main_workflow {
call sub.wf_hello { input: wf_hello_input = "sub world" }
output {
String main_output = wf_hello.salutation
}
}
sub_wdl.wdl
task hello {
String addressee
command {
echo "Hello ${addressee}!"
}
runtime {
docker: "ubuntu:latest"
}
output {
String salutation = read_string(stdout())
}
}
workflow wf_hello {
String wf_hello_input
call hello {input: addressee = wf_hello_input }
output {
String salutation = hello.salutation
}
}
Note that because a WDL file can only contain one workflow, sub-workflows can only be used through imports. Otherwise, calling a workflow is syntactically equivalent to calling a task: inputs are specified and outputs retrieved the same way as they are for task calls.
$scatter = 'scatter' $ws* '(' $ws* $scatter_iteration_statement $ws* ')' $ws* $scatter_body
$scatter_iteration_statement = $identifier $ws* 'in' $ws* $expression
$scatter_body = '{' $ws* $workflow_element* $ws* '}'
A "scatter" clause defines that everything in the body ($scatter_body
) can be run in parallel. The clause in parentheses ($scatter_iteration_statement
) declares which collection to scatter over and what to call each element.
The $scatter_iteration_statement
has two parts: the "item" and the "collection". For example, scatter(x in y)
would define x
as the item, and y
as the collection. The item is always an identifier, while the collection is an expression that MUST evaluate to an Array
type. The item will represent each item in that expression. For example, if y
evaluated to an Array[String]
then x
would be a String
.
The $scatter_body
defines a set of scopes that will execute in the context of this scatter block.
For example, if $expression is an array of integers of size 3, then the body of the scatter clause can be executed 3 times in parallel. $identifier would refer to each integer in the array.
scatter(i in integers) {
call task1{input: num=i}
call task2{input: num=task1.output}
}
In this example, task2 depends on task1. Variable i has an implicit index attribute to make sure we can access the right output from task1. Since both task1 and task2 run N times, where N is the length of the array integers, any scalar outputs of these tasks are now arrays.
🐖 Coming soon in Cromwell
$loop = 'while' '(' $expression ')' '{' $workflow_element* '}'
Loops are distinct from scatter clauses because the body of a while loop needs to be executed to completion before another iteration is started. The $expression condition is evaluated only when the iteration count is zero or when all $workflow_elements in the body have completed successfully for the current iteration.
🐖 Available in Cromwell version 24 and higher
$conditional = 'if' '(' $expression ')' '{' $workflow_element* '}'
Conditionals only execute the body if the expression evaluates to true.
- When a call's output is referenced outside its containing if block, it will need to be handled as an optional type. E.g.
workflow foo {
# Call 'x', producing a Boolean output:
call x
Boolean x_out = x.out
# Call 'y', producing an Int output, in a conditional block:
if (x_out) {
call y
Int y_out = y.out
}
# Outside the if block, we have to handle this output as optional:
Int? y_out_maybe = y.out
# Call 'z' which takes an optional Int input:
call z { input: optional_int = y_out_maybe }
}
- Optional types can be coalesced by using the
select_all
andselect_first
array functions:
workflow foo {
Array[Int] scatter_range = [1, 2, 3, 4, 5]
scatter (i in scatter_range) {
call x { input: i = i }
if (x.validOutput) {
Int x_out = x.out
}
}
# Because it was declared inside the scatter and the if-block, the type of x_out is different here:
Array[Int?] x_out_maybes = x_out
# We can select only the valid elements with select_all:
Array[Int] x_out_valids = select_all(x_out_maybes)
# Or we can select the first valid element:
Int x_out_first = select_first(x_out_maybes)
}
$wf_parameter_meta = 'parameter_meta' $ws* '{' ($ws* $wf_parameter_meta_kv $ws*)* '}'
$wf_parameter_meta_kv = $identifier $ws* '=' $ws* $string
This purely optional section contains key/value pairs where the keys are names of parameters and the values are string descriptions for those parameters.
Additional requirement: Any key in this section MUST correspond to a workflow input
As an example:
parameter_meta {
memory_mb: "Amount of memory to allocate to the JVM"
param: "Some arbitrary parameter"
sample_id: "The ID of the sample in format foo_bar_baz"
}
$wf_meta = 'meta' $ws* '{' ($ws* $wf_meta_kv $ws*)* '}'
$wf_meta_kv = $identifier $ws* '=' $ws* $string
This purely optional section contains key/value pairs for any additional meta data that should be stored with the workflow. For example, perhaps author or contact email.
As an example:
meta {
author: "Joe Somebody"
email: "[email protected]"
}
Each workflow definition can specify an optional output section. This section lists outputs from individual calls that you also want to expose as outputs of the workflow itself.
If the output {...}
section is omitted, then the workflow includes all outputs from all calls in its final output.
Workflow outputs follow the same syntax rules as task outputs.
They can reference call outputs, workflow inputs and previous workflow outputs.
e.g:
task t {
command {
# do something
}
output {
String out = "out"
}
}
workflow w {
String w_input = "some input"
call t
call t as u
output {
String t_out = t.out
String u_out = u.out
String input_as_output = w_input
String previous_output = u_out
}
}
Note that they can't reference call inputs. However, this can be achieved by declaring the desired call input as an output. Expressions are allowed.
When declaring a workflow output that points to a call inside a scatter, the aggregated call is used. e.g:
task t {
command {
# do something
}
output {
String out = "out"
}
}
workflow w {
Array[Int] arr = [1, 2]
scatter(i in arr) {
call t
}
output {
Array[String] t_out = t.out
}
}
t_out
has an Array[String]
result type, because call t
is inside a scatter.
THE FOLLOWING SYNTAX IS DEPRECATED BUT IS STILL SUPPORTED TO MAINTAIN BACKWARD COMPATIBILITY
$workflow_output = 'output' '{' ($workflow_output_fqn)* '}'
$workflow_output_fqn = $fully_qualified_name '.*'?
Replacing call output names with a *
acts as a match-all wildcard.
The output names in this section must be qualified with the call which created them, as in the example below.
task task1 {
command { ./script }
output { File results = stdout() }
}
task task2 {
command { ./script2 }
output {
File results = stdout()
String value = read_string("some_file")
}
}
workflow wf {
call task1
call task2 as altname
output {
task1.*
altname.value
}
}
In this example, the fully-qualified names that would be exposed as workflow outputs would be wf.task1.results and wf.altname.value.
Import statements can be used to pull in tasks/workflows from other locations as well as to create namespaces. In the simplest case, an import statement adds the tasks/workflows that are imported into the specified namespace. For example:
tasks.wdl
task x {
command { python script.py }
}
task y {
command { python script2.py }
}
workflow.wdl
import "tasks.wdl" as pyTasks
workflow wf {
call pyTasks.x
call pyTasks.y
}
Tasks x
and y
are inside the namespace pyTasks
, which is different from the wf
namespace belonging to the primary workflow. However, if no namespace is specified for tasks.wdl:
workflow.wdl
import "tasks.wdl"
workflow wf {
call tasks.x
call tasks.y
}
Now everything inside of tasks.wdl
must be accessed through the default namespace tasks
.
Each namespace may contain namespaces, tasks, and at most one workflow. The names of the contained namespaces, tasks, and workflow need to be unique within that namespace. For example, one cannot import two workflows while they have the same namespace identifier. Additionally, a workflow and a namespace both named foo
cannot exist inside a common namespace. Similarly there cannot be a task foo
in a workflow also named foo
.
However, you can import two workflows with different namespace identifiers that have identically named tasks. For example, you can import namespaces foo
and bar
, both of which contain a task baz
, and you can call foo.baz
and bar.baz
from the same primary workflow.
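A minimal sketch of that situation, assuming hypothetical files foo.wdl and bar.wdl that each define a task named baz:
import "foo.wdl" as foo
import "bar.wdl" as bar
workflow wf {
call foo.baz
call bar.baz as baz2
}
The second call is aliased because both calls would otherwise default to the call name baz, and call names must be unique within a workflow.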
Scopes are defined as:
- workflow {...} blocks
- call blocks
- while(expr) {...} blocks
- if(expr) {...} blocks
- scatter(x in y) {...} blocks
Inside of any scope, variables may be declared. The variables declared in that scope are visible to any sub-scope, recursively. For example:
task my_task {
Int x
File f
command {
my_cmd --integer=${var} ${f}
}
}
workflow wf {
Array[File] files
Int x = 2
scatter(file in files) {
Int x = 3
call my_task {
Int x = 4
input: var=x, f=file
}
}
}
my_task
will use x=4
to set the value for var
in its command line. However, my_task
also needs a value for x
which is defined at the task level. Since my_task
has two inputs (x
and var
), and only one of those is set in the call my_task
declaration, the value for my_task.x
still needs to be provided by the user when the workflow is run.
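Concretely (a sketch, assuming files contains the single path /path/a and the user supplies a value for my_task.x in the workflow inputs), the instantiated command for that scatter element would be:
my_cmd --integer=4 /path/a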
Types can be optionally suffixed with a ?
or +
in certain cases.
- ? means that the parameter is optional. A user does not need to specify a value for the parameter in order to satisfy all the inputs to the workflow.
- + applies only to Array types and it represents a constraint that the Array value must contain one or more elements.
task test {
Array[File] a
Array[File]+ b
Array[File]? c
#File+ d <-- can't do this, + only applies to Arrays
command {
/bin/mycmd ${sep=" " a}
/bin/mycmd ${sep="," b}
/bin/mycmd ${write_lines(c)}
}
}
workflow wf {
call test
}
If you provided these values for inputs:
var | value |
---|---|
wf.test.a | ["1", "2", "3"] |
wf.test.b | [] |
The workflow engine should reject this because wf.test.b
is required to have at least one element. If we change it to:
var | value |
---|---|
wf.test.a | ["1", "2", "3"] |
wf.test.b | ["x"] |
This would be valid input because wf.test.c
is not required. Given these values, the command would be instantiated as:
/bin/mycmd 1 2 3
/bin/mycmd x
/bin/mycmd
If our inputs were:
var | value |
---|---|
wf.test.a | ["1", "2", "3"] |
wf.test.b | ["x","y"] |
wf.test.c | ["a","b","c","d"] |
Then the command would be instantiated as:
/bin/mycmd 1 2 3
/bin/mycmd x,y
/bin/mycmd /path/to/c.txt
Sometimes, optional parameters need a string prefix. Consider this task:
task test {
String? val
command {
python script.py --val=${val}
}
}
Since val
is optional, this command line can be instantiated in two ways:
python script.py --val=foobar
Or
python script.py --val=
The latter case is very likely an error case, and this --val=
part should be left off if a value for val
is omitted. To solve this problem, modify the expression inside the template tag as follows:
python script.py ${"--val=" + val}
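Putting that expression back into the task, a minimal sketch of the corrected version looks like this; because an expression that references a missing optional value evaluates to the empty string, the --val= prefix disappears along with the value when val is unset:
task test {
String? val
command {
python script.py ${"--val=" + val}
}
}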
The scatter
block is meant to parallelize a series of identical tasks but give them slightly different inputs. The simplest example is:
task inc {
Int i
command <<<
python -c "print(${i} + 1)"
>>>
output {
Int incremented = read_int(stdout())
}
}
workflow wf {
Array[Int] integers = [1,2,3,4,5]
scatter(i in integers) {
call inc{input: i=i}
}
}
Running this workflow (which needs no inputs) would yield a value of [2,3,4,5,6] for wf.inc.incremented. While task inc itself returns an Int, when it is called inside a scatter block, that output becomes an Array[Int].
Any task that's downstream from the call to inc
and outside the scatter block must accept an Array[Int]
:
task inc {
Int i
command <<<
python -c "print(${i} + 1)"
>>>
output {
Int incremented = read_int(stdout())
}
}
task sum {
Array[Int] ints
command <<<
python -c "print(${sep="+" ints})"
>>>
output {
Int sum = read_int(stdout())
}
}
workflow wf {
Array[Int] integers = [1,2,3,4,5]
scatter (i in integers) {
call inc {input: i=i}
}
call sum {input: ints = inc.incremented}
}
This workflow will output a value of 20
for wf.sum.sum
. This works because call inc
will output an Array[Int]
because it is in the scatter block.
However, from inside the scope of the scatter block, the output of call inc
is still an Int
. So the following is valid:
workflow wf {
Array[Int] integers = [1,2,3,4,5]
scatter(i in integers) {
call inc {input: i=i}
call inc as inc2 {input: i=inc.incremented}
}
call sum {input: ints = inc2.incremented}
}
In this example, inc and inc2 are called in serial, where the output of one is fed to the other. inc2 would output the array [3,4,5,6,7].
Inside of expressions, variables are resolved differently depending on whether the expression is in a task declaration or a workflow declaration.
Inside a task, resolution is trivial: The variable referenced MUST be a declaration of the task. For example:
task my_task {
Array[String] strings
command {
python analyze.py --strings-file=${write_lines(strings)}
}
}
Inside of this task, there exists only one expression: write_lines(strings). When the expression evaluator tries to resolve strings, it must find it as a declaration of the task (in this case it does).
In a workflow, resolution works by traversing the scope hierarchy starting from the expression that references the variable.
workflow wf {
String s = "wf_s"
String t = "t"
call my_task {
String s = "my_task_s"
input: in0 = s+"-suffix", in1 = t+"-suffix"
}
}
In this example, there are two expressions: s+"-suffix"
and t+"-suffix"
. s
is resolved as "my_task_s"
and t
is resolved as "t"
.
Both tasks and workflows have typed inputs that must be satisfied in order to run. The following sections describe how to compute inputs for task and workflow declarations.
Tasks define all their inputs as declarations at the top of the task definition.
task test {
String s
Int i
Float f
command {
./script.sh -i ${i} -f ${f}
}
}
In this example, s, i, and f are inputs to this task, even though the command line does not reference ${s}. Implementations of WDL engines may display a warning or report an error in this case, since s isn't used.
Workflows have declarations, like tasks, but a workflow must also account for all calls to sub-tasks when determining inputs.
Workflows also return their inputs as fully qualified names. Tasks only return the names of the variables as inputs (as they're guaranteed to be unique within a task). However, since workflows can call the same task twice, names might collide. The general algorithm for computing inputs goes something like this:
- Take all inputs to all call statements in the workflow
- Subtract out all inputs that are satisfied through the input: section
- Add in all declarations which don't have a static value defined
Consider the following workflow:
task t1 {
String s
Int x
command {
./script --action=${s} -x${x}
}
output {
Int count = read_int(stdout())
}
}
task t2 {
String s
Int t
Int x
command {
./script2 --action=${s} -x${x} --other=${t}
}
output {
Int count = read_int(stdout())
}
}
task t3 {
Int y
File ref_file # Do nothing with this
command {
python -c "print(${y} + 1)"
}
output {
Int incr = read_int(stdout())
}
}
workflow wf {
Int int_val
Int int_val2 = 10
Array[Int] my_ints
File ref_file
call t1 {
input: x=int_val
}
call t2 {
input: x=int_val, t=t1.count
}
scatter(i in my_ints) {
call t3 {
input: y=i, ref_file=ref_file
}
}
}
The inputs to wf
would be:
- wf.t1.s as a String
- wf.t2.s as a String
- wf.int_val as an Int
- wf.my_ints as an Array[Int]
- wf.ref_file as a File
Once workflow inputs are computed (see previous section), the value for each of the fully-qualified names needs to be specified per invocation of the workflow. Workflow inputs are specified in JSON or YAML format. In JSON, the inputs to the workflow in the previous section can be:
{
"wf.t1.s": "some_string",
"wf.t2.s": "some_string",
"wf.int_val": 3,
"wf.my_ints": [5,6,7,8],
"wf.ref_file": "/path/to/file.txt"
}
It's important to note that the type in JSON must be coercable to the WDL type. For example wf.int_val
expects an integer, but if we specified it in JSON as "wf.int_val": "3"
, this coercion from string to integer is not valid and would result in a type error. See the section on Type Coercion for more details.
WDL values can be created from either JSON values or from native language values. The below table references String-like, Integer-like, etc to refer to values in a particular programming language. For example, "String-like" could mean a java.io.String
in the Java context or a str
in Python. An "Array-like" could refer to a Seq
in Scala or a list
in Python.
WDL Type | Can Accept | Notes / Constraints |
---|---|---|
String | JSON String | |
String | String-like | |
String | String | Identity coercion |
String | File | |
File | JSON String | Interpreted as a file path |
File | String-like | Interpreted as file path |
File | String | Interpreted as file path |
File | File | Identity coercion |
Int | JSON Number | Use floor of the value for non-integers |
Int | Integer-like | |
Int | Int | Identity coercion |
Float | JSON Number | |
Float | Float-like | |
Float | Float | Identity coercion |
Boolean | JSON Boolean | |
Boolean | Boolean-like | |
Boolean | Boolean | Identity coercion |
Array[T] | JSON Array | Elements must be coercable to T |
Array[T] | Array-like | Elements must be coercable to T |
Map[K, V] | JSON Object | Keys and values must be coercable to K and V, respectively |
Map[K, V] | Map-like | Keys and values must be coercable to K and V, respectively |
Returns a File
reference to the stdout that this task generated.
Returns a File
reference to the stderr that this task generated.
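A minimal sketch using both functions (the task, command, and output names are arbitrary):
task count_matches {
String pattern
File f
command {
grep -c '${pattern}' ${f}
}
output {
Int n = read_int(stdout())
File errors = stderr()
}
}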
Given a file-like object (String
, File
) as a parameter, this will read each line as a string and return an Array[String]
representation of the lines in the file.
The order of the lines in the returned Array[String]
must be the order in which the lines appear in the file-like object.
This task would grep
through a file and return all strings that matched the pattern:
task do_stuff {
String pattern
File file
command {
grep '${pattern}' ${file}
}
output {
Array[String] matches = read_lines(stdout())
}
}
The read_tsv()
function takes one parameter, which is a file-like object (String
, File
) and returns an Array[Array[String]]
representing the table from the TSV file.
If the parameter is a String
, this is assumed to be a local file path relative to the current working directory of the task.
For example, if I write a task that outputs a file to ./results/file_list.tsv
, and my task is defined as:
task do_stuff {
File file
command {
python do_stuff.py ${file}
}
output {
Array[Array[String]] output_table = read_tsv("./results/file_list.tsv")
}
}
Then when the task finishes, to fulfill the output_table variable, ./results/file_list.tsv must be a valid TSV file or an error will be reported.
Given a file-like object (String
, File
) as a parameter, this will read each line from a file and expect the line to have the format col1\tcol2
. In other words, the file-like object must be a two-column TSV file.
The following task would write a two-column TSV to standard out and that would be interpreted as a Map[String, String]
:
task do_stuff {
String flags
File file
command {
./script --flags=${flags} ${file}
}
output {
Map[String, String] mapping = read_map(stdout())
}
}
Given a file-like object that contains a 2-row and n-column TSV file, this function will turn that into an Object.
task test {
command <<<
python <<CODE
print('\t'.join(["key_{}".format(i) for i in range(1, 4)]))
print('\t'.join(["value_{}".format(i) for i in range(1, 4)]))
CODE
>>>
output {
Object my_obj = read_object(stdout())
}
}
The command will output to stdout the following:
key_1\tkey_2\tkey_3
value_1\tvalue_2\tvalue_3
Which would be turned into an Object
in WDL that would look like this:
Attribute | Value |
---|---|
key_1 | "value_1" |
key_2 | "value_2" |
key_3 | "value_3" |
Given a file-like object that contains a TSV file with one header row and one or more data rows, this function will turn that into an Array[Object], one Object per data row.
task test {
command <<<
python <<CODE
print('\t'.join(["key_{}".format(i) for i in range(1, 4)]))
print('\t'.join(["value_{}".format(i) for i in range(1, 4)]))
print('\t'.join(["value_{}".format(i) for i in range(1, 4)]))
print('\t'.join(["value_{}".format(i) for i in range(1, 4)]))
CODE
>>>
output {
Array[Object] my_obj = read_objects(stdout())
}
}
The command will output to stdout the following:
key_1\tkey_2\tkey_3
value_1\tvalue_2\tvalue_3
value_1\tvalue_2\tvalue_3
value_1\tvalue_2\tvalue_3
Which would be turned into an Array[Object]
in WDL that would look like this:
Index | Attribute | Value |
---|---|---|
0 | key_1 | "value_1" |
key_2 | "value_2" | |
key_3 | "value_3" | |
1 | key_1 | "value_1" |
key_2 | "value_2" | |
key_3 | "value_3" | |
2 | key_1 | "value_1" |
key_2 | "value_2" | |
key_3 | "value_3" |
🐖 Coming soon in Cromwell
The read_json()
function takes one parameter, which is a file-like object (String
, File
) and returns a data type which matches the data structure in the JSON file. The mapping of JSON type to WDL type is:
JSON Type | WDL Type |
---|---|
object | Map[String, ?] |
array | Array[?] |
number | Int or Float |
string | String |
boolean | Boolean |
null | ??? |
If the parameter is a String
, this is assumed to be a local file path relative to the current working directory of the task.
For example, if I write a task that outputs a file to ./results/file_list.json
, and my task is defined as:
task do_stuff {
File file
command {
python do_stuff.py ${file}
}
output {
Map[String, String] output_table = read_json("./results/file_list.json")
}
}
Then when the task finishes, to fulfill the output_table variable, ./results/file_list.json must be a valid JSON file or an error will be reported.
The read_int()
function takes a file path which is expected to contain 1 line with 1 integer on it. This function returns that integer.
The read_string()
function takes a file path which is expected to contain 1 line with 1 string on it. This function returns that string.
No trailing newline characters should be included
The read_float()
function takes a file path which is expected to contain 1 line with 1 floating point number on it. This function returns that float.
The read_boolean()
function takes a file path which is expected to contain 1 line with 1 Boolean value (either "true" or "false" on it). This function returns that Boolean value.
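A sketch showing all four functions reading single-line files produced by the command (the file names are arbitrary):
task emit_primitives {
command {
echo 42 > int_file
echo hello > string_file
echo 3.14 > float_file
echo true > boolean_file
}
output {
Int i = read_int("int_file")
String s = read_string("string_file")
Float f = read_float("float_file")
Boolean b = read_boolean("boolean_file")
}
}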
Given something that's compatible with Array[String], this writes each element to its own line in a file, with newline (\n) characters as line separators.
task example {
Array[String] array = ["first", "second", "third"]
command {
./script --file-list=${write_lines(array)}
}
}
If this task were run, the command might look like:
./script --file-list=/local/fs/tmp/array.txt
And /local/fs/tmp/array.txt
would contain:
first
second
third
Given something that's compatible with Array[Array[String]]
, this writes a TSV file of the data structure.
task example {
Array[Array[String]] array = [["one", "two", "three"], ["un", "deux", "trois"]]
command {
./script --tsv=${write_tsv(array)}
}
}
If this task were run, the command might look like:
./script --tsv=/local/fs/tmp/array.tsv
And /local/fs/tmp/array.tsv
would contain:
one\ttwo\tthree
un\tdeux\ttrois
Given something that's compatible with Map[String, String]
, this writes a TSV file of the data structure.
task example {
Map[String, String] map = {"key1": "value1", "key2": "value2"}
command {
./script --map=${write_map(map)}
}
}
If this task were run, the command might look like:
./script --map=/local/fs/tmp/map.tsv
And /local/fs/tmp/map.tsv
would contain:
key1\tvalue1
key2\tvalue2
Given any Object
, this will write out a 2-row, n-column TSV file with the object's attributes and values.
task test {
Object input
command <<<
/bin/do_work --obj=${write_object(input)}
>>>
output {
File results = stdout()
}
}
if input
were to have the value:
Attribute | Value |
---|---|
key_1 | "value_1" |
key_2 | "value_2" |
key_3 | "value_3" |
The command would instantiate to:
/bin/do_work --obj=/path/to/input.tsv
Where /path/to/input.tsv
would contain:
key_1\tkey_2\tkey_3
value_1\tvalue_2\tvalue_3
Given any Array[Object]
, this will write out a 2+ row, n-column TSV file with each object's attributes and values.
task test {
Array[Object] in
command <<<
/bin/do_work --obj=${write_objects(in)}
>>>
output {
File results = stdout()
}
}
if in
were to have the value:
Index | Attribute | Value |
---|---|---|
0 | key_1 | "value_1" |
key_2 | "value_2" | |
key_3 | "value_3" | |
1 | key_1 | "value_4" |
key_2 | "value_5" | |
key_3 | "value_6" | |
2 | key_1 | "value_7" |
key_2 | "value_8" | |
key_3 | "value_9" |
The command would instantiate to:
/bin/do_work --obj=/path/to/input.tsv
Where /path/to/input.tsv
would contain:
key_1\tkey_2\tkey_3
value_1\tvalue_2\tvalue_3
value_4\tvalue_5\tvalue_6
value_7\tvalue_8\tvalue_9
🐖 Coming soon in Cromwell
Given something with any type, this writes the JSON equivalent to a file. See the table in the definition of read_json().
task example {
Map[String, String] map = {"key1": "value1", "key2": "value2"}
command {
./script --map=${write_json(map)}
}
}
If this task were run, the command might look like:
./script --map=/local/fs/tmp/map.json
And /local/fs/tmp/map.json
would contain:
{
"key1": "value1"
"key2": "value2"
}
Given a File
and a String
(optional), returns the size of the file in Bytes or in the unit specified by the second argument.
task example {
File input_file
command {
echo "this file is 22 bytes" > created_file
}
output {
Float input_file_size = size(input_file)
Float created_file_size = size("created_file") # 22.0
Float created_file_size_in_KB = size("created_file", "K") # 0.022
}
}
Supported units are KiloByte ("K", "KB"), MegaByte ("M", "MB"), GigaByte ("G", "GB"), and TeraByte ("T", "TB"), as well as their binary versions "Ki" ("KiB"), "Mi" ("MiB"), "Gi" ("GiB"), and "Ti" ("TiB"). The default unit is Bytes ("B").
Given 3 String parameters input
, pattern
, replace
, this function will replace any occurrence matching pattern
in input
by replace
.
pattern
is expected to be a regular expression. Details of regex evaluation will depend on the execution engine running the WDL.
Example 1:
String chocolike = "I like chocolate when it's late"
String chocolove = sub(chocolike, "like", "love") # I love chocolate when it's late
String chocoearly = sub(chocolike, "late", "early") # I like chocoearly when it's early
String chocolate = sub(chocolike, "late$", "early") # I like chocolate when it's early
The sub function will also accept input
and replace
parameters that can be coerced to a String (e.g. File). This can be useful, for example, to swap the extension of a filename.
Example 2:
task example {
File input_file = "my_input_file.bam"
String output_file_name = sub(input_file, "\\.bam$", ".index") # my_input_file.index
command {
echo "I want an index instead" > ${output_file_name}
}
output {
File outputFile = output_file_name
}
}
Given an integer argument, the range
function creates an array of integers of length equal to the given argument. For example range(3)
provides the array: (0, 1, 2)
.
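A common use of range is to drive a scatter over a fixed number of shards. A minimal sketch (the task name report_shard and its input are hypothetical):

task report_shard {
  Int shard_index
  command {
    echo "processing shard ${shard_index}"
  }
  output {
    String message = read_string(stdout())
  }
}

workflow sharded {
  Int num_shards = 3
  # range(3) yields (0, 1, 2), so report_shard runs once per shard index
  scatter (i in range(num_shards)) {
    call report_shard {input: shard_index=i}
  }
}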
Given a two dimensional array argument, the transpose
function transposes the two dimensional array according to the standard matrix transpose rules. For example transpose( ((0, 1, 2), (3, 4, 5)) )
will return the rotated two-dimensional array: ((0, 3), (1, 4), (2, 5))
.
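Expressed as WDL declarations, the same example looks like:

Array[Array[Int]] input_array = [[0, 1, 2], [3, 4, 5]]
Array[Array[Int]] rotated = transpose(input_array) # [[0, 3], [1, 4], [2, 5]]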
Given two Array values of equal length, the zip
function returns the dot product of those arrays in the form of an Array of Pair objects: element i of the result pairs element i of the first array with element i of the second.
Pair[Int, String] p = (0, "z")
Array[Int] xs = [ 1, 2, 3 ]
Array[String] ys = [ "a", "b", "c" ]
Array[String] zs = [ "d", "e" ]
Array[Pair[Int, String]] zipped = zip(xs, ys) # i.e. zipped = [ (1, "a"), (2, "b"), (3, "c") ]
Given two Array values, the cross
function returns the cross product of those arrays in the form of an Array of Pair objects: every element of the first array is paired with every element of the second.
Pair[Int, String] p = (0, "z")
Array[Int] xs = [ 1, 2, 3 ]
Array[String] ys = [ "a", "b", "c" ]
Array[String] zs = [ "d", "e" ]
Array[Pair[Int, String]] crossed = cross(xs, zs) # i.e. crossed = [ (1, "d"), (1, "e"), (2, "d"), (2, "e"), (3, "d"), (3, "e") ]
Given an Array, the length
function returns the number of elements in the Array as an Int.
Array[Int] xs = [ 1, 2, 3 ]
Array[String] ys = [ "a", "b", "c" ]
Array[String] zs = [ ]
Int xlen = length(xs) # 3
Int ylen = length(ys) # 3
Int zlen = length(zs) # 0
Given an array of arrays, the flatten
function concatenates all the
member arrays in order of appearance to give the result. It does not
deduplicate the elements. Arrays nested more deeply than 2 must be
flattened twice (or more) to get down to an unnested Array[X]
.
For example:
Array[Array[Int]] ai2D = [[1, 2, 3], [1], [21, 22]]
Array[Int] ai = flatten(ai2D) # [1, 2, 3, 1, 21, 22]
Array[Array[File]] af2D = [["/tmp/X.txt"], ["/tmp/Y.txt", "/tmp/Z.txt"], []]
Array[File] af = flatten(af2D) # ["/tmp/X.txt", "/tmp/Y.txt", "/tmp/Z.txt"]
Array[Array[Pair[Float,String]]] aap2D = [[(0.1, "mouse")], [(3, "cat"), (15, "dog")]]
Array[Pair[Float,String]] ap = flatten(aap2D) # [(0.1, "mouse"), (3, "cat"), (15, "dog")]
The last example (aap2D
) is useful because Map[X, Y]
can be coerced to Array[Pair[X, Y]]
.
Given a String and an Array[X] where X is a primitive type, the prefix
function returns an array of strings in which each element of the input array is prefixed by the specified prefix string. For example:
Array[String] env = ["key1=value1", "key2=value2", "key3=value3"]
Array[String] env_param = prefix("-e ", env) # ["-e key1=value1", "-e key2=value2", "-e key3=value3"]
Array[Int] env2 = [1, 2, 3]
Array[String] env2_param = prefix("-f ", env2) # ["-f 1", "-f 2", "-f 3"]
Given an array of optional values, select_first
will select the first defined value and return it. Note that this is a runtime check and requires that at least one defined value will exist: if no defined value is found when select_first is evaluated, the workflow will fail.
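For example, assuming maybe_int is left unset at runtime:

Int? maybe_int
Int? fallback = 3
Int value = select_first([maybe_int, fallback]) # 3, because maybe_int is undefined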
Given an array of optional values, select_all
will select only those elements which are defined.
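For example, assuming b is left unset at runtime:

Int? a = 1
Int? b
Int? c = 3
Array[Int] present = select_all([a, b, c]) # [1, 3]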
This function will return false
if the argument is an unset optional value. It will return true
in all other cases.
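For example, assuming no value is supplied for maybe_file:

File? maybe_file
Boolean have_file = defined(maybe_file) # false, since maybe_file was never set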
- This function returns the basename of a file path passed to it: basename("/path/to/file.txt") returns "file.txt".
- Also supports an optional second parameter, a suffix to remove: basename("/path/to/file.txt", ".txt") returns "file".
- These functions convert a Float value into an Int by:
- floor: Round down to the next lower integer
- ceil: Round up to the next higher integer
- round: Round to the nearest integer based on standard rounding rules
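For example, as WDL declarations:

Float f = 2.7
Int down = floor(f)    # 2
Int up = ceil(f)       # 3
Int nearest = round(f) # 3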
Tasks and workflows are given values for their input parameters in order to run. Each input parameter is declared as a declaration on the task
or workflow
. Input parameters may be of any valid type:
Primitive Types:
- String
- Int
- Float
- File
- Boolean
Compound Types:
- Array
- Map
- Object
- Pair
When a WDL workflow engine instantiates a command specified in the command
section of a task
, it must serialize all ${...}
tags in the command into primitive types.
For example, if I'm writing a tool that operates on a list of FASTQ files, there are a variety of ways that this list can be passed to that task:
- A file containing one file path per line (e.g.
Rscript analysis.R --files=fastq_list.txt
) - A file containing a JSON list (e.g.
Rscript analysis.R --files=fastq_list.json
- Enumerated on the command line (e.g.
Rscript analysis.R 1.fastq 2.fastq 3.fastq
)
Each of these methods has its merits; which one works best depends on the tool.
On the other end, tasks need to be able to communicate data structures back to the workflow engine. For example, let's say this same tool that takes a list of FASTQs wants to return back a Map[File, Int]
representing the number of reads in each FASTQ. A tool might choose to output it as a two-column TSV or as a JSON object and WDL needs to know how to convert that to the proper data type.
WDL provides some standard library functions for converting compound types like Array
into primitive types, like File
.
When a task finishes, the output
section defines how to convert the files and stdout/stderr into WDL types. For example,
task test {
Array[File] files
command {
Rscript analysis.R --files=${sep=',' files}
}
output {
Array[String] strs = read_lines(stdout())
}
}
Here, the expression read_lines(stdout())
says "take the output from stdout, break into lines, and return that result as an Array[String]". See the definition of read_lines and stdout for more details.
Serializing primitive inputs is straightforward: the value is simply converted to a string and inserted into the command line.
Consider this example:
task output_example {
String s
Int i
Float f
command {
python do_work.py ${s} ${i} ${f}
}
}
If I provide values for the declarations in the task as:
var | value |
---|---|
s | "str" |
i | 2 |
f | 1.3 |
Then, the command would be instantiated as:
python do_work.py str 2 1.3
Compound types, like Array
and Map
must be converted to primitive types before they can be used in the command. There are many ways to turn a compound type into primitive types, as laid out in the following sections.
Arrays can be serialized in two ways:
- Array Expansion: elements in the list are flattened to a string with a separator character.
- File Creation: create a file with the elements of the array in it and pass that file as the parameter on the command line.
The array expansion approach is used when a parameter is specified as ${sep=' ' my_param}
. my_param
must be declared as an Array
of primitive types. When the value of my_param
is specified, then the values are joined together with the separator character (a space in this case). For example:
task test {
Array[File] bams
command {
python script.py --bams=${sep=',' bams}
}
}
If passed an array for the value of bams
:
Element |
---|
/path/to/1.bam |
/path/to/2.bam |
/path/to/3.bam |
Would produce the command python script.py --bams=/path/to/1.bam,/path/to/2.bam,/path/to/3.bam
An array may be turned into a file with each element in the array occupying a line in the file.
task test {
Array[File] bams
command {
sh script.sh ${write_lines(bams)}
}
}
if bams
is given this array:
Element |
---|
/path/to/1.bam |
/path/to/2.bam |
/path/to/3.bam |
Then, the resulting command line could look like:
sh script.sh /jobs/564758/bams
Where /jobs/564758/bams
would contain:
/path/to/1.bam
/path/to/2.bam
/path/to/3.bam
🐖 Coming soon in Cromwell
The array may be turned into a JSON document with the file path for the JSON file passed in as the parameter:
task test {
Array[File] bams
command {
sh script.sh ${write_json(bams)}
}
}
if bams
is given this array:
Element |
---|
/path/to/1.bam |
/path/to/2.bam |
/path/to/3.bam |
Then, the resulting command line could look like:
sh script.sh /jobs/564758/bams.json
Where /jobs/564758/bams.json
would contain:
[
"/path/to/1.bam",
"/path/to/2.bam",
"/path/to/3.bam"
]
Map types cannot be serialized on the command line directly and must be serialized through a file.
The map type can be serialized as a two-column TSV file and the parameter on the command line is given the path to that file, using the write_map()
function:
task test {
Map[String, Float] sample_quality_scores
command {
sh script.sh ${write_map(sample_quality_scores)}
}
}
if sample_quality_scores
is given this Map[String, Float] as:
Key | Value |
---|---|
sample1 | 98 |
sample2 | 95 |
sample3 | 75 |
Then, the resulting command line could look like:
sh script.sh /jobs/564757/sample_quality_scores.tsv
Where /jobs/564757/sample_quality_scores.tsv
would contain:
sample1\t98
sample2\t95
sample3\t75
🐖 Coming soon in Cromwell
The map type can also be serialized as a JSON file and the parameter on the command line is given the path to that file, using the write_json()
function:
task test {
Map[String, Float] sample_quality_scores
command {
sh script.sh ${write_json(sample_quality_scores)}
}
}
if sample_quality_scores is given this map:
Key | Value |
---|---|
sample1 | 98 |
sample2 | 95 |
sample3 | 75 |
Then, the resulting command line could look like:
sh script.sh /jobs/564757/sample_quality_scores.json
Where /jobs/564757/sample_quality_scores.json
would contain:
{
"sample1": 98,
"sample2": 95,
"sample3": 75
}
An object is a more general case of a map: the keys are strings and the values can be of arbitrary types, which are treated as strings. Objects can be serialized with either write_object()
or write_json()
functions:
task test {
Object sample
command {
perl script.pl ${write_object(sample)}
}
}
if sample is provided as:
Attribute | Value |
---|---|
attr1 | value1 |
attr2 | value2 |
attr3 | value3 |
attr4 | value4 |
Then, the resulting command line could look like:
perl script.pl /jobs/564759/sample.tsv
Where /jobs/564759/sample.tsv
would contain:
attr1\tattr2\tattr3\tattr4
value1\tvalue2\tvalue3\tvalue4
🐖 Coming soon in Cromwell
task test {
Object sample
command {
perl script.pl ${write_json(sample)}
}
}
if sample is provided as:
Attribute | Value |
---|---|
attr1 | value1 |
attr2 | value2 |
attr3 | value3 |
attr4 | value4 |
Then, the resulting command line could look like:
perl script.pl /jobs/564759/sample.json
Where /jobs/564759/sample.json
would contain:
{
"attr1": "value1",
"attr2": "value2",
"attr3": "value3",
"attr4": "value4",
}
Array[Object]
must guarantee that all objects in the array have the same set of attributes. These can be serialized with either write_objects()
or write_json()
functions, as described in the following sections.
An Array[Object]
can be serialized using write_objects()
into a TSV file:
task test {
Array[Object] sample
command {
perl script.pl ${write_objects(sample)}
}
}
if sample is provided as:
Index | Attribute | Value |
---|---|---|
0 | attr1 | value1 |
attr2 | value2 | |
attr3 | value3 | |
attr4 | value4 | |
1 | attr1 | value5 |
attr2 | value6 | |
attr3 | value7 | |
attr4 | value8 |
Then, the resulting command line could look like:
perl script.pl /jobs/564759/sample.tsv
Where /jobs/564759/sample.tsv
would contain:
attr1\tattr2\tattr3\tattr4
value1\tvalue2\tvalue3\tvalue4
value5\tvalue6\tvalue7\tvalue8
🐖 Coming soon in Cromwell
An Array[Object]
can be serialized using write_json()
into a JSON file:
task test {
Array[Object] sample
command {
perl script.pl ${write_json(sample)}
}
}
if sample is provided as:
Index | Attribute | Value |
---|---|---|
0 | attr1 | value1 |
attr2 | value2 | |
attr3 | value3 | |
attr4 | value4 | |
1 | attr1 | value5 |
attr2 | value6 | |
attr3 | value7 | |
attr4 | value8 |
Then, the resulting command line could look like:
perl script.pl /jobs/564759/sample.json
Where /jobs/564759/sample.json
would contain:
[
{
"attr1": "value1",
"attr2": "value2",
"attr3": "value3",
"attr4": "value4"
},
{
"attr1": "value5",
"attr2": "value6",
"attr3": "value7",
"attr4": "value8"
}
]
A task's command can only output data as files. Therefore, every de-serialization function in WDL takes a file as input and returns a WDL type.
De-serialization of primitive types is done through a read_*
function. For example, read_int("file/path")
and read_string("file/path")
.
For example, if I have a task that outputs a String
and an Int
:
task output_example {
String param1
String param2
command {
python do_work.py ${param1} ${param2} --out1=int_file --out2=str_file
}
output {
Int my_int = read_int("int_file")
String my_str = read_string("str_file")
}
}
Both files int_file
and str_file
should contain one line with the value on that line. This value is then validated against the type of the variable. If int_file
contains a line with the text "foobar", the workflow must fail this task with an error.
Tasks can also output to a file or stdout/stderr an Array
, Map
, or Object
data structure in two major formats:
- JSON - because it fits naturally with the types within WDL
- Text based / TSV - These are usually simple table and text-based encodings (e.g.
Array[String]
could be serialized by having each element be a line in a file)
Arrays are deserialized from:
- Files that contain a JSON Array as their top-level element.
- Any file where it is desirable to interpret each line as an element of the
Array
.
read_lines()
will return an Array[String]
where each element in the array is a line in the file.
This return value can be auto converted to other Array
types. For example:
task test {
command <<<
python <<CODE
import random
for i in range(10):
print(random.randrange(10))
CODE
>>>
output {
Array[Int] my_ints = read_lines(stdout())
}
}
my_ints
would contain ten random integers ranging from 0 to 9.
🐖 Coming soon in Cromwell
read_json()
will return whatever data type resides in that JSON file.
task test {
command <<<
echo '["foo", "bar"]'
>>>
output {
Array[String] my_array = read_json(stdout())
}
}
This task would assign the array with elements "foo"
and "bar"
to my_array
.
If the echo statement was instead echo '{"foo": "bar"}'
, the engine MUST fail the task for a type mismatch.
Maps are deserialized from:
- Files that contain a JSON Object as their top-level element.
- Two-column TSV files.
read_map()
will return a Map[String, String]
where the keys are the first column in the TSV input file and the corresponding values are the second column.
This return value can be auto converted to other Map
types. For example:
task test {
command <<<
python <<CODE
for i in range(3):
print("key_{idx}\t{idx}".format(idx=i)
CODE
>>>
output {
Map[String, Int] my_ints = read_map(stdout())
}
}
This would put a map containing three keys (key_0
, key_1
, and key_2
) and three respective values (0
, 1
, and 2
) as the value of my_ints.
🐖 Coming soon in Cromwell
read_json()
will return whatever data type resides in that JSON file. If that file contains a JSON object with homogeneous key/value pair types (e.g. string -> int
pairs), then the read_json()
function would return a Map
.
task test {
command <<<
echo '{"foo":"bar"}'
>>>
output {
Map[String, String] my_map = read_json(stdout())
}
}
This task would assign the one key-value pair map in the echo statement to my_map
.
If the echo statement was instead echo '["foo", "bar"]'
, the engine MUST fail the task for a type mismatch.
Objects are deserialized from files that contain a two-row, n-column TSV file. The first row contains the object attribute names, and the corresponding entries on the second row are the values.
read_object()
will return an Object
where the keys are the first row in the TSV input file and the corresponding values are the second row (corresponding column).
task test {
command <<<
python <<CODE
print('\t'.join(["key_{}".format(i) for i in range(3)]))
print('\t'.join(["value_{}".format(i) for i in range(3)]))
CODE
>>>
output {
Object my_obj = read_object(stdout())
}
}
This would put an object containing three attributes (key_0
, key_1
, and key_2
) and three respective values (value_0
, value_1
, and value_2
) as the value of my_obj.
Array[Object]
MUST assume that all objects in the array are homogeneous (they have the same attributes, but the attributes don't have to have the same values)
An Array[Object]
is deserialized from a TSV file that contains at least two rows and a uniform number of columns. The first row contains the object attribute names, and the corresponding entries on the subsequent rows are the values.
read_objects()
will return an Array[Object]
where the attribute names come from the first row of the TSV input file and each subsequent row supplies the values for one Object.
task test {
command <<<
python <<CODE
print('\t'.join(["key_{}".format(i) for i in range(3)]))
print('\t'.join(["value_{}".format(i) for i in range(3)]))
print('\t'.join(["value_{}".format(i) for i in range(3)]))
print('\t'.join(["value_{}".format(i) for i in range(3)]))
CODE
>>>
output {
Array[Object] my_obj = read_objects(stdout())
}
}
This would create an array of three identical Object
s containing three attributes (key_0
, key_1
, and key_2
) and three respective values (value_0
, value_1
, and value_2
) as the value of my_obj.