antonkulaga/epigenetics

Methylation

Methylation pipeline

Description

The pipeline is written in WDL (Workflow Description Language). The Broad Institute provides a nice video introduction that explains WDL, Cromwell, and DNA-Seq pipelines. For users with only high-school knowledge of biology, I would also recommend taking any free biology 101 or genetics 101 course (https://www.edx.org/course/introduction-to-biology-the-secret-of-life-3 is a good example), followed by a course on the epigenetic regulation of gene expression.

We do not use Broad's GATK pipeline (because we use DeepVariant as the variant caller), but the common tools are similar. All tools are dockerized, so make sure that Docker is installed. Before running the pipeline on a large genome (human or mouse), make sure you have 1-1.5 TB of free disk space.
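
The two prerequisites above (Docker installed, about 1 TB or more of free disk space) can be checked before starting a long run. The following is a minimal sketch and not part of the repository: the function name `preflight`, the 1 TB threshold (the lower bound of the range quoted above, in decimal bytes), and the choice to check the current directory are all assumptions made for illustration.

```python
import shutil

# Assumption: use the lower bound of the 1-1.5 TB range, in decimal bytes.
MIN_FREE_BYTES = 1_000_000_000_000


def preflight(path=".", min_free_bytes=MIN_FREE_BYTES, tools=("docker",)):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []

    # Check that each required executable can be found on PATH.
    for tool in tools:
        if shutil.which(tool) is None:
            problems.append(f"{tool} not found on PATH")

    # Check free space on the filesystem that contains `path`.
    free = shutil.disk_usage(path).free
    if free < min_free_bytes:
        problems.append(
            f"only {free / 1e12:.2f} TB free at {path}, "
            f"need at least {min_free_bytes / 1e12:.2f} TB"
        )
    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("WARNING:", problem)
```

Finding `docker` on PATH only shows that the client is installed; it does not confirm that the Docker daemon is running. Point `path` at the volume where the pipeline will actually write its outputs if that differs from the current directory.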

Prepare data

DVC is used for data management: it downloads annotations and can also run some useful scripts. To download all the data and do some preprocessing, use:

dvc repro
