Skip to content

Christensen-Lab-Dartmouth/PyMethylProcess

Repository files navigation

PyMethylProcess

pymethylprocess_overview

https://github.com/Christensen-Lab-Dartmouth/PyMethylProcess

Wiki: https://github.com/Christensen-Lab-Dartmouth/PyMethylProcess/wiki

Help documentation: https://christensen-lab-dartmouth.github.io/PyMethylProcess/

Alternatively, you can access the pdf: PyMethylProcess.pdf

What is it:

  • Preprocess 450k and 850k methylation IDAT files in parallel using Minfi, ENmix, and meffil
  • Convenient and scalable implementation
  • Imputation and Feature Selection
  • Preparation for machine learning pipelines

Why:

  • Make DNAm accessible to python developers and more machine learning oriented researchers
  • Streamlined analysis makes processing easy

PyMethyProcess is now available in Bioinformatics: https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btz594/5542385 .

Getting Started:

NOTE: There have been reported issues with installing PyMethylProcess on Mac OS Mojave (rpy2). If this is the issue, try the docker installation and please report an issue.

CWL Workflow Visualization
PyMethylProcess CWL Pipeline

Benchmark Results: benchmark

Supplementary Figure Removed from Manuscript: Supplemental

Supplemental Figure 1: UMAP embeddings (colored) of: a) GSE87571 (age), b) GSE81961 (disease status), c) GSE69138 (subtype), d) GSE42861 (disease status), e) GSE112179 (brain disorder), f) GSE90496 (subclass), g) TCGA Pancancer (subtype)

pipeline-download
pipeline-format
pipeline-preprocess
pipeline-visualize pipeline-train-test-split

About

Preprocessing methylation pipeline, written in python. Easy to use and highly parallelized.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published