This repository contains the best-practices for scRNA-seq analysis as determined by the community. Best practices are built upon the publication "Luecken and Theis, Current best practices in single-cell RNA-seq analysis: a tutorial, Mol Sys Biol (2019)". Extensions to these best-practices are decided upon based on benchmarking results and discussion in this repo.
This project is currently still in the planning phase.
This repo contains 4 main sections:
-
benchmarks:
A folder containing tool benchmarks in jupyter python notebooks. These benchmarks are discussed in PRs and github issues.
-
functions
ScRNA-seq analysis tools that are considered (or have been considered as best practices. R tools are wrapped in rpy2 and anndata2ri function calls.
-
workflow scripts
Scripts that use the best practices functions for data analysis case studies.
To extend the current version of the best-practices with additional tools, we require:
-
An accepted benchmark that is published and/or pushed to the
benchmarks
folder. 'Accepted' refers to benchmark studies that are follow the quality standards set out in this document. -
A community discussion on this tool with final acceptance of the majority of parties involved and all core members of the best-practices team.
- Determine standardized datasets for benchmarking
- Determine benchmarking standards
- Determine environment for benchmarking (docker container)
- Write current best-practices pipeline into individual functions and reproduce the notebook.
- Decide upon scripts format: snakemake or jupyter notebook.
Malte D Luecken
Leander Dony
Fabian Theis