Skip to content

Latest commit

 

History

History

pbmc_10k_v3

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

pbmc_10k_v3 data set

This example data set is a public scRNA-seq data set containing about 10k human PBMCs from a healthy donor (pbmc_10k_v3), generated using the v3 chemistry and processed with Cell Ranger 3.0.0. The data set is available through the 10x Genomics website and is licensed under the Creative Commons Attribution 4.0 license.

To test Cerebro, download the .crb file from either Seurat v2, Seurat v3 or scanpy and load it into Cerebro.

Workflow

The workflows of all three frameworks are conceptually the same, containing the following steps:

  • Load the transcript counts.
  • Filter cells based on the number of transcripts and expressed genes.
  • Normalize the transcript counts and scaled each cell to contain 10,000 transcripts.
  • Identify variable genes.
  • Scale the expression matrix and regressing out the number of transcripts.
  • Perform cell cycle analysis.
  • Perform principal component analysis.
  • Identify clusters and build a cluster tree.
  • Perform dimensional reduction.

Then, using the functions of cerebroApp, we add some more data:

  • Calculate the percent of mitochondrial and ribosomal gene expression (addPercentMtRibo()).
  • Get the most expressed genes in each sample and cluster (getMostExpressedGenes()).
  • Get marker genes for each sample and cluster (getMarkerGenes()).
  • Perform pathway enrichment analysis using the marker genes of each sample and cluster (getEnrichedPathways()).
  • Perform gene set enrichment analysis for each sample and cluster (performGeneSetEnrichmentAnalysis()).

Next, we calculate trajectories of (1) all cells and (2) a subset of cells (those in G1 phase) using Monocle v2 and the variable features identified by Seurat. We extract these trajectories from the respective Monocle objects and add them to our Seurat object through the extractMonocleTrajectory() function.

Lastly, from the Seurat object we export a Cerebro file (.crb extension) that can be loaded into Cerebro (exportFromSeurat()).

How to reproduce

The example data sets were generated using the official Cerebro Docker image which was built in Docker (Docker Hub) and imported into Singularity (here I used Singularity 2.6.0). The workflows for Seurat v2 and Seurat v3 are conceptually identical with some differences due to changes in the Seurat package. Details and descriptions for all workflows can be found in the respective directories Seurat v2, Seurat v3, and scanpy.