
`pbmc_10k_v3` data set

This example data set is a public scRNA-seq data set containing about 10k human PBMCs from a healthy donor (pbmc_10k_v3), generated using the v3 chemistry and processed with Cell Ranger 3.0.0. The data set is available through the 10x Genomics website and is licensed under the Creative Commons Attribution 4.0 license.

To test Cerebro, download the .crb file from the Seurat v2, Seurat v3, or scanpy directory and load it into Cerebro.

Workflow

The workflows of all three frameworks are conceptually the same, containing the following steps:

Load the transcript counts.
Filter cells based on the number of transcripts and expressed genes.
Normalize the transcript counts and scale each cell to contain 10,000 transcripts.
Identify variable genes.
Scale the expression matrix and regress out the number of transcripts.
Perform cell cycle analysis.
Perform principal component analysis.
Identify clusters and build a cluster tree.
Perform dimensional reduction.
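
The filtering and normalization steps above can be illustrated with a minimal plain-Python sketch. This is not the Seurat or scanpy API; the function names (`filter_cells`, `scale_to_target`, `log_normalize`), the thresholds, and the toy counts are hypothetical, and the log transform is an assumption about what typically follows the scaling step.

```python
import math


def filter_cells(counts, min_transcripts, min_genes):
    """Keep cells with at least min_transcripts total counts and
    at least min_genes genes with a non-zero count."""
    kept = {}
    for cell, gene_counts in counts.items():
        n_transcripts = sum(gene_counts.values())
        n_genes = sum(1 for c in gene_counts.values() if c > 0)
        if n_transcripts >= min_transcripts and n_genes >= min_genes:
            kept[cell] = gene_counts
    return kept


def scale_to_target(gene_counts, target=10_000):
    """Rescale one cell so that its counts sum to `target` transcripts."""
    total = sum(gene_counts.values())
    if total == 0:
        raise ValueError("cannot scale a cell without transcripts")
    return {gene: c / total * target for gene, c in gene_counts.items()}


def log_normalize(gene_counts, target=10_000):
    """Scale to `target` transcripts, then apply log(1 + x).
    The log step is an assumption, not stated in the workflow above."""
    scaled = scale_to_target(gene_counts, target)
    return {gene: math.log1p(value) for gene, value in scaled.items()}


# Toy data (hypothetical): cell barcode -> gene -> transcript count
counts = {
    "cell_a": {"CD3E": 3, "MS4A1": 0, "LYZ": 1},
    "cell_b": {"CD3E": 0, "MS4A1": 0, "LYZ": 1},
}

kept = filter_cells(counts, min_transcripts=2, min_genes=2)
scaled = scale_to_target(kept["cell_a"])
```

With these toy thresholds only `cell_a` passes the filter, and its scaled counts sum to 10,000. The actual thresholds and parameters used for each framework are documented in the respective directories.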

Then, using the functions of cerebroApp, we add some more data:

Calculate the percent of mitochondrial and ribosomal gene expression (addPercentMtRibo()).
Get the most expressed genes in each sample and cluster (getMostExpressedGenes()).
Get marker genes for each sample and cluster (getMarkerGenes()).
Perform pathway enrichment analysis using the marker genes of each sample and cluster (getEnrichedPathways()).
Perform gene set enrichment analysis for each sample and cluster (performGeneSetEnrichmentAnalysis()).
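
To make the first of these additions concrete, here is a minimal plain-Python sketch of what a per-cell percentage of mitochondrial and ribosomal expression means. It is not the implementation of `addPercentMtRibo()`; the helper `percent_of_counts`, the gene-name prefixes, and the toy counts are assumptions made for illustration, and cerebroApp may identify these genes differently.

```python
# Assumed prefixes for human gene symbols (illustrative only).
MT_PREFIXES = ("MT-",)
RIBO_PREFIXES = ("RPS", "RPL")


def percent_of_counts(gene_counts, prefixes):
    """Return the percentage of a cell's transcripts that come from
    genes whose symbol starts with one of `prefixes`."""
    total = sum(gene_counts.values())
    if total == 0:
        return 0.0
    matched = sum(
        count
        for gene, count in gene_counts.items()
        if gene.upper().startswith(prefixes)
    )
    return 100.0 * matched / total


# Toy cell (hypothetical counts): gene -> transcript count
cell = {"MT-CO1": 5, "RPS6": 10, "RPL13": 5, "CD3E": 30}

percent_mt = percent_of_counts(cell, MT_PREFIXES)
percent_ribo = percent_of_counts(cell, RIBO_PREFIXES)
```

Prefix matching is a simplification: a symbol such as `RPS6KA1` starts with `RPS` but is not a ribosomal protein gene, so a curated gene list is the safer choice for real data.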

Next, we calculate trajectories of (1) all cells and (2) a subset of cells (those in G1 phase) using Monocle v2 and the variable features identified by Seurat. We extract these trajectories from the respective Monocle objects and add them to our Seurat object through the extractMonocleTrajectory() function.

Lastly, from the Seurat object we export a Cerebro file (.crb extension) that can be loaded into Cerebro (exportFromSeurat()).

How to reproduce

The example data sets were generated using the official Cerebro Docker image, which was built with Docker (Docker Hub) and imported into Singularity (here I used Singularity 2.6.0). The workflows for Seurat v2 and Seurat v3 are conceptually identical, with some differences due to changes in the Seurat package. Details and descriptions for all workflows can be found in the respective directories: Seurat v2, Seurat v3, and scanpy.
