Multiple Random Effects #134

jgockley62 · 2024-05-03T21:48:48Z

Hi,

I have a single-cell RNA-seq data set from 167 individuals, with the individuals spread across several batches. I want to fit a model across all cells such as ~ (1|Indv_ID) + (1|Batch) + cov_1 + cov_2. From what I understand, I could rename the individual ID column to sample_id and run:

mmDS(sce,
     covs = c("cov_1", "cov_2"),
     method = "dream2",
     n_threads = 32 )

But this would only specify a mixed linear model of ~ (1|Indv_ID) + cov_1 + cov_2, correct?
How could I add Batch as a random effect? Would renaming the batch column to (1|Batch) work?
Thanks

plger · 2024-05-04T05:47:11Z

Hi, no, renaming the column won't help: right now it's not possible to specify two random-effect variables. How are the individuals spread across the batches? If you have large batches that each contain the different experimental groups, then you should be fine modeling batch as a fixed-effect variable (i.e. a normal covariate). If the batches are small, there's probably not much added value on top of the individual effect, except of course if your aim is to understand individual vs batch variability (as opposed to identifying differences between experimental groups). Note also that in our benchmarks we did not find an advantage of cell-level mixed models over pseudobulk analysis, and with that many samples I would definitely opt for pseudobulk...
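
A minimal sketch of the fixed-effect option, for illustration only. It assumes `sce` was prepared with `prepSCE()` so that `sample_id` holds the individual, and that `cov_1`, `cov_2` and `Batch` are columns of `colData(sce)`. The wrapper name `run_mm_fixed_batch` is hypothetical, and the `n_threads` argument from the question is left out because parallelisation arguments differ between muscat versions.

```r
library(muscat)

# Sketch only: assumes colData(sce) has cluster_id, sample_id (the individual)
# and group_id, plus the covariate columns cov_1, cov_2 and Batch (a factor).
run_mm_fixed_batch <- function(sce) {
  mmDS(sce,
       covs = c("cov_1", "cov_2", "Batch"),  # Batch enters as a fixed effect
       method = "dream2")
}
```

With this call the individual remains the only random effect, through `sample_id`; `Batch` is estimated like any other covariate.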

jgockley62 · 2024-05-06T17:05:08Z

Using batch as a fixed effect is possible, albeit not optimal. It's not possible to fit a mixed model and then pseudobulk the corrected expression, is it? I.e.:

Correct at the cell level: ~ (1|batch) + mt_Percent + logUMI
Pseudobulk by individual
DE analysis: ~ (1|IndvID) + sex + age + disease

plger · 2024-05-07T07:55:28Z

Hi,
Of course you can do that, but you lose the uncertainty associated with the effects of the covariates you correct for, which is dangerous. This is getting into an area where we don't have very clear facts on which to base decisions, but I'm pretty confident that this is considerably worse than treating your batches as fixed effects.
If you really insist on fitting the model you want to fit (and again I'm not sure you should), you can do it by splitting your clusters and manually running dream on each (instructions for dream are available in this vignette).
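
A rough sketch of what splitting by cluster and running dream manually might look like. This is not muscat's internal code. It assumes raw counts in `counts(sce)` and `colData(sce)` columns named `cluster_id`, `group_id`, `Indv_ID`, `Batch`, `cov_1` and `cov_2`; the helper name `fit_cluster_dream` and the gene filtering step are illustrative choices, and the dream vignette remains the authoritative reference.

```r
library(SingleCellExperiment)
library(edgeR)
library(variancePartition)

# Model from the question plus the group effect of interest
# (all column names are assumptions about colData(sce)).
form <- ~ (1 | Indv_ID) + (1 | Batch) + cov_1 + cov_2 + group_id

# Fit dream on the cells of a single cluster.
fit_cluster_dream <- function(sce_k, form) {
  meta <- as.data.frame(colData(sce_k))
  dge <- DGEList(counts = as.matrix(counts(sce_k)))
  keep <- filterByExpr(dge, group = meta$group_id)  # drop lowly expressed genes
  dge <- calcNormFactors(dge[keep, , keep.lib.sizes = FALSE])
  vobj <- voomWithDreamWeights(dge, form, meta)     # precision weights
  fit <- dream(vobj, form, meta)                    # per-gene mixed model
  eBayes(fit)
}

# One fit per cluster; inspect with topTable(fit, coef = ...), where the
# coefficient name depends on the levels of group_id.
run_all_clusters <- function(sce, form) {
  idx <- split(seq_len(ncol(sce)), sce$cluster_id)
  lapply(idx, function(i) fit_cluster_dream(sce[, i], form))
}
```

Fitting per-cell mixed models for 167 individuals is likely to be slow, so it may be worth trying this on a single small cluster first.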

jgockley62 · 2024-05-07T16:34:45Z

I'll poke around the options and see how it pans out, thanks!
