Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about sdY #102

Open
maegsul opened this issue Nov 3, 2022 · 1 comment
Open

Question about sdY #102

maegsul opened this issue Nov 3, 2022 · 1 comment

Comments

@maegsul
Copy link

maegsul commented Nov 3, 2022

Hi,

First of all, thanks a lot @chr1swallace again for developing coloc - it has been very useful for us to answer many questions we have. A big thanks! I have a rather theoretical/statistical question regarding the "sdY" paramater that is the population standard deviation of the trait for a quantitative trait.

The vignette here indicates that, if the study standardised their (quantitative) trait to have a variance of 1, we can set sdY to 1. However, I am curious whether this recommendation would be still valid if the study used also a set of covariates in the linear regression model predicting this already standardised quantitative trait?

To give an example: let's say we have a set of standardised gene expression values for geneX across n=100 individuals, and we map cis-eQTL variants near this gene, using covariates such as sex, age, and principal components in linear regression, such as below:

glm(standardised_geneX_expression ~ genotype + sex + age + PC1+ PC2 + PC3, data = example_data_table, family=gaussian(link="identity"))

In this case, would it be still a correct assumption to consider sdY = 1 in coloc.abf function for this eQTL dataset (and providing beta & varbeta as well in the coloc.abf function along with sdY), because the regressed out outcome variable might not have a standard deviation of 1 anymore, and betas & varbetas we obtain (and provide to coloc.abf function) are not informing directly for a standardised geneX expression anymore, but they are informing for this outcome after controlling for sex, age, and principal components?

I was just thinking about it, and I wanted to be on the safe side regarding taking into account sdY parameter correctly. What do you think?

Many thanks,
Fahri

@chr1swallace
Copy link
Owner

chr1swallace commented Nov 8, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants