-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Subject: Inquiry about Using tensorQTL for SV eQTL Mapping #177
Comments
Hi, tensorQTL provides a general linear regression framework, and if you encode the SV genotypes as dosages it will work fine. |
Thank you for developing such a great software! I have a quick question: I noticed that SNPs are typically considered eQTL candidates within a 1MB region. When using SVs, do we only consider the breakpoints' positions, or do we also need to take into account the start and end positions of insertions and deletions? For example, if the start and end positions of insertions or deletions overlap with a gene’s 1MB region, should they also be considered as potential eQTLs? Do I need to modify the software manually to account for this? Thank you again for your help! |
This is flexible and you can define the windows as you see fit. The cis-window is defined as [start-window, end+window] — for TSS ± 1Mb typically used for eQTLs, start and end in the input BED file must be the same, but this is flexible and you could set window to 0 and define start/end for each phenotype. |
Wanted to follow up on this as I'm also exploring structural variants. I think the complication here is that structural variants may be quite large, but tensorQTL uses only a single variant position when determining which variants lie in a gene's cis window. This has been noted in previous publications (with regard to FastQTL), e.g.: "We customized the FastQTL software to include an SV for genotype-expression associations when the span of a deletion, duplication, mCNV, or rMEI fell within the cis window for a particular gene TSS, or when the breakpoints of an inversion or uncharacterized breakend (BND) fell within the cis window." (Chiang et al 2017, The impact of structural variation on human gene expression). One imperfect hack is to simply duplicate SVs. For example, if there is a 1Mb deletion spanning chr1:1,000,000 - 2,000,000 and your cis window is 1Mb, duplicate the variant in the VCF such that the variant is at chr1:1,000,000 and chr1:2,000,000 . This is not ideal as in some cases you'll test the same variant twice against a given gene, and you'll need to postprocess the tensorQTL output to account for this (for example, updating SuSiE PIP values if that variant ends up in a SuSiE credible set). As noted in the excerpt from Chiang et al, the gene - variant pairs to test depend both on the SV start/end points and the SV type, so simply allowing a variant to have a start/end isn't sufficient. The most flexible approach might just be to allow the user to provide a list of variant - gene pairs to test, rather than defining these using cis windows. |
I am currently working on a project involving structural variants (SVs) and their association with gene expression, and I am interested in using tensorQTL for eQTL mapping involving SVs.
I would like to clarify whether tensorQTL supports the calculation of eQTLs for structural variants, or if any modifications to the script or methodology would be required to adapt it for this use case. Specifically, I am interested in:
Any guidance or suggestions would be greatly appreciated.
The text was updated successfully, but these errors were encountered: