TBProfiler: dealing with IS6110 insertions #374

Open
vrennie opened this issue Jun 27, 2024 · 5 comments


vrennie commented Jun 27, 2024

@jodyphelan we have some indications that IS6110 insertions may be involved in bedaquiline (BDQ) resistance.

We have some ideas on how to solve this, but we have some questions about constructing a VCF that TBProfiler can read:

  1. What are the minimum columns that TBProfiler needs in the VCF? Can the INFO/QUAL/FILTER/FORMAT columns be empty?

  2. Do you think that TBProfiler would be able to deal with the following VCF (see attached), or would this format cause issues?

[Attached screenshot: Screenshot 2024-06-27 at 09 05 52]

@jodyphelan (Owner)

It uses the following fields in the FORMAT column to calculate the frequency of reads supporting the variant:

##FORMAT=<ID=DR,Number=1,Type=Integer,Description="# high-quality reference pairs">
##FORMAT=<ID=DV,Number=1,Type=Integer,Description="# high-quality variant pairs">
##FORMAT=<ID=RR,Number=1,Type=Integer,Description="# high-quality reference junction reads">
##FORMAT=<ID=RV,Number=1,Type=Integer,Description="# high-quality variant junction reads">

But if these are not available, we could probably bypass the frequency calculation and filtering.
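
As a rough illustration of the frequency calculation and the proposed bypass, here is a minimal sketch. The function name `sv_frequency` and the decision to pool paired-end and junction-read counts are assumptions made for illustration; this is not pathogen-profiler's actual logic.

```python
from typing import Mapping, Optional


def sv_frequency(fmt: Mapping[str, int]) -> Optional[float]:
    """Return the fraction of reads supporting the variant.

    Returns None when the DR/DV/RR/RV counts are missing, so a caller
    can skip frequency-based filtering instead of failing.
    (Hypothetical helper, not part of pathogen-profiler.)
    """
    required = ("DR", "DV", "RR", "RV")
    if any(key not in fmt for key in required):
        return None  # counts unavailable: bypass the calculation

    ref_support = fmt["DR"] + fmt["RR"]  # reference pairs + junction reads
    alt_support = fmt["DV"] + fmt["RV"]  # variant pairs + junction reads
    total = ref_support + alt_support
    if total == 0:
        return None  # no informative reads, frequency is undefined
    return alt_support / total
```

A caller would then apply the frequency filter only when the returned value is not `None`.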

vrennie (Author) commented Jun 27, 2024

Indeed, these are not available unless we pull them from another source, which I think would be rather cumbersome.

How do we bypass the calculation?

@jodyphelan (Owner)

I'd have to add that to the processing logic in pathogen-profiler. How are you calling the insertion?

vrennie (Author) commented Jun 27, 2024

Using ISMapper (https://github.com/jhawkey/IS_mapper) to generate a text file, then a custom script to convert that into a VCF for interpretation by tb-profiler.
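
The conversion step could look something like the sketch below. It assumes the insertion positions have already been parsed out of the ISMapper table into `(position, reference_base)` pairs; the function name `build_minimal_vcf`, the contig name `Chromosome`, the contig length, and the use of a symbolic `<INS>` allele are all assumptions for illustration. Whether tb-profiler accepts such records is exactly the open question in this thread. Per the VCF specification, the eight fixed columns must be present, but ID, QUAL, FILTER and INFO may hold the missing value `.`.

```python
from typing import Iterable, Tuple


def build_minimal_vcf(
    insertions: Iterable[Tuple[int, str]],
    sample: str = "sample1",
    contig: str = "Chromosome",    # assumed reference contig name
    contig_length: int = 4411532,  # assumed H37Rv reference length
) -> str:
    """Render (position, reference_base) pairs as a minimal VCF string.

    Hypothetical helper: each insertion is written with a symbolic <INS>
    allele, and QUAL/FILTER are left as the missing value ".".
    """
    columns = ["#CHROM", "POS", "ID", "REF", "ALT",
               "QUAL", "FILTER", "INFO", "FORMAT", sample]
    header = [
        "##fileformat=VCFv4.2",
        f"##contig=<ID={contig},length={contig_length}>",
        '##ALT=<ID=INS,Description="Insertion of novel sequence">',
        '##INFO=<ID=SVTYPE,Number=1,Type=String,'
        'Description="Type of structural variant">',
        '##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">',
        "\t".join(columns),
    ]
    records = []
    for pos, ref_base in sorted(insertions):
        fields = [contig, str(pos), ".", ref_base, "<INS>",
                  ".", ".", "SVTYPE=INS", "GT", "1/1"]
        records.append("\t".join(fields))
    return "\n".join(header + records) + "\n"
```

Writing the returned string to a file, then compressing and indexing it with the usual VCF tooling, would give a file to test against tb-profiler.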

vrennie (Author) commented Jun 27, 2024

@jodyphelan I did a basic test to generate a VCF containing minimal information. To check that tb-profiler reads it correctly, I started by putting a regular SNP in the Rv0678 gene. However, this generated a JSON with 1 total variant (expected), but the variant was not interpreted by tb-profiler (unexpected). Could you check whether you get the same on your side?

[Attachment: IS_NICD_test.vcf.zip]
