Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing mutations from WHO Catalogue #56

Open
mlarjim opened this issue Jul 13, 2023 · 10 comments
Open

Missing mutations from WHO Catalogue #56

mlarjim opened this issue Jul 13, 2023 · 10 comments

Comments

@mlarjim
Copy link

mlarjim commented Jul 13, 2023

Hi!
As far as I am concerned, tb-profiler database contains all the mutations that confer drug resistance listed in the WHO catalogue.
However, the following mutation is not found in the tbdb https://github.com/jodyphelan/tbdb/blob/master/tbdb.csv

Gene: gid
Mutation: gid_347_ins_1_cgcacgatctcaacggcc_ccgcacgatctcaacggca
Literature Evidence: Catalogue of mutations in Mycobacterium tuberculosis complex and their association with drug resistance (WHO)
conf_grade: 2) Assoc w R - Interim (STM_S)

Why is this variant missing?

@jodyphelan
Copy link
Owner

Some variants could not be translated as there were some issues the reference and alternate not agreeing with the rest of the variant description. In this case it should be an insertion of 1 nucleotide but if we align the reference and the alternate we see it is actually a combination of an insertion and a SNP:

c-gcacgatctcaacggcc
|*||||||||||||||||* 
ccgcacgatctcaacggca

There were a few of these cases

@mlarjim
Copy link
Author

mlarjim commented Jul 13, 2023

Thank you for your remark, Jody. Effectively, the WHO catalogue is mistaken in the variant nomenclature. But the final annotation (column final_annotation.TentativeHGVSNucleotidicAnnotation) states that the mutation is actually a combination of a deletion and an insertion:

c.330_346delGGCCGTTGAGATCGTGCinsTGCCGTTGAGATCGTGCG

Is there any possibility that the tb-profiler database contemplates these cases?

@jodyphelan
Copy link
Owner

Oh right - I hadn't seen that they had added this hgvs notation now. I will take a look and see if I can include more of these cases.

@mlarjim
Copy link
Author

mlarjim commented Jul 13, 2023

thank you so much!

@frogtraveler
Copy link

Hello,

I noticed that mutation fabG1 c.-16A>G in TBDB in only listed as conferring R-interim for INH while in WHO it also has the same prediction for ETH. Is there a reason why ETH prediction was not included?

Thank you!
Varvara

@frogtraveler
Copy link

Another issue: rrl mutation detected by TBProfiler as
n.-255C>T doesn't return match with TBDB though it is present in WHO with "Uncertain" confidence. Instead TBDB has mutation listed as c.-255C>T (also uncertain significance). Those are the same, right?

@jodyphelan
Copy link
Owner

Hi @frogtraveler ,

Indeed it looks like

  1. fabG1 c.-16A>G it missing for ETH
  2. rrl c.-255C>T should be listed as n.-255C>T

I'll get a new version of the db released this week and look for any other potential issues.

@frogtraveler
Copy link

Awesome! Thank you so much, Jody!

@jodyphelan
Copy link
Owner

Hi @frogtraveler,

I've regenerated the mutation lists based on the hgvs annotations from the WHO list now. The mutations you highlighted are now in:

If you run tb-profiler update_tbdb they should be updated for you :)

@frogtraveler
Copy link

Thank you so much, Jody!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants