You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
gatk filters only for "--restrict-alleles-to BIALLELIC", completely ignoring every --select statement. This results in cases where variants that fail all set --select filters still appear in the output. E.g:
Importantly the same command works without any problems on older versions. E.g. SelectVariants with GATK 4.2. on the same raw input file with the same command works as expected (intended) and filters all variants that do not meet the criteria set with --select statements.
Hi. Each select parameter is acting independently in your case and that is expected. You need to compile all cases in a single select parameter with ANDs ORs or parantheses. And and Or statements need to be used as in programming style such as || &&.
Bug Report
Affected tool(s) or class(es)
gatk SelectVariants
Affected version(s)
gatk 4.6.0. (not tested on 4.6.1.)
Description
select statements are ignored.
When using the following filter settings:
##GATKCommandLine=<ID=SelectVariants,CommandLine="SelectVariants --output output --select AC > 6 --select MQ > 40.0 --select DP < 25260 --select SOR < 3.0 --select QD > 2.0 --select FS < 60.0 --select MQRankSum > -12.5 --select ReadPosRankSum > -8.0 --select BaseQRankSum > -12.5 --restrict-alleles-to BIALLELIC
gatk filters only for "--restrict-alleles-to BIALLELIC", completely ignoring every --select statement. This results in cases where variants that fail all set --select filters still appear in the output. E.g:
AC=1;AF=5.453e-04;AN=1834;BaseQRankSum=-6.740e-01;DP=4543;ExcessHet=0.0000;FS=0.000;InbreedingCoeff=-0.0203;MLEAC=2;MLEAF=1.091e-03;MQ=41.22;MQRankSum=0.00;QD=5.98;ReadPosRankSum=-1.282e+00;SOR=0.495 GT:AD:DP:GQ:PGT:PID:PL:PS
Steps to reproduce
Calculations were tested on RHEL 8 AlmaLinux 8.10 and CentOS 7 nodes (7.9.2009 (CORE)).
gatk_4.6.0.0 SelectVariants --variant variant --reference /project/cultibv/fwasch/CultiBeet_3/RefBeet_3/RefBeet-3.0.fa --output output --select "AC > 6" --restrict-alleles-to BIALLELIC --select "MQ > 40.0" --select "DP < 25260" --select "SOR < 3.0" --select "QD > 2.0" --select "FS < 60.0" -select "MQRankSum > -12.5" --select "ReadPosRankSum > -8.0" --select "BaseQRankSum > -12.5"'&> filter_vcfs.error &
Importantly the same command works without any problems on older versions. E.g. SelectVariants with GATK 4.2. on the same raw input file with the same command works as expected (intended) and filters all variants that do not meet the criteria set with --select statements.
Expected behavior
Variants like this:
AC=1;AF=5.453e-04;AN=1834;BaseQRankSum=-6.740e-01;DP=4543;ExcessHet=0.0000;FS=0.000;InbreedingCoeff=-0.0203;MLEAC=2;MLEAF=1.091e-03;MQ=41.22;MQRankSum=0.00;QD=5.98;ReadPosRankSum=-1.282e+00;SOR=0.495 GT:AD:DP:GQ:PGT:PID:PL:PS
should not be in the output.
Thanks already for your help!
The text was updated successfully, but these errors were encountered: