Error: bcftools consensus fail with overlapping variants

When running `artic minion --medaka`, we are occasionally running into an issue where there are overlapping pass variants from the different read groups that is causing bcftools consensus to fail and an empty consensus file is generated. I did see similar issues [1](https://github.com/artic-network/fieldbioinformatics/issues/21), [2](https://github.com/artic-network/fieldbioinformatics/issues/125),  but those seem to be caused by an overlap of a pass and fail variant. 

We have had this issue on two medaka models,  r1041_e82_400bps_hac_v4.2.0 and r941_min_hac_g507, but did not encounter this issue with the model r941_min_high_g360. The models have been updated alongside our basecaller and have been confirmed with 
`medaka tools resolve_model --auto_model`.

---

Log:
Running: 
`bcftools consensus -f sample.preconsensus.fasta sample.pass.vcf.gz -m sample.coverage_mask.txt -o sample.consensus.fasta`
The fasta sequence does not match the REF allele at MN908947.3:669:
   REF .vcf: [GT]
   ALT .vcf: [G]
   REF .fa : [GN]TAC.........
Command failed:
`bcftools consensus -f sample.preconsensus.fasta sample.pass.vcf.gz -m sample.coverage_mask.txt -o sample.fasta`

merged.vcf:
POS | ID | REF | ALT | QUAL | FILTER | INFO | FORMAT | SAMPLE
-- | -- | -- | -- | -- | -- | -- | -- | --
669 | . | GT | G | 500 | PASS | DP=1126;AC=2,44;AM=1080;MC=0;MF=0.000;MB=0.000;AQ=4.42;GM=1;PH=6.02,6.02,6.02,6.02;SC=None; | GT:GQ:DP:PS:UG:UQ | 1/1:103:1126:.:1/1:102.88
670 | . | T | G | 403 | PASS | DP=1126;AC=25,37;AM=1064;MC=0;MF=0.000;MB=0.000;AQ=6.71;GM=1;PH=6.02,6.02,6.02,6.02;SC=None; | GT:GQ:DP:PS:UG:UQ | 0/1:128:1126:.:0/1:127.13

sample.1.vcf:
CHROM | POS | ID | REF | ALT | QUAL | FILTER | INFO | FORMAT | SAMPLE
-- | -- | -- | -- | -- | -- | -- | -- | -- | --
MN908947.3 | 670 | . | T | G | 59.694 | PASS | . | GT:GQ | 1:60

sample.2.vcf:
CHROM | POS | ID | REF | ALT | QUAL | FILTER | INFO | FORMAT | SAMPLE
-- | -- | -- | -- | -- | -- | -- | -- | -- | --
MN908947.3 | 669 | . | GT | G | 7.539 | PASS | . | GT:GQ | 1:8
MN908947.3 | 673 | . | CG | C | 5.769 | PASS | . | GT:GQ | 1:6

sample.primertrimmed.rg.sorted.bam:
![Screenshot 2024-02-16 164332](https://github.com/artic-network/fieldbioinformatics/assets/94877138/f1d37338-7cef-4693-822f-f2ad0d923b8c)



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Error: bcftools consensus fail with overlapping variants #132

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

POS	ID	REF	ALT	QUAL	FILTER	INFO	FORMAT	SAMPLE
669	.	GT	G	500	PASS	DP=1126;AC=2,44;AM=1080;MC=0;MF=0.000;MB=0.000;AQ=4.42;GM=1;PH=6.02,6.02,6.02,6.02;SC=None;	GT:GQ:DP:PS:UG:UQ	1/1:103:1126:.:1/1:102.88
670	.	T	G	403	PASS	DP=1126;AC=25,37;AM=1064;MC=0;MF=0.000;MB=0.000;AQ=6.71;GM=1;PH=6.02,6.02,6.02,6.02;SC=None;	GT:GQ:DP:PS:UG:UQ	0/1:128:1126:.:0/1:127.13

CHROM	POS	ID	REF	ALT	QUAL	FILTER	INFO	FORMAT	SAMPLE
MN908947.3	669	.	GT	G	7.539	PASS	.	GT:GQ	1:8
MN908947.3	673	.	CG	C	5.769	PASS	.	GT:GQ	1:6

Error: bcftools consensus fail with overlapping variants #132

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions