Re: [Samtools-help] incorrect reference bases

2016-07-26 Thread Colin Hercus
Hi Annie, As a way to avoid this problem we usually use a fasta file with no IUPAC ambiguous codes (other than N) as reference for variant calling. We'll take our standard reference without IUPAC codes, add IUPAC codes based on dbSNP and then map reads to this with novoalign. In the variant calli

Re: [Samtools-help] incorrect reference bases

2016-07-26 Thread John Marshall
On 22 Jul 2016, at 21:49, Annie Cowell wrote: > samtools mpileup -C50 -Bug -t AD -Q10 -f my.fasta sample.bam | bcftools call > -mv -Ov > my.vcf > > However, I am getting unusual bases for a couple of indels in the reference > field, such as W (as below) and R. > > AAKM0165 8883.

Re: [Samtools-help] incorrect reference bases

2016-07-22 Thread Thomas W. Blackwell
I'm guessing that these are IUPAC ambiguity codes for cases where the indel overlaps a SNP in other individuals, or the other allele in the same individual. Look in the bcftools documentation for ways to suppress the ambiguity codes. I don't know if there is a way or not.

[Samtools-help] incorrect reference bases

2016-07-22 Thread Annie Cowell
Hello, I'm calling variants the following way for single or multiple samples using samtools and bcftools version 1.3.1 samtools mpileup -C50 -Bug -t AD -Q10 -f my.fasta sample.bam | bcftools call -mv -Ov > my.vcf However, I am getting unusual bases for a couple of indels in the reference field,