Hello,

Sometime back, I had asked for help regarding the conversion for the vcf format 
to bed format for liftOver. But unfortunately, that did not work for me.
My vcf format goes like this

##fileformat=VCFv4.1
##samtoolsVersion=0.1.16 (r963:234)
.......
#CHROM  POS     ID      REF     ALT     QUAL    FILTER  INFO    FORMAT      
sample1      sample2      sample3   sample 4
chr1           11457    .          C       G       49.7    .       
DP=3;AF1=0.9999;CI95=0.375,1;DP4=0,0,0,3;MQ=49;FQ=-29.3 GT:PL:GQ        
1/1:4,3,0:39  1/1:40,3,0:39   ...

I used the awk command

grep -v "^#" input.vcf | awk '{printf 
"%s\t%d\t%d\t%s|%s|%s|%s|%s|%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\n", 
$1,$2-1,$2,$3,$4,$5,$6,$7,$8,$9,$10,$11,$12,$13,$14,$15}' > output.bed

Now when I tried to use it for the liftOver , I get an error saying

>./liftOver output.bed hg19ToHg18.over.chain.gz new.bed unMapped
Reading liftover chains
Mapping coordinates
Expecting integer field 7 line 1 of output.bed, got 1/1:40,3,0:39

I need to retain GT:PL:GQ values as I would need it for my further analysis.

Is there any way out of this ? Any suggestions would be appreciated..


Thank you,
Warm Regards,

Nandini 

Thank you,
Warm Regards,

Nandini Badarinarayan


> Date: Tue, 2 Aug 2011 10:06:23 -0700
> From: [email protected]
> To: [email protected]
> CC: [email protected]
> Subject: Re: [Genome] liftover hg19 to hg18
> 
> Good Morning Nandini:
> 
> You can convert your VCF files to bed format.
> 
> If your VCF files follow the format:
> http://www.1000genomes.org/node/101
> 
> This awk command will convert them to bed format (assuming | is not a 
> character found in the vcf fields):
> 
> grep -v "^#" yourFile.vcf | awk '{printf "%s\t%d\t%d\t%s|%s|%s|%s|%s|%s\n", 
> $1,$2-1,$2,$3,$4,$5,$6,$7,$8}' > yourFile.bed
> liftOver yourFile.bed hg19ToHg18.over.chain newFile.bed unMapped
> 
> Then, from that lifted bed file back to vcf:
> 
> awk '{printf "%s\t%d\t%s\n", $1,$2+1,$4}' newFile.bed | tr '[|]' '[\t]' > 
> newFile.vcf
> 
> You may need to adjust these awk statements depending upon the exact format 
> of your vcf file.
> 
> --Hiram
> 
> Nandini B wrote:
> > Hello,
> > I am trying to use liftover for my SNPs from hg19 to hg18. But I have my 
> > SNP files in either gff3 format or vcf (v4.1). Is it possible to use these 
> > files to execute the command 
> > liftOver -gff oldFile.gff3 hg19ToHg18.over.chain newFile.gff3 unMapped  or
> > 
> > liftOver -vcf oldFile.vcf hg19ToHg18.over.chain newFile.vcf unMapped
> > 
> > Or do I have to convert them into BED format, if so, is there any tool that 
> > could do this ?
> > 
> > 
> > Thank you,
> > Warm Regards,
> > 
> > Nandini Badarinarayan
                                          
_______________________________________________
Genome maillist  -  [email protected]
https://lists.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to