Tom,

Thank you for your advice.  CSI indexes worked fine at performing the
merge.  TBI indexes from the merged file seem to load fine on JBrowse.

Cheers,
-Hans

On Tue, Aug 22, 2017 at 2:52 PM, Thomas W. Blackwell <tbla...@umich.edu>
wrote:

> Hans  -
>
> Read the man page for 'bcftools index'.  This has a brief description of
> the relative capabilities of .tbi versus .csi indexes.  (.csi is bigger.)
> Sounds as though you are hitting the tabix limit of 2^29 - 1.
>
>                                                 -  tom blackwell  -
>
>
> On Tue, 22 Aug 2017, Hans Vasquez-Gross wrote:
>
> Hello,
>>
>> I am trying to merge 24 different individual sample VCF files (currently
>> testing with only 3 samples) into one large VCF file to display on JBrowse
>> as a single track. However, after merging the files, it seems like all the
>> data stops at around 536 Mbp per psuedomolecule.
>>
>> Below is an example single sample VCF ending for a particular chromosome,
>> and then the merged vcf output.
>>
>> Individual File:
>> zgrep "chr7A" G3116_CSv1_calls_full_chr_length.vcf.gz | tail
>> chr7A   736692151       .       A       T       30.4183 .
>> DP=2;VDB=0.32;SGB=-0.453602;MQ0F=0;AC=2;AN=2;DP4=0,0,0,2;MQ=32  GT:PL
>> 1/1:60,6,0
>> chr7A   736692161       .       C       T       30.4183 .
>> DP=2;VDB=0.32;SGB=-0.453602;MQ0F=0;AC=2;AN=2;DP4=0,0,0,2;MQ=32  GT:PL
>> 1/1:60,6,0
>> chr7A   736692407       .       G       C       16.3433 .
>> DP=5;VDB=0.22;SGB=-0.453602;RPB=1;MQB=1;BQB=1;MQ0F=0;ICB=1;
>> HOB=0.5;AC=1;AN=2;DP4=1,0,2,0;MQ=40
>> GT:PL   0/1:49,0,32
>> chr7A   736692408       .       C       T       59      .
>> DP=5;VDB=0.0983484;SGB=-0.511536;MQ0F=0;AC=2;AN=2;DP4=0,0,3,0;MQ=40
>> GT:PL   1/1:89,9,0
>> chr7A   736692435       .       C       G       106     .
>> DP=7;VDB=0.189048;SGB=-0.556411;MQSB=1;MQ0F=0;AC=2;AN=2;DP4=0,0,3,1;MQ=37
>>    GT:PL   1/1:136,12,0
>> chr7A   736692492       .       G       A       106     .
>> DP=7;VDB=0.184775;SGB=-0.556411;MQSB=1;MQ0F=0;AC=2;AN=2;DP4=0,0,3,1;MQ=37
>>    GT:PL   1/1:136,12,0
>> chr7A   736692503       .       G       A       15.355  .
>> DP=7;SGB=-0.379885;RPB=1;MQB=1;MQSB=1;BQB=1;MQ0F=0;ICB=1;HOB
>> =0.5;AC=1;AN=2;DP4=2,1,1,0;MQ=37
>>   GT:PL   0/1:48,0,75
>> chr7A   736692507       .       T       C       42.2273 .
>> DP=7;VDB=0.460446;SGB=-0.511536;RPB=1;MQB=1;MQSB=1;BQB=1;
>> MQ0F=0;ICB=1;HOB=0.5;AC=1;AN=2;DP4=1,0,2,1;MQ=37
>>      GT:PL   0/1:75,0,48
>> chr7A   736692530       .       T       C       15.2759 .
>> DP=6;SGB=-0.379885;RPB=1;MQB=1;MQSB=0;BQB=1;MQ0F=0;ICB=1;HOB
>> =0.5;AC=1;AN=2;DP4=1,2,1,0;MQ=39
>>   GT:PL   0/1:48,0,82
>> chr7A   736692546       .       A       G       4.33255 .
>> DP=2;SGB=-0.379885;RPB=1;MQB=1;BQB=1;MQ0F=0;ICB=1;HOB=0.5;AC
>> =1;AN=2;DP4=0,1,0,1;MQ=44
>>  GT:PL   0/1:35,0,23
>>
>>
>> Merged File
>> zgrep "chr7A" test.vcf | tail
>> chr7A   536773061       .       T       C       225     .
>> VDB=0.185719;SGB=-0.511536;MQ0F=0;MQ=60;MQSB=1;DP=13;DP4=0,
>> 0,2,9;AN=4;AC=4
>>     GT:PL   1/1:102,9,0     1/1:255,24,0
>> chr7A   536773098       .       T       G       129     .
>> VDB=0.0243922;SGB=-0.616816;MQ0F=0;MQ=60;DP=7;DP4=0,0,0,6;AN=2;AC=2
>> GT:PL   ./.:.   1/1:159,18,0
>> chr7A   536773112       .       C       T       112     .
>> VDB=0.0256605;SGB=-0.590765;MQ0F=0;MQ=60;DP=6;DP4=0,0,0,5;AN=2;AC=2
>> GT:PL   ./.:.   1/1:142,15,0
>> chr7A   536773115       .       G       C       112     .
>> VDB=0.0256605;SGB=-0.590765;MQ0F=0;MQ=60;DP=5;DP4=0,0,0,5;AN=2;AC=2
>> GT:PL   ./.:.   1/1:142,15,0
>> chr7A   536773513       .       A       G       225     .
>> VDB=0.0795616;SGB=-0.651104;MQSB=0;MQ0F=0;MQ=51;DP=23;DP4=0,
>> 0,15,4;AN=4;AC=4
>>   GT:PL   1/1:211,24,0    1/1:255,33,0
>> chr7A   536773541       .       C       T       228     .
>> VDB=0.0660093;SGB=-0.662043;MQSB=0;MQ0F=0;MQ=52;RPB=1;MQB=1;
>> BQB=1;DP=30;DP4=0,1,18,4;AN=4;AC=4
>> GT:PL   1/1:244,27,0    1/1:255,1,0
>> chr7A   536773717       .       A       G       101     .
>> VDB=0.0860477;SGB=-0.511536;MQSB=1;MQ0F=0;MQ=52;DP=4;DP4=0,
>> 0,2,1;AN=2;AC=2
>>     GT:PL   1/1:131,9,0     ./.:.
>> chr7A   536773733       .       C       G       101     .
>> VDB=0.0822145;SGB=-0.511536;MQSB=1;MQ0F=0;MQ=52;DP=4;DP4=0,
>> 0,2,1;AN=2;AC=2
>>     GT:PL   1/1:131,9,0     ./.:.
>> chr7A   536773785       .       A       G       74      .
>> VDB=0.490096;SGB=-0.511536;MQSB=1;MQ0F=0;MQ=43;DP=3;DP4=0,0,1,2;AN=2;AC=2
>>    GT:PL   1/1:104,9,0     ./.:.
>> chr7A   536773788       .       T       G       74      .
>> VDB=0.470313;SGB=-0.511536;MQSB=1;MQ0F=0;MQ=43;DP=3;DP4=0,0,1,2;AN=2;AC=2
>>    GT:PL   1/1:104,9,0     ./.:.
>>
>>
>>
>> Is this a limitation of BCF tools, bgzip, or Tabix indexes?  I've tried
>> using vcftools too, but obtained the same result.  Are there any other
>> alternatives in order to merge this data together?
>>
>> Thank you for the help.
>>
>> Cheers,
>> -Hans
>>
>>


-- 
Hans Vasquez-Gross
<http://dubcovskylab.ucdavis.edu/lab-member/hans-vasquez-gross>
PhD Candidate
Dubcovsky Website - http://dubcovskylab.ucdavis.edu/
Department of Plant Science
University of California at Davis
Email: havasquezgr...@ucdavis.edu
Skype: hansvg.ucd
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Samtools-help mailing list
Samtools-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/samtools-help

Reply via email to