Hi Pouya,

Thank you for your interest in the UCSC Genome Browser. We are currently 
working on answering your question and hope to have a reply to you soon.

Thank you,
Mary
---------------------
Mary Goldman
UCSC Bioinformatics Group


On 8/24/10 2:08 PM, Pouya Kheradpour wrote:
> My goal is to have both wgEncodeGencodeManualV4 and
> wgEncodeGencodeAutoV4 in GenePred format.
>
> I tried to download the wgEncodeGencodeManualV4 table from the test
> browser. For some reason when downloading it gets stuck after
> downloading and the file is cutoff (chromosomes 17-22 are completely
> missing, chromosome 16 is there partially; after exactly 409600b =
> 400kb). This happens reproducibly across multiple
> computers/networks/operating systems. I also want the
> wgEncodeGencodeAutoV4, which appears to download ok.
>
> I have had this sort of problem before (where downloads from the table
> browser would get stuck). I am not sure what causes them. Is there a url
> from which the data can always be downloaded in flat files?
>
> I can also download the gtf version of these files from:
>
> http://hgdownload-test.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeGencode/wgEncodeGencode{Auto,Manual}V4.gtf.gz
>
> But when I try to convert the AutoV4 file I get several errors:
>
> gunzip -c wgEncodeGencodeAutoV4.gtf.gz | gtfToGenePred -allErrors
> -genePredExt /dev/stdin /dev/stdout
>
> ... [snip]
> no exons defined for 93876
> no exons defined for 93875
> no exons defined for 93874
> no exons defined for 115098
> no exons defined for 27940
> no exons defined for 29602
> no exons defined for 29603
> no exons defined for 10879
> 622 errors
>
> and these genes are missing from the final output (although they are
> present in the wgEncodeGencodeAutoV4 I download from the test browser).
>
> I was wondering what the command used to convert the gtf files above to
> GenePred actually was. Also, can the table browser be repaired?
>
> Right now I am using wgEncodeGencodeAutoV4 from the table browser and
> wgEncodeGencodeManualV4 converted with gtfToGenePred, but it would be
> nice to have a more consistent way to set it all up.
>
> Thanks,
> Pouya
>
> _______________________________________________
> Genome maillist  -  Genome@lists.soe.ucsc.edu
> https://lists.soe.ucsc.edu/mailman/listinfo/genome
>    
_______________________________________________
Genome maillist  -  Genome@lists.soe.ucsc.edu
https://lists.soe.ucsc.edu/mailman/listinfo/genome

Reply via email to