Strange... The correct URL is
http://biodb.bioinformatics.ucla.edu/GENOMES/ponAbe2/ponAbe2.gz The URL you
used does not exist, so the server returns a 404 error page (an HTML document).
Hmm... I have never downloaded and built hg18_multiz44way via XMLRPC. I will
try that...
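
For anyone who hits the same ReadError, one rough way to catch it before
anything tries to untar the download is to look at the first bytes of the
file. This is only a sketch under my own assumptions: sniff_download and demo
are names made up for illustration (they are not part of pygr), the HTML check
is a simple heuristic, and the code targets a current Python 3 rather than the
2.6 install in the traceback.

```python
import gzip
import os
import tempfile

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream (RFC 1952)


def sniff_download(path):
    """Return 'gzip', 'html', or 'unknown' based on the first bytes of the file."""
    with open(path, "rb") as handle:
        head = handle.read(512)
    if head.startswith(GZIP_MAGIC):
        return "gzip"
    # A 404 page saved under a .tar.gz name usually starts with an HTML tag.
    text = head.lstrip().lower()
    if text.startswith(b"<!doctype html") or text.startswith(b"<html"):
        return "html"
    return "unknown"


def demo():
    """Build one real gzip file and one fake 'archive' holding an error page."""
    tmpdir = tempfile.mkdtemp()
    good = os.path.join(tmpdir, "good.gz")
    bad = os.path.join(tmpdir, "ponAbe2.tar.gz")
    with gzip.open(good, "wb") as handle:
        handle.write(b">chr1\nACGT\n")
    with open(bad, "wb") as handle:
        handle.write(b"<html><body>404 Not Found</body></html>")
    return sniff_download(good), sniff_download(bad)


if __name__ == "__main__":
    print(demo())
```

If sniff_download reports 'html' for a file that should be an archive, the
URL is probably wrong and the file is worth opening in a text editor to read
the server's error message.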

Thanks,
Namshin Kim


On Thu, Sep 3, 2009 at 6:54 AM, Paul Rigor (gmail) <paulri...@gmail.com> wrote:

> Hi Namshin,
> Downloading the 44way alignment was successful.  However, the persistent
> data (.pygrdata) seems to be unusable.  The metabase lists Bio.MSA, etc.,
> but the resource cannot be loaded.
>
> Also, I've attempted to download the genomes from the UCLA metabase, but a
> genome might be corrupt.  Specifically,
>
> http://biodb.bioinformatics.ucla.edu/GENOMES/ponAbe2/chromFa.tar.gz
>
> which gives the error message below.  In fact, the file that is
> downloaded (ponAbe2.tar.gz) is an HTML document!
>
> $ file ponAbe2.tar.gz
> ponAbe2.tar.gz: HTML document text
>
>
> ....[error trace below]
> ....
> /home/dock/shared_libraries/lx64/pkgs/pythonsandbox/2.6.2/lib/python2.6/site-packages/pygr-0.8.0.beta1-py2.6-linux-x86_64.egg/pygr/downloader.pyc
> in do_untar(filepath, mode, newpath, singleFile, **kwargs)
>     105         newpath = filepath + '.out'
>     106     import tarfile
> --> 107     t = tarfile.open(filepath, mode)
>     108     try:
>     109         if singleFile: # extract to a single file
>
> /home/dock/shared_libraries/lx64/pkgs/pythonsandbox/2.6.2/lib/python2.6/tarfile.pyc
> in open(cls, name, mode, fileobj, bufsize, **kwargs)
>    1662             else:
>    1663                 raise CompressionError("unknown compression type %r" % comptype)
> -> 1664             return func(name, filemode, fileobj, **kwargs)
>    1665
>    1666         elif "|" in mode:
>
> /home/dock/shared_libraries/lx64/pkgs/pythonsandbox/2.6.2/lib/python2.6/tarfile.pyc
> in gzopen(cls, name, mode, fileobj, compresslevel, **kwargs)
>    1713                 **kwargs)
>    1714         except IOError:
> -> 1715             raise ReadError("not a gzip file")
>    1716         t._extfileobj = False
>    1717         return t
>
> ReadError: not a gzip file
>
>
>
>
> On Tue, Sep 1, 2009 at 9:55 PM, Paul Rigor (gmail) <paulri...@gmail.com> wrote:
>
>> Well, we have time, storage and bandwidth =)
>> I'll let you know how it goes.  Maybe we can host an XMLRPC mirror someday
>> too.
>>
>> Thanks,
>> Paul
>>
>>
>> On Tue, Sep 1, 2009 at 9:41 PM, Namshin Kim <deepr...@gmail.com> wrote:
>>
>>> Hi Paul,
>>> I just checked the size of hg18_multiz44way and it is 167GB for the
>>> NLMSA alone. If we include genome assemblies you may not have, it would be
>>> ~250GB. I think it would take a long time to download all the files.
>>>
>>> --
>>> Namshin Kim
>>>
>>>
>>>
>>> On Wed, Sep 2, 2009 at 1:33 PM, Paul Rigor (gmail)
>>> <paulri...@gmail.com> wrote:
>>>
>>>>
>>>> Hi Namshin,
>>>> I'm running this overnight =)  Has anyone successfully pulled and used
>>>> this alignment?
>>>>
>>>> Thanks,
>>>> Paul
>>>>
>>>> On Sun, Aug 2, 2009 at 4:40 PM, Namshin Kim <deepr...@gmail.com> wrote:
>>>>
>>>>> The downloadable resources are now available on the biodb2 XMLRPC server.
>>>>>
>>>>> There are two ways to build the NLMSA.
>>>>>
>>>>> 1. metabase
>>>>>
>>>>> >>> import os
>>>>> >>> os.environ['WORLDBASEPATH'] = '.,http://biodb2.bioinformatics.ucla.edu:5000'
>>>>> >>> from pygr import metabase
>>>>> >>> mdb = metabase.MetabaseList()
>>>>> >>> hg18 = mdb('Bio.MSA.UCSC.hg18_multiz44way',download=True)
>>>>>
>>>>> 2. from text files
>>>>>
>>>>> Download the text files from
>>>>> http://biodb.bioinformatics.ucla.edu/PYGRDATA/
>>>>> and use the cnestedlist.textfile_to_binaries('hg18_multiz44way') function
>>>>> to convert them from text to binaries.
>>>>>
>>>>> If you want to see the script used to add these resources, visit this
>>>>> URL.
>>>>>
>>>>>
>>>>> http://github.com/deepreds/pygr/tree/d7ab9247dcd39b7d474029cb8749a53eb8582968/tests/biodb2_update
>>>>>
>>>>>
>>>>
>>>>
>>>
>>>
>>>
>>
>>
>> --
>> Paul Rigor
>> Graduate Student
>> Institute for Genomics and Bioinformatics
>> Donald Bren School of Information and Computer Sciences
>> University of California, Irvine
>> http://www.paulrigor.net/
>> http://www.ics.uci.edu/~prigor
>>
>
