Hi Namshim,
Downloading the 44way alignment was successful.  However, the persistend
data (.pygrdata) seems to be unworkable.  The metabase lists Bio.MSA, etc,
but it cannot be loaded.

Also, I've attempted to download the genomes from the UCLA metabase, but a
genome might be corrupt.  Specifically,

http://biodb.bioinformatics.ucla.edu/GENOMES/ponAbe2/chromFa.tar.gz

which gives the error message below.  In fact, checking the file that is
downloaded (ponAbe2.tar.gz), is an HTML document!

$ file ponAbe2.tar.gz
ponAbe2.tar.gz: HTML document text


....[error trace below]
....
/home/dock/shared_libraries/lx64/pkgs/pythonsandbox/2.6.2/lib/python2.6/site-packages/pygr-0.8.0.beta1-py2.6-linux-x86_64.egg/pygr/downloader.pyc
in do_untar(filepath, mode, newpath, singleFile, **kwargs)
    105         newpath = filepath + '.out'
    106     import tarfile
--> 107     t = tarfile.open(filepath, mode)
    108     try:
    109         if singleFile: # extract to a single file

/home/dock/shared_libraries/lx64/pkgs/pythonsandbox/2.6.2/lib/python2.6/tarfile.pyc
in open(cls, name, mode, fileobj, bufsize, **kwargs)
   1662             else:
   1663                 raise CompressionError("unknown compression type %r"
% comptype)
-> 1664             return func(name, filemode, fileobj, **kwargs)
   1665
   1666         elif "|" in mode:

/home/dock/shared_libraries/lx64/pkgs/pythonsandbox/2.6.2/lib/python2.6/tarfile.pyc
in gzopen(cls, name, mode, fileobj, compresslevel, **kwargs)
   1713                 **kwargs)
   1714         except IOError:
-> 1715             raise ReadError("not a gzip file")
   1716         t._extfileobj = False
   1717         return t

ReadError: not a gzip file




On Tue, Sep 1, 2009 at 9:55 PM, Paul Rigor (gmail) <paulri...@gmail.com>wrote:

> Well, we have time, storage and bandwidth =)
> I'll let you know how it goes?  Maybe we can host an XMLRPC mirror someday
> too.
>
> Thanks,
> Paul
>
>
> On Tue, Sep 1, 2009 at 9:41 PM, Namshin Kim <deepr...@gmail.com> wrote:
>
>> Hi Paul,
>> I just checked the size of hg18_multiz44way and it is 167GB for just
>> NLMSA. If we consider genome assemblies you may not have, it would be ~
>> 250GB. I think it would take a long time to download all files.
>>
>> --
>> Namshin Kim
>>
>>
>>
>> On Wed, Sep 2, 2009 at 1:33 PM, Paul Rigor (gmail) 
>> <paulri...@gmail.com>wrote:
>>
>>>
>>> Hi Namshin,
>>> I'm running this over night =)  Has anyone successfully pulled and used
>>> this alignment?
>>>
>>> Thanks,
>>> Paul
>>>
>>> On Sun, Aug 2, 2009 at 4:40 PM, Namshin Kim <deepr...@gmail.com> wrote:
>>>
>>>> Now the downloadable resources are available on biodb2 XMLRPC server.
>>>>
>>>> Two ways to build NLMSA.
>>>>
>>>> 1. metabase
>>>>
>>>> >>> import os
>>>> >>> os.environ['WORLDBASEPATH'] = '.,
>>>> http://biodb2.bioinformatics.ucla.edu:5000'
>>>> >>> from pygr import metabase
>>>> >>> mdb = metabase.MetabaseList()
>>>> >>> hg18 = mdb('Bio.MSA.UCSC.hg18_multiz44way',download=True)
>>>>
>>>> 2. from text files
>>>>
>>>> download text files from http://biodb.bioinformatics.ucla.edu/PYGRDATA/
>>>> use cnestedlist.textfile_to_binaries('hg18_multiz44way') function to
>>>> convert from text to binaries
>>>>
>>>> If you want to see the script used to add these resources, visit this
>>>> URL.
>>>>
>>>>
>>>> http://github.com/deepreds/pygr/tree/d7ab9247dcd39b7d474029cb8749a53eb8582968/tests/biodb2_update
>>>>
>>>>
>>>
>>>
>>
>> >>
>>
>
>
> --
> Paul Rigor
> Graduate Student
> Institute for Genomics and Bioinformatics
> Donald Bren School of Information and Computer Sciences
> University of California, Irvine
> http://www.paulrigor.net/
> http://www.ics.uci.edu/~prigor
>



-- 
Paul Rigor
Graduate Student
Institute for Genomics and Bioinformatics
Donald Bren School of Information and Computer Sciences
University of California, Irvine
http://www.paulrigor.net/
http://www.ics.uci.edu/~prigor

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"pygr-dev" group.
To post to this group, send email to pygr-dev@googlegroups.com
To unsubscribe from this group, send email to 
pygr-dev+unsubscr...@googlegroups.com
For more options, visit this group at 
http://groups.google.com/group/pygr-dev?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to