Hi,

On 10 November 2010 14:39, Francesc Alted <fal...@pytables.org> wrote:
> A Wednesday 10 November 2010 12:08:34 Francesc Alted escrigué:
>> A Tuesday 09 November 2010 16:38:10 Francesc Alted escrigué:
>> > Hmm, this seems a problem with Blosc indeed.  Could you put the
>> > compressed datafile (for example, noaa18_2008_zlib3.h5) in a public
>> > place so that I can see what's going on?
>>
>> Thanks Gerrit for your sample file.  I've looked at the issue, but it
>> looks like a (complex) inefficiency in the Lempel-Ziv part of Blosc
>> (BloscLZ).  I've no time now to look into this, so I created a
>> ticket:
>>
>> http://blosc.pytables.org/trac/ticket/7
>>
>> Hope this can be addressed in the near future.
>
> Hmm, I realized what's going on.  The fix is in PyTables trunk now (or
> std-2.2 branch, if you prefer).  Can you have a try at it?

I tested for the reduced version below:

-rw-r--r-- 1 gerrit students 3.2M Wednesday 10-11-2010 15:36:20
noaa18_2008_blosc1_reduced_newversion.h5
-rw-r--r-- 1 gerrit students 3.2M Wednesday 10-11-2010 15:36:20
noaa18_2008_blosc2_reduced_newversion.h5
-rw-r--r-- 1 gerrit students 2.8M Wednesday 10-11-2010 15:36:20
noaa18_2008_blosc3_reduced_newversion.h5
-rw-r--r-- 1 gerrit students 670K Wednesday 10-11-2010 15:36:20
noaa18_2008_blosc4_reduced_newversion.h5
-rw-r--r-- 1 gerrit students 670K Wednesday 10-11-2010 15:36:20
noaa18_2008_blosc5_reduced_newversion.h5
-rw-r--r-- 1 gerrit students 636K Wednesday 10-11-2010 15:36:21
noaa18_2008_blosc6_reduced_newversion.h5
-rw-r--r-- 1 gerrit students 640K Wednesday 10-11-2010 15:36:21
noaa18_2008_blosc7_reduced_newversion.h5
-rw-r--r-- 1 gerrit students 638K Wednesday 10-11-2010 15:36:21
noaa18_2008_blosc8_reduced_newversion.h5
-rw-r--r-- 1 gerrit students 640K Wednesday 10-11-2010 15:36:21
noaa18_2008_blosc9_reduced_newversion.h5

7, 8 and 9 are still larger than 6, but now by a very slight amount.
Write speeds were similar, I didn't test read speeds, but usually
higher compressions are slower. So then maybe number 6 (as I'm using
now) is still optimal. Might this be expected?

One side note: I forgot to update numexpr before recompiling pytables,
and pytables only complained run-time, not compile-time. It would be
nice if pytables checked such essential dependencies compile- or
install-time.

Gerrit.

-- 
Gerrit Holl
PhD student at Department of Space Science, Luleå University of
Technology, Kiruna, Sweden
http://www.sat.ltu.se/members/gerrit/

------------------------------------------------------------------------------
The Next 800 Companies to Lead America's Growth: New Video Whitepaper
David G. Thomson, author of the best-selling book "Blueprint to a 
Billion" shares his insights and actions to help propel your 
business during the next growth cycle. Listen Now!
http://p.sf.net/sfu/SAP-dev2dev
_______________________________________________
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users

Reply via email to