On Monday 13 December 2010 14:56:26 Dominik Szczerba wrote:
> > But to know whether accessing columns is efficient in your
> > case, I'd need more info on your datasets.  Are they contiguous or
> > chunked? If chunked, which chunkshape have you chosen?
> 
> Both. Files saved from matlab are uncompressed/contiguous, the ones
> saved from my program are usually compressed/chunked and the size is
> around 1024^2/sizeof(type).

Well, for PyTables (or any C application) and contiguous datasets, 
accessing data by columns is inefficient: data is laid out row by row, 
so the privileged direction for performance is along rows.
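
To make this concrete, here is a minimal sketch in plain Python (not 
PyTables code) of where the elements of one row and of one column land 
in a C-ordered, contiguous 2-D dataset. The function names, the 3x4 
shape and the 4-byte item size are assumptions chosen for illustration.

```python
def row_offsets(row, ncols, itemsize):
    """Byte offsets of every element of one row in a C-ordered 2-D array."""
    start = row * ncols * itemsize
    return [start + j * itemsize for j in range(ncols)]


def column_offsets(col, nrows, ncols, itemsize):
    """Byte offsets of every element of one column in a C-ordered 2-D array."""
    return [(i * ncols + col) * itemsize for i in range(nrows)]


# Hypothetical 3x4 dataset of 4-byte items.
nrows, ncols, itemsize = 3, 4, 4

row = row_offsets(1, ncols, itemsize)            # adjacent bytes: one sequential read
col = column_offsets(1, nrows, ncols, itemsize)  # one jump of ncols*itemsize per element

print(row)  # [16, 20, 24, 28]
print(col)  # [4, 20, 36]
```

A row sits in one contiguous run of bytes, while consecutive elements 
of a column are separated by a full row, so for wide datasets each 
column element costs a separate seek.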

For the chunked datasets, you only provided the chunk size for one of 
the dimensions, which I assume is the last one. If so, column access 
will again be inefficient for a C-based app.
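
As a rough sketch of why, the following plain-Python snippet counts how 
many chunks a selection overlaps; HDF5 reads (and decompresses) whole 
chunks, so this count drives the cost. The dataset shape is 
hypothetical, and the chunkshape assumes my guess above (chunks laid 
along the last dimension) with a 4-byte type, i.e. 1024^2/4 elements.

```python
import math


def chunks_touched(sel_shape, chunkshape):
    """Chunks overlapped by a selection that starts on a chunk boundary."""
    count = 1
    for sel, chunk in zip(sel_shape, chunkshape):
        count *= math.ceil(sel / chunk)
    return count


# Assumed layout: 4-byte items, chunks of 1024**2 / 4 elements along the last axis.
chunk_len = 1024**2 // 4
shape = (1000, chunk_len)      # hypothetical dataset shape
chunkshape = (1, chunk_len)

row_chunks = chunks_touched((1, shape[1]), chunkshape)      # one full row
column_chunks = chunks_touched((shape[0], 1), chunkshape)   # one full column

print(row_chunks)     # 1
print(column_chunks)  # 1000
```

Under these assumptions, reading one column means fetching 1000 chunks 
of 1 MB each to extract 1000 values, whereas a row comes from a single 
chunk.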

Hope this helps,

-- 
Francesc Alted

_______________________________________________
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users
