You may find some of the chunking discussions in this paper of interest:
http://www.hdfgroup.org/pubs/papers/2008-06_netcdf4_perf_report.pdf
in particular, section 3.2 and port6ions of sections 4 & 5.
On Sep 27, 2010, at 4:19 PM, John Knutson wrote:
> Setting aside the strange sizing issues in the earlier messages for a
> moment...
>
> Let's say I have a data set, dimensioned ( 26, 160, 1051200 )
> and chunked ( 1, 15, 240 )
>
> As I understand it, each individual chunk in the file will be in the
> following order:
> [ 0, 0, 0-239 ] - [ 0, 14, 0-239 ]
>
> and the chunks will be ordered thus:
> [ 0, 0, 0 ], [ 0, 0, 240 ] ... [ 0, 0, 1051200 ], [ 0, 15, 0 ], [ 0, 15, 240
> ] ... [ 0, 15, 1051200 ]
> and so on...
>
> Is that correct?
Chunks are not necessarily ordered on the disk, so the sequence in which you
read the chunks shouldn't impact performance.
>
> Should I expect peak read performance by reading one chunk at a time in that
> order, assuming each chunk is 1MB in size, as is the cache?
>
> I notice there are functions for examining the hit % of the metadata cache...
> any chance of equivalent functions for the raw data chunk cache?
>
>
>
> _______________________________________________
> Hdf-forum is for HDF software users discussion.
> [email protected]
> http://mail.hdfgroup.org/mailman/listinfo/hdf-forum_hdfgroup.org
_______________________________________________
Hdf-forum is for HDF software users discussion.
[email protected]
http://mail.hdfgroup.org/mailman/listinfo/hdf-forum_hdfgroup.org