On 05/27/2014 02:45 PM, Quincey Koziol wrote:
On May 27, 2014, at 2:43 PM, Dana Robinson <[email protected]
<mailto:[email protected]>> wrote:
Hi,
The HDF5 library does not support asynchronous I/O at this time. We
are looking into including async I/O support in a future release, however.
I've attached a document that describes our current ideas in this space.
Good read. Just how compute bound is HDF5, anyway? I'm always living
in a land of large datasets, where library overhead is dwarfed by the
I/O workload overhead.
you did not mention the multi-dataset I/O approach: it's a half-step
towards asynchronism -- or maybe a half-step backwards -- in that
instead of decoupling the description of the data with the execution of
the data, HDF5's multi-dataset routines will describe more data in a
single call.
I don't think the global HDF5 lock precludes an async approach.
Probably this async facility should exist on top of HDF5, though, and
can provide the caching, read-ahead, coalescing, and other benefits
while leaving the bulk of the 300k lines of C code untouched. In my
head it's MPI_THREAD_FUNELED for HDF5.
The various ways one can manage MPI progress are instructive here.
==rob
Quincey
_______________________________________________
Hdf-forum is for HDF software users discussion.
[email protected]
http://mail.lists.hdfgroup.org/mailman/listinfo/hdf-forum_lists.hdfgroup.org
Twitter: https://twitter.com/hdf5
--
Rob Latham
Mathematics and Computer Science Division
Argonne National Lab, IL USA
_______________________________________________
Hdf-forum is for HDF software users discussion.
[email protected]
http://mail.lists.hdfgroup.org/mailman/listinfo/hdf-forum_lists.hdfgroup.org
Twitter: https://twitter.com/hdf5