On 13/08/2019 05:01, Drew Parsons wrote:
On 2019-08-13 03:51, Steffen Möller wrote:
Hello,
There are a few data formats in bioinformatics that now depend on hdf5,
and h5py is used a lot. My main concern is that the user should not need
to configure anything, like a set of hostnames. And nothing should stall
while waiting to contact a server. MPI needs to be completely
transparent, and then I would very much like to see it.
MPI is generally good that way. The program runs directly as a
simple serial program if you run it on its own, so in that sense it
should be transparent to the user (i.e. you won't know it's MPI-enabled
unless you know to look for it). A multi-CPU job is launched by
running the program under mpirun (or mpiexec).
e.g. in the context of python and h5py, if you run
python3 -c 'import h5py'
then the job runs as a serial job, regardless of whether h5py is built
for hdf5-serial or hdf5-mpi.
If you want to run on 4 cpus, you launch the same program with
mpirun -n 4 python3 -c 'import h5py'
Then, if h5py is built against hdf5-mpi, it handles HDF5 I/O as a
multiprocessor job. If h5py here is built against hdf5-serial, it
simply runs the same serial job 4 times at the same time.
To reiterate, having h5py-mpi available is transparent to a user
interacting with HDF5 as a serial library. It doesn't break serial use;
it just adds the capability to also run multi-CPU jobs.
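To make the distinction concrete, here is a minimal sketch of what a
parallel write looks like from the h5py side. It assumes h5py was built
against hdf5-mpi and that mpi4py is installed; the filename and dataset
name are made up for illustration, and the sketch falls back to doing
nothing when parallel support is missing, so the same script still runs
serially:

```python
# Sketch: collective HDF5 write via h5py's 'mpio' driver.
# Launch with e.g.:  mpirun -n 4 python3 write_parallel.py
try:
    from mpi4py import MPI
    import h5py
    # A serial h5py build has no 'mpio' driver registered.
    HAVE_PARALLEL = "mpio" in h5py.registered_drivers()
except ImportError:
    HAVE_PARALLEL = False  # serial-only environment; sketch is a no-op

if HAVE_PARALLEL:
    comm = MPI.COMM_WORLD
    # All ranks open the same file collectively...
    with h5py.File("parallel.h5", "w", driver="mpio", comm=comm) as f:
        dset = f.create_dataset("ranks", (comm.size,), dtype="i")
        dset[comm.rank] = comm.rank  # ...and each writes its own slot
```

Run without mpirun, the same script behaves as a one-process serial job,
which is exactly the transparency described above.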
I'd go with this policy in general: codes available in both serial and
MPI builds should probably be shipped MPI-enabled by default.
The main reasons not to do so are normally "it drags in MPI" and "it's
painful to build", but these are arguments against an end-user having to
build all the software themselves; the advantage of Debian is that the
stack is available for free :-). Typically the space taken by the MPI
libraries is not an issue.
At the moment the main exception is NetCDF: serial and parallel NetCDF
have orthogonal features. The MPI version provides parallelism, but only
the serial version provides compression on I/O (because parallel I/O
writes happen on byte ranges via POSIX). This is changing, though (I'm
not sure of the timetable); in future a parallel version with the full
feature set is expected.
How do autotests work for MPI?
We simply configure the test script to invoke the same tests using
mpirun.
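As a hypothetical sketch (standard-library Python only) of what such a
test script amounts to, the harness runs the same command twice, once
serially and once under whatever MPI launcher is on PATH; the trivial
test command here is a stand-in for the package's real test suite:

```python
import shutil
import subprocess
import sys

# Stand-in test command; a real autopkgtest would run the package's
# actual tests (e.g. importing h5py and exercising it) instead.
TEST_CMD = [sys.executable, "-c", "print('ok')"]

def run_serial():
    """Run the test once as a plain serial process."""
    subprocess.run(TEST_CMD, check=True)

def run_under_mpi(nprocs=4):
    """Run the same test under mpirun/mpiexec, if one is installed."""
    launcher = shutil.which("mpirun") or shutil.which("mpiexec")
    if launcher is None:
        return False  # no MPI launcher available; nothing to do
    subprocess.run([launcher, "-n", str(nprocs)] + TEST_CMD, check=True)
    return True
```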
This is a bigger issue. We have test suites that test MPI features
without checking MPI processor counts (e.g. the Magics/Metview code). One
workaround is to enable oversubscription so the test can run
(inefficiently), though suites that use MPI should really detect the
available resources and disable such tests when they are not found. We
will always have features in our codes that our build/test systems
aren't capable of testing: e.g. PMIx is designed to scale to more than
100,000 cores. We can't test that :-)
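On the resource-detection point, one lightweight approach is to read the
rank count the launcher exports to each process and skip MPI-only tests
when it is too small. This is a sketch; the environment variable names
are launcher-specific (Open MPI and MPICH's Hydra here), so a robust
suite would query mpi4py instead when available:

```python
import os

def mpi_world_size(environ=None):
    """Best-effort count of the MPI ranks this process was launched with.

    Open MPI exports OMPI_COMM_WORLD_SIZE to each rank; MPICH's Hydra
    launcher exports PMI_SIZE.  When neither is present, assume the
    program was started outside mpirun, i.e. as one serial process.
    """
    environ = os.environ if environ is None else environ
    for var in ("OMPI_COMM_WORLD_SIZE", "PMI_SIZE"):
        if var in environ:
            return int(environ[var])
    return 1

# A test suite could then guard its MPI-only cases, e.g.:
#   if mpi_world_size() < 2:
#       skip("needs at least 2 MPI ranks")
```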
Drew
Alastair
--
Alastair McKinstry, <[email protected]>, <[email protected]>,
https://diaspora.sceal.ie/u/amckinstry
Misentropy: doubting that the Universe is becoming more disordered.