Re: [petsc-dev] PETSc Meeting errata

Jakub Kruzik via petsc-dev Fri, 14 Jun 2019 12:54:39 -0700

The problem is that you need to write the file with an optimal stripecount/size in the first place. An unaware user who just uses somethinglike cp will end up with the default stripe count which is usually 1.

For large files, you should just set the stripe count to the number ofOSTs. Your results seem to support this.

For the small mesh and 64 nodes, you are reading just 2 MiB per process.I think that collective I/O should give you a significant improvement.

Also, it would be interesting to know what performance you get from asingle process reading from a single OST. I think you should be able toget 0.5-2.5 GiB/s which is what you are getting from 36 OSTs (~70 MiB/sper OST).

BTW, since you also used Salomon for testing, I found some old tests Idid there with pure MPI I/O, and I was able to get 18.5 GiB/s read for 1GiB file on 108 processes / 54 nodes, 54 OSTs, 4 MiB stripe.


Best,

Jakub


On 6/14/19 12:31 PM, Hapla Vaclav via petsc-dev wrote:

I take back one thing I mentioned in my talk in Atlanta. I think Isaid that Lustre striping does not really influence the readperformance. With my latest results in hand, I must point out this isnot true. I might have been confused by some former Piz Daint Lustreperformance issues and/or HDF5 library issues I mentioned.
Here are my latest slides from PASC19.
https://polybox.ethz.ch/index.php/s/PPZLSyZOKo3UXPS
On slide 18, there is some comparison for different stripe settings. Ican now see a speed-up of ~4 for 1 vs 12 stripes (which is actuallythe number of cores per node) for the mesh with 128M elements. Thetimes are very similar for 8 and 64 computation nodes.
Toby, could you maybe forward this message to the meeting attendees? Idon't want to leave anybody confused.
Thanks,
Vaclav

Re: [petsc-dev] PETSc Meeting errata

Reply via email to