[Dmtcp-forum] Performance of Restarted MPI applications under DMTCP

Cyrill Burth Mon, 29 Aug 2022 01:06:08 -0700

Hi,

I was working the last few weeks with DMTCP and made some performancebenchmarks. Therefore I have used the NPB 3.4.2 BT - MPI benchmark [1]at the Taurus Supercomputer at the TU Dresden always with 16 MPI ranksand gzip disabled.

I have realized that if I would restart an application from itscheckpoint it would (drastically) slow down compared to before thecheckpoint, I will refer to this as phenomena as "restart penalty".

I will describe shortly my methodology: I have performed an checkpointin the 20th iteration and if I took the time before restart from the21st to last iteration of the benchmark it would be between 25% to 45%less then when I did the same after restarting from the checkpoint inthe 20th iteration. I verified this with the MPI benchmark (25%-45%"restart penalty") as well as with the OpenMP benchmark (consistent 15%restart penalty) which is also provided by NPB under [1]. I ran alltests multiple times on multiple nodes and all of them yielded the sameresults. To compile and run the benchmark I have used the intel/2019btoolchain, since I had some compatibility issues with newer versions.I have repeated the tests with application initiated checkpointing aswell as with the "-i" option, without modifying the benchmarks sourcecode. Both yielded the same results.

However the reason I am contacting you is since I have not only realizedthe behavior described above but also that the "restart penalty" seemsto scale with the speed of the used filesystem at least when using MPI.If I would restart from our relatively slow local SSDs, I have seen a"restart penalty" of roughly 45%, however if I restarted the samecheckpoint from a RAM disk, I would only see a "restart penalty" of 25%.This could only be seen when using the MPI version of the benchmark, forthe OpenMP version there was seen a "restart penalty" of 15%, but itwould not scale with the used filesystem.

I was wondering if anyone could give me any insights that could explainthis behavior.

The restart times themselves obviously go up when the slower filesystemis used, but this was to be expected, however it appears rather odd thatthe performance after restart depends on the filesystem used forrestart. Some further research showed that every single iteration of thebenchmark gets slowed down. It is *not* the case that some iterationstake significantly longer than others.There were no further checkpoints taken except for the very first one inthe 20th iteration from which I have restarted and which was excludedfrom the time measurements.



Thank you very much in advance.


Best regards,

C. Burth


[1] https://www.nas.nasa.gov/software/npb.html

_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum

[Dmtcp-forum] Performance of Restarted MPI applications under DMTCP

Reply via email to