Hi Cyrill,
I remember having similarly looking problem with CRIU [1]. Try to take a look
at performance counters.
https://github.com/checkpoint-restore/criu/issues/1171
On 8/29/22 09:47, Cyrill Burth wrote:
Hi,
I was working the last few weeks with DMTCP and made some performance benchmarks. Therefore I have used the NPB 3.4.2 BT
- MPI benchmark [1] at the Taurus Supercomputer at the TU Dresden always with 16 MPI ranks and gzip disabled.
I have realized that if I would restart an application from its checkpoint it would (drastically) slow down compared to
before the checkpoint, I will refer to this as phenomena as "restart penalty".
I will describe shortly my methodology: I have performed an checkpoint in the 20th iteration and if I took the time
before restart from the 21st to last iteration of the benchmark it would be between 25% to 45% less then when I did the
same after restarting from the checkpoint in the 20th iteration. I verified this with the MPI benchmark (25%-45%
"restart penalty") as well as with the OpenMP benchmark (consistent 15% restart penalty) which is also provided by NPB
under [1]. I ran all tests multiple times on multiple nodes and all of them yielded the same results. To compile and run
the benchmark I have used the intel/2019b toolchain, since I had some compatibility issues with newer versions.
I have repeated the tests with application initiated checkpointing as well as with the "-i" option, without modifying
the benchmarks source code. Both yielded the same results.
However the reason I am contacting you is since I have not only realized the behavior described above but also that the
"restart penalty" seems to scale with the speed of the used filesystem at least when using MPI. If I would restart from
our relatively slow local SSDs, I have seen a "restart penalty" of roughly 45%, however if I restarted the same
checkpoint from a RAM disk, I would only see a "restart penalty" of 25%. This could only be seen when using the MPI
version of the benchmark, for the OpenMP version there was seen a "restart penalty" of 15%, but it would not scale with
the used filesystem.
I was wondering if anyone could give me any insights that could explain this
behavior.
The restart times themselves obviously go up when the slower filesystem is used, but this was to be expected, however it
appears rather odd that the performance after restart depends on the filesystem used for restart. Some further research
showed that every single iteration of the benchmark gets slowed down. It is *not* the case that some iterations take
significantly longer than others.
There were no further checkpoints taken except for the very first one in the 20th iteration from which I have restarted
and which was excluded from the time measurements.
Thank you very much in advance.
Best regards,
C. Burth
[1] https://www.nas.nasa.gov/software/npb.html
_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
--
Regards,
Maksym Planeta
_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum