Hi,

I am using DMTCP to checkpoint a long-running MCMC program because the cluster I use has a runtime limit of 48 h. After successfully checkpointing and restarting the program 21 times, I now get the following warning while checkpointing the 22nd run:

[40000] WARNING at procselfmaps.cpp:101 in ~ProcSelfMaps; REASON='JWARNING(numAllocExpands == jalib::JAllocDispatcher::numExpands()) failed'
      numAllocExpands = 10
      jalib::JAllocDispatcher::numExpands() = 11
Message: JAlloc: memory expanded through call to mmap(). Inconsistent JAlloc will be a problem on restart

I do not fully understand what the problem is here. Could it be related to the large amount of memory needed to checkpoint an MCMC run that has already been running for about 40 days? Does anyone know how to fix the issue?

Kind regards and thanks,
Tobias



_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
