Hi,
I am using dmtcp to checkpoint a MCMC program with long run times
because the cluster I use has a runtime limit of 48 h. After
successfully checkpointing and restarting the program 21 times, I now
get the following message during checkpointing of the 22nd run:
[40000] WARNING at procselfmaps.cpp:101 in ~ProcSelfMaps;
REASON='JWARNING(numAllocExpands ==
jalib::JAllocDispatcher::numExpands()) failed'
numAllocExpands = 10
jalib::JAllocDispatcher::numExpands() = 11
Message: JAlloc: memory expanded through call to mmap(). Inconsistent
JAlloc will be a problem on restart
I am not completely understanding what is the problem here. It is
probably related to the large amount of memory required to checkpoint a
MCMC run that has already ran for about 40 days? Does anyone know how to
fix the issue?
Kind regards and thanks,
Tobias
_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum