Dear DMTCP team -- Notwithstanding the bug reported quite some time ago (how's the fix coming? ), I have no encountered a different failure to restart:
[47995] mtcp_restart.c:1003 read_shared_memory_area_from_file: mapping /tmp/hsperfdata_moss/45000 with data from ckpt image [47995] mtcp_restart.c:1296 open_shared_file: unable to create file /usr/lib/jvm/java-1.7.0-openjdk-1.7.0.79-2.5.5.2.el7_1.x86_64/jre/lib/ext/pulse-java.jar This causes a core dump. I'm not sure what to do -- my queue is currently limited to two weeks of execution time and this job timed out at the two weeks, and with this limitation apparently cannot be restarted ... Regards -- Eliot Moss ------------------------------------------------------------------------------ _______________________________________________ Dmtcp-forum mailing list Dmtcp-forum@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dmtcp-forum