Dear mailing list,

Thank you for developing and supporting DMTCP.
I am trying to use it to checkpoint a multi-process application. My application works as follows:

- the main process spawns children via fork/exec
- the children communicate through segments of shared memory

In order to use DMTCP, I modified the main process as follows:

- it begins by starting a coordinator
- instead of execve-ing the executable foo, it executes dmtcp_launch foo

One of the processes triggers the checkpoint with dmtcp_checkpoint(), and a set of files is created.

I am a bit confused by what happens when I try to restart my application with dmtcp_restart_script.sh. I get the following output and my program is stuck:

$ ./dmtcp_restart_script.sh
[6522] mtcp_restart.c:404 restore_brk: error: new/current break (0x555555762000) != saved break (0x55c9a545f000)
[6520] mtcp_restart.c:404 restore_brk: error: new/current break (0x555555762000) != saved break (0x56008807e000)
[6524] mtcp_restart.c:404 restore_brk: error: new/current break (0x555555762000) != saved break (0x55c08be3b000)
[6505] mtcp_restart.c:404 restore_brk: error: new/current break (0x555555762000) != saved break (0x55ef77669000)
*** dmtcp_checkpoint: This program is now restarting.

Is there anything I am doing incorrectly?

Thanks,
Camille

_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
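
For readers following the thread, the launch step described in the message above (forking, then executing dmtcp_launch foo instead of foo) can be sketched roughly as below. This is a minimal illustration, not the poster's actual code: the helper names build_launch_argv and spawn_under_dmtcp are hypothetical, and the sketch assumes that dmtcp_launch is on the PATH and that a coordinator has already been started and is reachable with default settings.

```c
#include <stddef.h>
#include <stdio.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper: fill argv_out with {"dmtcp_launch", child_path, NULL}.
 * Returns the number of arguments (excluding the terminating NULL),
 * or -1 if the inputs are invalid or the array is too small.
 */
int build_launch_argv(const char *child_path, const char **argv_out,
                      size_t capacity)
{
    if (child_path == NULL || argv_out == NULL || capacity < 3) {
        return -1;
    }
    argv_out[0] = "dmtcp_launch"; /* wrapper named in the message */
    argv_out[1] = child_path;     /* e.g. "./foo" (placeholder) */
    argv_out[2] = NULL;
    return 2;
}

/*
 * Hypothetical helper: fork, then exec the child under dmtcp_launch.
 * Assumes a coordinator is already running. Returns the child's pid
 * in the parent, or -1 on failure.
 */
pid_t spawn_under_dmtcp(const char *child_path)
{
    const char *argv[3];
    if (build_launch_argv(child_path, argv, 3) < 0) {
        return -1;
    }

    pid_t pid = fork();
    if (pid == 0) {
        /* Child: replace the process image with dmtcp_launch <child_path>. */
        execvp(argv[0], (char *const *)argv);
        perror("execvp dmtcp_launch"); /* reached only if exec failed */
        _exit(127);
    }
    return pid; /* parent: child's pid, or -1 if fork failed */
}
```

In this sketch the main process would call spawn_under_dmtcp("./foo") once per child and later reap each child with waitpid().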