Dear mailing list,

Thank you for developing and supporting DMTCP.

I am trying to use it to checkpoint a multi-process application. My
application works as follows:
- the main process spawns children via fork/exec
- the children communicate through segments of shared memory

In order to use DMTCP, I modified the main process as follows:
- it starts by starting a coordinator
- instead of execve-ing the executable foo, it executes dmtcp_launch foo.

One of the processes triggers the checkpoint with dmtcp_checkpoint() and
I get a set of files created.

I am a bit confused when I try to restart my application with
dmtcp_restart_script.sh: I get the following output and my program is stuck

$ ./dmtcp_restart_script.sh
[6522] mtcp_restart.c:404 restore_brk:
  error: new/current break (0x555555762000) != saved break (0x55c9a545f000)
[6520] mtcp_restart.c:404 restore_brk:
  error: new/current break (0x555555762000) != saved break (0x56008807e000)
[6524] mtcp_restart.c:404 restore_brk:
  error: new/current break (0x555555762000) != saved break (0x55c08be3b000)
[6505] mtcp_restart.c:404 restore_brk:
  error: new/current break (0x555555762000) != saved break (0x55ef77669000)
*** dmtcp_checkpoint: This program is now restarting.

Is there anything I am doing incorrectly?

Thanks,
Camille




_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum

Reply via email to