Hi Gene!

Thank you for your reply.

I know about that bug on DMTCP_CHECKPOINT_INTERVAL, version 2.4.1. My
previous question are about the version 2.2.

When I call ~/test/plugin/applic-initiated-ckpt/applic$ dmtcp_launch
--no-coordinator ./applic the applic, should the applic wait for the value
defined on DMTCP_CHECKPOINT_INTERVAL? (DMTCP version 2.2)

In the version 2.4.0 the --no-coordinator option are not working.

Thanks!

Edson
On Oct 8, 2015 10:20 PM, "Gene Cooperman" <g...@ccs.neu.edu> wrote:

> Hi Edson,
>     You seem to have hit a known bug that we have for dmtcp version 2.4.1.
> We had an unfortunate regression concerning interval checkpointing.
> --interval and DMTCP_CHECKPOINT_INTERVAL are not working properly
> in version 2.4.1.
>
>     We will be releasing version 2.4.2 in a few days.  In the meantime,
> your options are to use dmtcp-2.4.0, or else the development
> branch (which is currently reasonable stable).  The development branch
> can be found through:
>     google dmtcp download
>     --> http://dmtcp.sourceforge.net/downloads.html
>     ----> git clone https://github.com/dmtcp/dmtcp.git
>     ----> OR: wget https://github.com/dmtcp/dmtcp/archive/master.zip
>
> Best,
> - Gene
>
> On Thu, Oct 08, 2015 at 07:29:38PM +0200, Edson Tavares de Camargo wrote:
> > Hi Kapil, I will comment below:
> >
> > 2015-10-08 15:39 GMT+02:00 Kapil Arya <kapil.arya...@gmail.com>:
> >
> > > Hi Edson,
> > >
> > > For coordinator-less checkpointing, I would suggest that you use the
> > > "--no-coordinator" flag with dmtcp_launch.
> > >
> >
> > The version 2.2 woks fine with --no-coordination:
> >
> >  ~/test/plugin/applic-initiated-ckpt/applic$ dmtcp_launch
> --no-coordinator
> > ./applic
> >
> >
> >
> > > This allows you to specify an checkpoint interval.
> > >
> >
> > In this case above, how the checkpoint interval works? Should the
> applic.c
> > wait until DMTCP_CHECKPOINT_INTERVAL for makes the checkpoint. I ask
> > because seems that it not waiting for DMTCP_CHECKPOINT_INTERVAL.
> >
> >
> > > Further, you can also provide a port number with "--port" and then use
> > > dmtcp_command to request checkpoints explicitly.
> > >
> >
> > I would like that each process start the checkpoint in a different
> > interval. For sample, suppose 4 MPI processes:
> > - process 0 makes checkpoint every 5 seconds
> > - process 1 makes checkpoint every 8 seconds
> > - process 2 makes checkpoint every 3 seconds
> >  so on...
> >
> > Can I set that behaviour, that is, both interval and request checkpoints
> > directly in my MPI application code?
> >
> >
> > Thank you again!
> >
> > Edson
>
> >
> ------------------------------------------------------------------------------
>
> > _______________________________________________
> > Dmtcp-forum mailing list
> > Dmtcp-forum@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
>
>
------------------------------------------------------------------------------
_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum

Reply via email to