Hi Edson,
Back in the old days,
Dmtcp is link to user application.
Use a signal handler to trigger checkpoint.
A checkpoint thread is created to handle the checkpoint.
So there is no need of coordinator.

In this latest version, I am afraid you need a coordinator anyway,
If you don’t run the coordinator, dmtcp_launch will automatically bring up a 
coordinator for you.
This is the current behavior of 2.4.1

If you just want to checkpoint each process, not every process.
Maybe you can run a coordinator for each dmtcp_launch and set the environment 
variable to different coordinator.

Thanks.
Nausca.


From: Edson Tavares de Camargo 
<etcamarg...@gmail.com<mailto:etcamarg...@gmail.com>>
Date: 2015年10月7日 星期三 17:54
To: Nausca <nau...@cadence.com<mailto:nau...@cadence.com>>
Cc: "Sourceforge. Net Dmtcp-Forum@Lists." 
<dmtcp-forum@lists.sourceforge.net<mailto:dmtcp-forum@lists.sourceforge.net>>
Subject: Re: [Dmtcp-forum] Uncoordinated checkpoint for MPI

Hi Nausca,

Thank you for your reply!

Let me see if I understood correctly. Using an older version (1.x) my system 
will be capable of to create non-coordinated checkpoints among processes. Then, 
if I run:

- <dmtcp command> mpirun -np 8 ./test - where each process executes on a 
different machine

I will have each one of that process creating a checkpoint, ok?

> But in this case, you have to link your source code with dmtcp library so 
> files.

How could I do that? I will have to use the function dmtcp Checkpoint() into 
the application code?

Thanks a lot!

Edson

2015-10-07 11:28 GMT+02:00 Nausca Hsu 
<nau...@cadence.com<mailto:nau...@cadence.com>>:
Hi,
You have to find old version of dmtcp (1.x). At that version, no coordinator is 
required.
I am working on this now.
To make latest version run as a single process and no coordinator needed.
And no dmtcp_launch needed neither.

But in this case, you have to link your source code with dmtcp library so files.

Thanks.
Nausca.

From: Edson Tavares de Camargo 
<etcamarg...@gmail.com<mailto:etcamarg...@gmail.com>>
Date: 2015年10月7日 星期三 16:32
To: "Sourceforge. Net Dmtcp-Forum@Lists." 
<dmtcp-forum@lists.sourceforge.net<mailto:dmtcp-forum@lists.sourceforge.net>>
Subject: [Dmtcp-forum] Uncoordinated checkpoint for MPI

Hi Everyone!

This is my first contact with DMTCP. I'm a phd student and I'm working on a 
message logging protocol for MPI. I'm using OpenMPI for implementing my 
proposal. I have read the DMTCP documentation and I have few questions. But 
first of all, I will tell you why I would like to use a checkpoint tool:

- My message logging protocol supposes that processes create checkpoints on a 
uncoordinated approach. Each process creates a checkpoint independently of 
other. There will be no coordination among the processes.

- For now, I am not worried about a  process recovery. This will be part of a 
next phase of my work.

Now my questions about DMTCP.

- There is a coordinator. It is responsible for starting the checkpoints on the 
other processes, right? DMTCP follows a coordinated checkpoint approach and 
creates a consistent global state, ok?

- Would be possible to use DMTCP, or DMTCP plugin, in order to implement a 
uncoordinated checkpoint? In this moment just take checkpoint independently on 
each process.

Thank you in advance!

Edson

------------------------------------------------------------------------------
Full-scale, agent-less Infrastructure Monitoring from a single dashboard
Integrate with 40+ ManageEngine ITSM Solutions for complete visibility
Physical-Virtual-Cloud Infrastructure monitoring from one console
Real user monitoring with APM Insights and performance trend reports 
Learn More http://pubads.g.doubleclick.net/gampad/clk?id=247754911&iu=/4140
_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum

Reply via email to