Artem, Could you reply about the possibilities for integrating DMTCP directly into SLURM. We had talked about this, but I don't know if there are any concrete plans yet. If we don't yet have some concrete plans, can we set up a timetable for doing that? We can also do a lot of the work at our end.
Thanks, - Gene [ Sorry for my incomplete earlier reply. I had missed seeing the later e-mails when I repllied. ] On Fri, May 22, 2015 at 06:27:00AM -0400, Gene Cooperman wrote: > Hi Manuel, > We don't yet have direct integration with SLURM. We're hoping to > add that either this Summer or this Fall. One of our team members, > Artem Polyakov, has been talking a lot with the SLURM developers. > In the meantime, we do have indirect SLURM support. We provide > some submission scripts for SLURM that allow you to run the equivalent of: > dmtcp_launch > dmtcp_restart > > If you look in <DMTCP_ROOT>/plugin/batch-queue/job_examples/ > you'll find our sample SLURM scripts, and we hope that they're also > easy to modify. Let us know if you have any difficulty using it. > > As you look at that information and the README file, please tell us > if anything in the documentation is unclear. We are continuing to > revise the documentation to improve it. > > Best wishes, > - Gene > > > On Thu, May 21, 2015 at 05:04:58PM +0200, Manuel Rodríguez Pascual wrote: > > Hi all, > > > > I am (unsuccessfully) trying to integrate DMTCP with Slurm. > > > > On the first step, employing the scripts provided with DMTCP, I have > > succeeded and it is now working, both with serial tasks and MPICH3 > > ones. This is great news :) > > > > However, I would now like to employ this library from Slurm API. To do > > so, I guess I'll have to integrate DMTCP as a plugin, and then specify > > it in slurm.conf (variable "CheckpointType=checkpoint/XXXX". Is this > > possible? I have looked inside Slurm code and doesn't seem to have > > support out of the box, but I was imagining that maybe you have > > provided it some way or another. > > > > Thanks for your help, > > > > > > Manuel > > > > > > -- > > Dr. Manuel Rodríguez-Pascual > > skype: manuel.rodriguez.pascual > > phone: (+34) 913466173 // (+34) 679925108 > > > > CIEMAT-Moncloa > > Edificio 22, desp. 1.25 > > Avenida Complutense, 40 > > 28040- MADRID > > SPAIN > > > > ------------------------------------------------------------------------------ > > One dashboard for servers and applications across Physical-Virtual-Cloud > > Widest out-of-the-box monitoring support with 50+ applications > > Performance metrics, stats and reports that give you Actionable Insights > > Deep dive visibility with transaction tracing using APM Insight. > > http://ad.doubleclick.net/ddm/clk/290420510;117567292;y > > _______________________________________________ > > Dmtcp-forum mailing list > > Dmtcp-forum@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/dmtcp-forum > > ------------------------------------------------------------------------------ > One dashboard for servers and applications across Physical-Virtual-Cloud > Widest out-of-the-box monitoring support with 50+ applications > Performance metrics, stats and reports that give you Actionable Insights > Deep dive visibility with transaction tracing using APM Insight. > http://ad.doubleclick.net/ddm/clk/290420510;117567292;y > _______________________________________________ > Dmtcp-forum mailing list > Dmtcp-forum@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/dmtcp-forum ------------------------------------------------------------------------------ One dashboard for servers and applications across Physical-Virtual-Cloud Widest out-of-the-box monitoring support with 50+ applications Performance metrics, stats and reports that give you Actionable Insights Deep dive visibility with transaction tracing using APM Insight. http://ad.doubleclick.net/ddm/clk/290420510;117567292;y _______________________________________________ Dmtcp-forum mailing list Dmtcp-forum@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dmtcp-forum