Presumably Adaptive MPI would allow you to do that. I don’t know all the details of how that works there though.
From: users <users-boun...@open-mpi.org> on behalf of Ralph Castain <r...@open-mpi.org> Reply-To: Open MPI Users <us...@open-mpi.org> Date: Thursday, March 17, 2016 at 9:17 AM To: Open MPI Users <us...@open-mpi.org> Subject: Re: [OMPI users] Fault tolerant feature in Open MPI Just to clarify: I am not aware of any MPI that will allow you to relocate a process while it is running. You have to checkpoint the job, terminate it, and then restart the entire thing with the desired process on the new node. On Mar 16, 2016, at 3:15 AM, Husen R <hus...@gmail.com<mailto:hus...@gmail.com>> wrote: In the case of MPI application (not gromacs), How do I relocate MPI application from one node to another node while it is running ? I'm sorry, as far as I know the ompi-restart command is used to restart application, based on checkpoint file, once the application already terminated (no longer running). Thanks regards, Husen On Wed, Mar 16, 2016 at 4:29 PM, Jeff Hammond <jeff.scie...@gmail.com<mailto:jeff.scie...@gmail.com>> wrote: Just checkpoint-restart the app to relocate. The overhead will be lower than trying to do with MPI. Jeff On Wednesday, March 16, 2016, Husen R <hus...@gmail.com<mailto:hus...@gmail.com>> wrote: Hi Jeff, Thanks for the reply. After consulting the Gromacs docs, as you suggested, Gromacs already supports checkpoint/restart. thanks for the suggestion. Previously, I asked about checkpoint/restart in Open MPI because I want to checkpoint MPI Application and restart/migrate it while it is running. For the example, I run MPI application in node A,B and C in a cluster and I want to migrate process running in node A to other node, let's say to node C. is there a way to do this with open MPI ? thanks. Regards, Husen On Wed, Mar 16, 2016 at 12:37 PM, Jeff Hammond <jeff.scie...@gmail.com> wrote: Why do you need OpenMPI to do this? Molecular dynamics trajectories are trivial to checkpoint and restart at the application level. I'm sure Gromacs already supports this. Please consult the Gromacs docs or user support for details. Jeff On Tuesday, March 15, 2016, Husen R <hus...@gmail.com> wrote: Dear Open MPI Users, Does the current stable release of Open MPI (v1.10 series) support fault tolerant feature ? I got the information from Open MPI FAQ that The checkpoint/restart support was last released as part of the v1.6 series. I just want to make sure about this. and by the way, does Open MPI able to checkpoint or restart mpi application/GROMACS automatically ? Please, I really need help. Regards, Husen -- Jeff Hammond jeff.scie...@gmail.com http://jeffhammond.github.io/ _______________________________________________ users mailing list us...@open-mpi.org Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users Link to this post: http://www.open-mpi.org/community/lists/users/2016/03/28705.php -- Jeff Hammond jeff.scie...@gmail.com<mailto:jeff.scie...@gmail.com> http://jeffhammond.github.io/ _______________________________________________ users mailing list us...@open-mpi.org<mailto:us...@open-mpi.org> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users Link to this post: http://www.open-mpi.org/community/lists/users/2016/03/28709.php _______________________________________________ users mailing list us...@open-mpi.org<mailto:us...@open-mpi.org> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users Link to this post: http://www.open-mpi.org/community/lists/users/2016/03/28710.php