Re: [OMPI users] checkpointing multi node and multi process applications

2010-03-04 Thread Joshua Hursey
On Mar 4, 2010, at 8:17 AM, Fernando Lemos wrote: > On Wed, Mar 3, 2010 at 10:24 PM, Fernando Lemos wrote: > >> Is there anything I can do to provide more information about this bug? >> E.g. try to compile the code in the SVN trunk? I also have kept the >> snapshots

Re: [OMPI users] checkpointing multi node and multi process applications

2010-03-04 Thread Fernando Lemos
On Wed, Mar 3, 2010 at 10:24 PM, Fernando Lemos wrote: > Is there anything I can do to provide more information about this bug? > E.g. try to compile the code in the SVN trunk? I also have kept the > snapshots intact, I can tar them up and upload them somewhere in case >

Re: [OMPI users] checkpointing multi node and multi process applications

2010-01-25 Thread Josh Hursey
. Thank you Jean --- On Mon, 11/1/10, Josh Hursey <jjhur...@open-mpi.org> wrote: From: Josh Hursey <jjhur...@open-mpi.org> Subject: Re: [OMPI users] checkpointing multi node and multi process applications To: "Open MPI Users" <us...@open-mpi.org> Date: Mon

Re: [OMPI users] checkpointing multi node and multi process applications

2010-01-25 Thread Josh Hursey
org> wrote: From: Josh Hursey <jjhur...@open-mpi.org> Subject: Re: [OMPI users] checkpointing multi node and multi process applications To: "Open MPI Users" <us...@open-mpi.org> Date: Monday, 11 January, 2010, 21:42 On Dec 19, 2009, at 7:42 AM, Jean

Re: [OMPI users] checkpointing multi node and multi process applications

2010-01-25 Thread Josh Hursey
to resolve this problem. Thank you Jean --- On Mon, 11/1/10, Josh Hursey <jjhur...@open-mpi.org> wrote: From: Josh Hursey <jjhur...@open-mpi.org> Subject: Re: [OMPI users] checkpointing multi node and multi process applications To: "Open MPI Users" <us...@open-

Re: [OMPI users] checkpointing multi node and multi process applications

2010-01-11 Thread Josh Hursey
On Dec 19, 2009, at 7:42 AM, Jean Potsam wrote: Hi Everyone, I am trying to checkpoint an mpi application running on multiple nodes. However, I get some error messages when i trigger the checkpointing process. Error: expected_component: PID information unavailable!