Re: [OMPI users] Checkpointing a restarted app fails

2008-09-24 Thread Matthias Hovestadt
Hi Josh! I believe this is now fixed in the trunk. I was able to reproduce with the current trunk and committed a fix a few minutes ago in r19601. So the fix should be in tonight's tarball (or you can grab it from SVN). I've made a request to have the patch applied to v1.3, but that may take a

Re: [OMPI users] Checkpointing a restarted app fails

2008-09-22 Thread Josh Hursey
I believe this is now fixed in the trunk. I was able to reproduce with the current trunk and committed a fix a few minutes ago in r19601. So the fix should be in tonight's tarball (or you can grab it from SVN). I've made a request to have the patch applied to v1.3, but that may take a day

Re: [OMPI users] Checkpointing a restarted app fails

2008-09-18 Thread Matthias Hovestadt
Hi Josh! First of all, thanks a lot for replying. :-) When executing this checkpoint command, the running application directly aborts, even though I did not specify the "--term" option: -- mpirun noticed that process

Re: [OMPI users] Checkpointing a restarted app fails

2008-09-17 Thread Josh Hursey
On Sep 16, 2008, at 11:18 PM, Matthias Hovestadt wrote: Hi! Since I am interested in fault tolerance, checkpointing and restart of OMPI is an interesting feature for me. So I installed BLCR 0.7.3 as well as OMPI from SVN (rev. 19553). For OMPI I followed the instructions in the "Fault

[OMPI users] Checkpointing a restarted app fails

2008-09-17 Thread Matthias Hovestadt
Hi! Since I am interested in fault tolerance, checkpointing and restart of OMPI is an interesting feature for me. So I installed BLCR 0.7.3 as well as OMPI from SVN (rev. 19553). For OMPI I followed the instructions in the "Fault Tolerance Guide" in the OMPI wiki: ./autogen.sh ./configure