Re: [OMPI users] Using OpenMPI on a network

2012-06-19 Thread VimalMathew
Just finished doing that. Still getting the same error. How do I make sure there are no old builds/files left? I uninstalled everything to do with MPI, Cygwin, cleared environment variables, did the whole Windows build again and then did the supercomputing tutorial. -- Vimal From:

Re: [OMPI users] 2012/06/18 14:35:07 auto-saved draft

2012-06-19 Thread Ralph Castain
That's a little bit strong - OMPI still supports checkpoint/restart as a fault tolerance mechanism. There really isn't anything the sys admin has to do, though - what is required is that users periodically order their programs to checkpoint so they can be restarted after a failure.

Re: [OMPI users] Using OpenMPI on a network

2012-06-19 Thread Damien
There's something else wrong, if that's the Supercomputing Blog tutorial 1 you're running. It works happily without a hostfile. I think you have some borked paths there. I don't know why a Windows version is looking for an etc directory for a hostfile, unless there's some of your previous

Re: [OMPI users] 2012/06/18 14:35:07 auto-saved draft

2012-06-19 Thread George Bosilca
It has been clearly stated that the official position pushed forward by a majority of the Open MPI developer community is that fault tolerance is not needed so we (read this as the official version of Open MPI) do not support it. However, a group of researchers have been working toward a

Re: [OMPI users] Using OpenMPI on a network

2012-06-19 Thread VimalMathew
Is hostname the name of the system I'm running it on? Just tried that. Got the same error message -- Vimal From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Tuesday, June 19, 2012 1:03 PM To: Open MPI Users Subject: Re: [OMPI users]

Re: [OMPI users] Using OpenMPI on a network

2012-06-19 Thread Ralph Castain
You're getting that error because you failed to specify any hosts on your cmd line - so OMPI has no idea where to launch the procs. It looked for a default hostfile, but didn't find that either. Just add a -host option to your command line and tell it where you want the procs to run. On Jun

Re: [OMPI users] Using OpenMPI on a network

2012-06-19 Thread VimalMathew
Damien, Shiqing, Jeff? -- Vimal From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf Of vimalmat...@eaton.com Sent: Monday, June 18, 2012 3:32 PM To: us...@open-mpi.org Subject: [OMPI users] Using OpenMPI on a network So I configured and compiled a simple

Re: [OMPI users] NVCC mpi.h: error: attribute "__deprecated__" does not take arguments

2012-06-19 Thread Dmitry N. Mikushin
Dear Rolf, I compiled openmpi-trunk with $ ../configure --prefix=/opt/openmpi-trunk --disable-mpi-interface-warning --with-cuda=/opt/cuda And that error is now gone! Thanks a lot for your assistance, - D. 2012/6/19 Rolf vandeVaart > Dmitry: > > It turns

[OMPI users] 2012/06/18 14:35:07 auto-saved draft

2012-06-19 Thread 陈松
Hi all, Can anyone explain the fault-tolerance features in Open MPI to me? I've read the FAQs and some papers on this topic listed on open-mpi.org, but I still can't figure out, when one node of my supercomputer system fails during a computation, what would happen with the fault-tolerance mechanism

[OMPI users] checkpointing of NPB

2012-06-19 Thread Ifeanyi
Dear, Please help. I configured Open MPI and it can checkpoint HPL. However, whenever I try to checkpoint the NAS Parallel Benchmarks, it kills the application without an informative message. How do I configure Open MPI 1.6 to checkpoint NPB? I really need help; I have been on this