Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-20 Thread Ralph H Castain
-mpi.org] On Behalf Of Ralph H Castain >>> Sent: Wednesday, December 19, 2007 2:35 PM >>> To: Open MPI Users ; pat.o'bry...@exxonmobil.com >>> Cc: Castain, Ralph H. (LANL) >>> Subject: Re: [OMPI users] Torque and OpenMPI 1.2 >>> >>> >&

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread Ralph H Castain
845, FAX: 505-284-2518 > >> -Original Message- >> From: users-boun...@open-mpi.org >> [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph H Castain >> Sent: Wednesday, December 19, 2007 2:35 PM >> To: Open MPI Users ; pat.o'bry...@exxonmobil.com &g

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread Adams, Brian M
] On Behalf Of Ralph H Castain > Sent: Wednesday, December 19, 2007 2:35 PM > To: Open MPI Users ; pat.o'bry...@exxonmobil.com > Cc: Castain, Ralph H. (LANL) > Subject: Re: [OMPI users] Torque and OpenMPI 1.2 > > > Open MPI 1.3 will support use of the hostfile and the tm

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread Ralph H Castain
To > Sent by: "Open MPI Users > users-bounces@ " > open- > mpi.org cc > Ralph H Cast

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread pat . o'bryant
/19/07 10:10 Re: [OMPI users] Torque and OpenMPI AM 1.2 Please respond to

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread Ralph H Castain
- just be aware of the limitations and disclaimers. I would only suggest it be used as a temporary workaround as opposed to a general practice. Ralph > >> From: "Caird, Andrew J" >> Date: December 19, 2007 9:40:27 AM EST >> To: "Open MPI Users" >>

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread pat . o'bryant
users-bounces@ cc open-mpi.org Subject Re: [OMPI users] Torque and OpenMPI 12/

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread Caird, Andrew J
age- > From: users-boun...@open-mpi.org > [mailto:users-boun...@open-mpi.org] On Behalf Of > pat.o'bry...@exxonmobil.com > Sent: Wednesday, December 19, 2007 9:37 AM > To: Open MPI Users > Subject: Re: [OMPI users] Torque and OpenMPI 1.2 > > Andrew, > Tha

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread pat . o'bryant
cc open-mpi.org Subject Re: [OMPI users] Torque and OpenMPI 12/19/07 07:59 1.2

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread Caird, Andrew J
On Behalf Of > > pat.o'bry...@exxonmobil.com > > Sent: Wednesday, December 19, 2007 8:47 AM > > To: Open MPI Users > > Cc: Open MPI Users; users-boun...@open-mpi.org > > Subject: Re: [OMPI users] Torque and OpenMPI 1.2 > > > > Terry, > > You

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread Caird, Andrew J
47 AM > To: Open MPI Users > Cc: Open MPI Users; users-boun...@open-mpi.org > Subject: Re: [OMPI users] Torque and OpenMPI 1.2 > > Terry, > Your suggestion worked. So long as I specifically state > "--without-tm", > the OpenMPI 1.2.4 build allows the use of &qu

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-19 Thread pat . o'bryant
cc users-bounces@ open-mpi.org Subject Re: [OMPI users] Torque and OpenMPI 1.2

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-18 Thread Terry Frankcombe
On Tue, 2007-12-18 at 11:59 -0700, Ralph H Castain wrote: > Hate to be a party-pooper, but the answer is "no" in OpenMPI 1.2. We don't > allow the use of a hostfile in a Torque environment in that version. > > We have changed this for v1.3, but you'll have to wait for that release. Can one not b

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-18 Thread Ralph H Castain
cc > open-mpi.org > > Subject > Re: [OMPI users] Torque and > OpenMPI > 12/18/07 11:57 1.2 > AM > > > Please respond >

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-18 Thread pat . o'bryant
open-mpi.org Subject Re: [OMPI users] Torque and OpenMPI 12/18/07 11:57 1.2 AM

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-18 Thread Tim Prins
Open MPI v1.2 had some problems with the TM configuration code which was fixed in v1.2.1. So any version v1.2.1 or later should work fine (and, as you indicate, 1.2.4 works fine). Tim On Tuesday 18 December 2007 12:48:40 pm pat.o'bry...@exxonmobil.com wrote: > Jeff, > Here is the result of

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-18 Thread pat . o'bryant
Jeff, Here is the result of the "pbs-config". By the way, I have successfully built OpenMPI 1.2.4 on this same system. The "config.log" for OpenMPI 1.2.4 shows the correct Torque path. That is not surprising since the "configure" script for OpenMPI 1.2.4 uses "pbs-config" while the configure s

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-18 Thread Jeff Squyres
Well that's fun. Is this the library location where Torque put them by default? What does "pbs-config --libs" return? Also -- I second Reuti's question: what is the nature of your requirement such that you need to be able to run outside of the nodes that have been allocated to a job? Are

Re: [OMPI users] Torque and OpenMPI 1.2

2007-12-18 Thread Reuti
Am 18.12.2007 um 17:09 schrieb pat.o'bry...@exxonmobil.com: We have Torque as an mpi job scheduler. Additionally, I have some users that want to modify the contents of "-hostfile" when they execute Why do they want to modify the hostfile? They should stay with the granted machines a

[OMPI users] Torque and OpenMPI 1.2

2007-12-18 Thread pat . o'bryant
We have Torque as an mpi job scheduler. Additionally, I have some users that want to modify the contents of "-hostfile" when they execute "mpirun". To allow the modification of the hostfile, I downloaded OpenMPI 1.2 and attempted to do a "configure" with the options shown below: ./configur

Re: [OMPI users] torque and openmpi

2007-08-01 Thread Adams, Samuel D Contr AFRL/HEDR
General Dynamics Information Technology Phone: 210.536.5945 -Original Message- From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf Of Jeff Squyres Sent: Friday, July 27, 2007 2:58 PM To: Open MPI Users Subject: Re: [OMPI users] torque and openmpi On Jul 27, 2007, at 2

Re: [OMPI users] torque and openmpi

2007-07-27 Thread Jeff Squyres
On Jul 27, 2007, at 2:48 PM, Galen Shipman wrote: I set up ompi before I configured Torque. Do I need to recompile ompi with appropriate torque configure options to get better integration? If libtorque wasn't present on the machine at configure then yes, you need to run: ./configure --with

Re: [OMPI users] torque and openmpi

2007-07-27 Thread Galen Shipman
t: Re: [OMPI users] torque and openmpi Are you not using the built-in OMPI support for Torque? The ssh keys should be irrelevant if using the TM API in Torque (i.e., OMPI won't be using ssh to launch remote processes; we use the internal TM API in Torque). On Jul 27, 2007, at 11:38 AM, Adams

Re: [OMPI users] torque and openmpi

2007-07-27 Thread Adams, Samuel D Contr AFRL/HEDR
...@open-mpi.org] On Behalf Of Jeff Squyres Sent: Friday, July 27, 2007 12:14 PM To: Open MPI Users Subject: Re: [OMPI users] torque and openmpi Are you not using the built-in OMPI support for Torque? The ssh keys should be irrelevant if using the TM API in Torque (i.e., OMPI won't be using s

Re: [OMPI users] torque and openmpi

2007-07-27 Thread Jeff Squyres
logy Phone: 210.536.5945 -Original Message- From: users-boun...@open-mpi.org [mailto:users-bounces@open- mpi.org] On Behalf Of George Bosilca Sent: Friday, July 27, 2007 10:13 AM To: Open MPI Users Subject: Re: [OMPI users] torque and openmpi The key is in the first line of the provided output.

Re: [OMPI users] torque and openmpi

2007-07-27 Thread Adams, Samuel D Contr AFRL/HEDR
Phone: 210.536.5945 -Original Message- From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf Of George Bosilca Sent: Friday, July 27, 2007 10:13 AM To: Open MPI Users Subject: Re: [OMPI users] torque and openmpi The key is in the first line of the provided output. One o

Re: [OMPI users] torque and openmpi

2007-07-27 Thread George Bosilca
The key is in the first line of the provided output. One of the connection failed because a wrong ssh key. Clean your .ssh/ known_hosts and the problem will vanish. Thanks, george. On Jul 27, 2007, at 11:01 AM, Adams, Samuel D Contr AFRL/HEDR wrote: When I run jobs with torque, I get

[OMPI users] torque and openmpi

2007-07-27 Thread Adams, Samuel D Contr AFRL/HEDR
When I run jobs with torque, I get this error message. Any ideas? [sam@prodnode1 all]$ cat script.sh.err Host key verification failed. [prodnode3.brooks.af.mil:03321] [0,0,0] ORTE_ERROR_LOG: Timeout in file base/pls_base_orted_cmds.c at line 275 [prodnode3.brooks.af.mil:03321] [0,0,0] ORTE_ERROR

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-02 Thread Ralph Castain
On 5/2/07 7:57 AM, "Ole Holm Nielsen" wrote: > > What I'm saying is that users should be able run the same script in different > environments, they being Torque or non-Torque, without having to change > the arguments to the mpirun command. Maybe they submit batch jobs to > our Linux/Torque

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-02 Thread Ole Holm Nielsen
Ralph, thanks very much for your continued support: Ralph Castain wrote: I'd say that this behavior of mpirun under Torque TM should be considered as a bug. Ideally, users should not have to design their scripts differently according to whether the sysadmin decided to configure in TM or not. Als

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-02 Thread Ralph Castain
I guess I am now totally confused, so I will have to ask your patience with a few questions. On 5/2/07 4:55 AM, "Ole Holm Nielsen" wrote: > Ralph Castain wrote: >> We would consider it a "feature" that OpenMPI is integrated with Torque. We >> actually read the PBS_NODEFILE internally ourselves.

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-02 Thread Ole Holm Nielsen
Ralph Castain wrote: We would consider it a "feature" that OpenMPI is integrated with Torque. We actually read the PBS_NODEFILE internally ourselves. I believe the problem here is that specifying the "machinefile" prevents us from using that Torque-integrated code and forces us down a different c

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-02 Thread Ralph Castain
On 5/2/07 1:28 AM, "Ole Holm Nielsen" wrote: > Bas hit the nail on the head: When using OpenMPI's mpirun under > Torque TM one apparently *must* omit the "-machinefile $PBS_NODEFILE" > flags and only specify "-np 2", presumably because TM knows all > about the machines under its control. > >

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-02 Thread Ole Holm Nielsen
Bas hit the nail on the head: When using OpenMPI's mpirun under Torque TM one apparently *must* omit the "-machinefile $PBS_NODEFILE" flags and only specify "-np 2", presumably because TM knows all about the machines under its control. This behavior is new to me: Is this a feature or a bug in O

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-02 Thread Bas van der Vlies
Ole Holm Nielsen wrote: We have built OpenMPI 1.2.1 with support for Torque 2.1.8 and its Task Manager interface. We use the PGI 6.2-4 compiler and the --with-tm option as described in http://www.open-mpi.org/faq/?category=building#build-rte-tm for building an OpenMPI RPM on a Pentium-4 machin

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-01 Thread Ole Holm Nielsen
Thanks for the suggestion. I inserted a printenv command and the path and library variables seem to be correct for our OpenMPI installation: LD_LIBRARY_PATH=/usr/local/openmpi-1.2.1-pgi/lib:/opt/intel/compiler90/lib MPIHOME=/usr/local/openmpi-1.2.1-pgi PATH=/usr/local/openmpi-1.2.1-pgi/bin:/usr/

Re: [OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-01 Thread Ralph Castain
The most likely problem is that you have a path or library issue regarding the location of the OpenMPI/OpenRTE executables when running batch versus interactive. We see this sometimes when the shell startups differ in those two modes. You might try just running a batch vs interactive printenv to s

[OMPI users] Torque and OpenMPI 1.2.1 problems

2007-05-01 Thread Ole Holm Nielsen
We have built OpenMPI 1.2.1 with support for Torque 2.1.8 and its Task Manager interface. We use the PGI 6.2-4 compiler and the --with-tm option as described in http://www.open-mpi.org/faq/?category=building#build-rte-tm for building an OpenMPI RPM on a Pentium-4 machine running CentOS 4.4 (RHEL4