Re: [OMPI users] orted daemon not found! --- environment not passed to slave nodes

2012-03-02 Thread Ralph Castain
On Mar 2, 2012, at 3:23 PM, Yiguang Yan wrote: > It turns out that the "-x" option should be put on each line of the app file > if app file is used. > > > So from tests (a),(b),(c), if I am using app file, the PATH and > LD_LIBRARY_PATH are only passed to slave node > when the "-x" is set

Re: [OMPI users] orted daemon not found! --- environment not passed to slave nodes

2012-03-02 Thread Yiguang Yan
It turns out that the "-x" option should be put on each line of the app file if app file is used. OK, now test results on our cluster, in case this may be useful to some Open MPI users(Open MPI 1.4.3 used on my system): (1) If I run mpirun command from command line as Jeff's foo test,

Re: [OMPI users] ssh between nodes

2012-03-02 Thread Denver Smith
Does anybody know if STARCCM software requires passwordless ssh? Big Thanks From: users-boun...@open-mpi.org [users-boun...@open-mpi.org] on behalf of Prentice Bisbal [prent...@ias.edu] Sent: Friday, March 02, 2012 9:09 AM To: us...@open-mpi.org

Re: [OMPI users] orted daemon not found! --- environment not passed to slave nodes

2012-03-02 Thread Jeffrey Squyres
On Mar 2, 2012, at 2:50 PM, Ralph Castain wrote: >> Ralph and I just had a phone conversation about this. We consider it a bug >> -- you shouldn't need to put --prefix in the app file. Meaning: --prefix is >> currently being ignored if you use an app file (and therefore you have to >> put

Re: [OMPI users] orted daemon not found! --- environment not passed to slave nodes

2012-03-02 Thread Ralph Castain
On Mar 2, 2012, at 10:50 AM, Jeffrey Squyres wrote: > On Mar 2, 2012, at 9:48 AM, Yiguang Yan wrote: > >> (All with the same test script test.bash I post in my previous emails, so >> run with app file fed to mpirun command.) >> >> (1) If I put the --prefix in the app file, on each line of it,

Re: [OMPI users] run orterun with more than 200 processes

2012-03-02 Thread Ralph Castain
On Mar 2, 2012, at 12:14 PM, Jianzhang He wrote: > Sorry that I did not make it clear. Actually, we are able to run in this way > with less than 100 processes. > I put ‘hostname1.domain.com,1,2,3,4,5,6,7,8,9,…..,196,197,198,199’ in a > hostfile and it does not work. I wonder what might be the

Re: [OMPI users] run orterun with more than 200 processes

2012-03-02 Thread Jianzhang He
Sorry that I did not make it clear. Actually, we are able to run in this way with less than 100 processes. I put 'hostname1.domain.com,1,2,3,4,5,6,7,8,9,.,196,197,198,199' in a hostfile and it does not work. I wonder what might be the equivalent format in a

Re: [OMPI users] Hybrid OpenMPI / OpenMP programming

2012-03-02 Thread Ralph Castain
On Mar 2, 2012, at 11:52 AM, Paul Kapinos wrote: > Hello Ralph, > I've some questions on placement and -cpus-per-rank. > >> First, use the --cpus-per-rank option to separate the ranks from each other. >> In other words, instead of --bind-to-socket -bysocket, you do: >> -bind-to-core

Re: [OMPI users] Hybrid OpenMPI / OpenMP programming

2012-03-02 Thread Paul Kapinos
Hello Ralph, I've some questions on placement and -cpus-per-rank. First, use the --cpus-per-rank option to separate the ranks from each other. In other words, instead of --bind-to-socket -bysocket, you do: -bind-to-core -cpus-per-rank N This will take each rank and bind it to a unique set of

Re: [OMPI users] orted daemon not found! --- environment not passed to slave nodes

2012-03-02 Thread Jeffrey Squyres
On Mar 2, 2012, at 9:48 AM, Yiguang Yan wrote: > (All with the same test script test.bash I post in my previous emails, so run > with app file fed to mpirun command.) > > (1) If I put the --prefix in the app file, on each line of it, it works fine > as Jeff said. > > (2) Since in the manual,

Re: [OMPI users] orted daemon not found! --- environment not passed to slave nodes

2012-03-02 Thread Ralph Castain
We'll take a look at the prefix behavior. As to the btl, you can always just force it: for example, -mca btl sm,self,openib would restrict it to shared memory and IB. On Mar 2, 2012, at 7:48 AM, Yiguang Yan wrote: > Hi Jeff, Ralph-- > > Please let me follow the thread, here are what I

Re: [OMPI users] Simple question on GRID

2012-03-02 Thread Prentice Bisbal
On 03/01/2012 12:10 AM, Shaandar Nyamtulga wrote: > Hi > I have two Beowulf clusters (both Ubuntu 10.10, one is OpenMPI, one is > MPICH2). > They run separately in their local network environment.I know there is > a way to integrate them through Internet, presumably by Grid software, > I guess. Is

Re: [OMPI users] ssh between nodes

2012-03-02 Thread Prentice Bisbal
On 02/29/2012 04:51 PM, Martin Siegert wrote: > Hi, > > On Wed, Feb 29, 2012 at 09:09:27PM +, Denver Smith wrote: >>Hello, >>On my cluster running moab and torque, I cannot ssh without a password >>between compute nodes. I can however request multiple node jobs fine. I >>was

Re: [OMPI users] [EXTERNAL] Re: Question regarding osu-benchamarks 3.1.1

2012-03-02 Thread Prentice Bisbal
On 02/29/2012 03:15 PM, Jeffrey Squyres wrote: > On Feb 29, 2012, at 2:57 PM, Jingcha Joba wrote: > >> So if I understand correctly, if a message size is smaller than it will use >> the MPI way (non-RDMA, 2 way communication), if its larger, then it would >> use the Open Fabrics, by using the

Re: [OMPI users] compilation error with pgcc Unknown switch

2012-03-02 Thread George Bosilca
Something is definitively weird in your compilation environment. The "undefined" function is defined in atomic_impl.h as a static inline (static inline int64_t opal_atomic_swap_64(volatile int64_t *addr,…). So either the compiler should have complained during compilation, or it should be

Re: [OMPI users] orted daemon not found! --- environment not passed to slave nodes

2012-03-02 Thread Yiguang Yan
Hi Jeff, Ralph-- Please let me follow the thread, here are what I observed: (All with the same test script test.bash I post in my previous emails, so run with app file fed to mpirun command.) (1) If I put the --prefix in the app file, on each line of it, it works fine as Jeff said. (2) Since

Re: [OMPI users] compilation error with pgcc Unknown switch

2012-03-02 Thread Jeffrey Squyres
I'm going to have to defer this to those who regularly build on Crays... Sandia / LANL? On Mar 2, 2012, at 12:12 AM, Abhinav Sarje wrote: > Hi again, > > I just tried building afresh -> svn co, autogen, configure, make. And > it failed at the same point as before: CCLD ompi_info

[OMPI users] "Connection to lifeline" with openmpi-1.4.5

2012-03-02 Thread Barnet Wagman
I've run into a problem upgrading from 1.4.3 to 1.4.4 or 1.4.5 With 1.4.4 and 1.4.5, I'm getting error messages like [[59597,1],0] routed:binomial: Connection to lifeline [[59597,0],0] lost The error does not occur if I restrict the host list to localhost. Basic tests like 'mpirun hello_c'

Re: [OMPI users] compilation error with pgcc Unknown switch

2012-03-02 Thread Abhinav Sarje
Hi again, I just tried building afresh -> svn co, autogen, configure, make. And it failed at the same point as before: >>>  CCLD   ompi_info >>> ../../../ompi/.libs/libmpi.so: undefined reference to `opal_atomic_swap_64' Any more ideas/fixes? Thanks all. Abhinav. On Fri, Mar 2, 2012 at 8:14