Hi,

I found that the server installed OpenMPI arms with iptables, so further 
communication between ib03 and ib04 is prohibited! The mpirun works fine across 
multi-server with hello_c

Thanks Ralph !

Thanks
-Yanfei

发件人: devel [mailto:devel-boun...@open-mpi.org] 代表 Ralph Castain
发送时间: 2014年3月26日 17:45
收件人: Open MPI Developers
主题: Re: [OMPI devel] 答复: example/Hello_c.c : mpirun run failed on two physical 
nodes.

Can you please configure OMPI with --enable-debug, and then execute
mpirun -mca plm_base_verbose 10 -host ib03 hostname
This will provide debug information about the problem.
Thanks
Ralph

On Tue, Mar 25, 2014 at 9:51 PM, Wang,Yanfei(SYS) 
<wangyanfe...@baidu.com<mailto:wangyanfe...@baidu.com>> wrote:
Hi,

Thanks jeff,  and I have not figured out what happened yet with this FAQ.

1. Ssh remote login OK:
[root@bb-nsi-ib04 examples]# ssh ib03 hostname
bb-nsi-ib03.bb01.*.com
[root@bb-nsi-ib04 examples]#
2. following command return immediately without nothing returned
[root@bb-nsi-ib04 examples]# mpirun -host ib03 hostname
[root@bb-nsi-ib04 examples]#
3. following command excute successfully.
[root@bb-nsi-ib04 examples]# ssh ib03  mpirun
--------------------------------------------------------------------------
mpirun could not find anything to do.

It is possible that you forgot to specify how many processes to run
via the "-np" argument.
--------------------------------------------------------------------------
[root@bb-nsi-ib04 examples]#

So, does it seem like that the non-interactive shell profile is not correctly 
configured?  Step 3 can execute succefully...

Hope any response!

BR

Yanfei Wang

-----邮件原件-----
发件人: devel 
[mailto:devel-boun...@open-mpi.org<mailto:devel-boun...@open-mpi.org>] 代表 Jeff 
Squyres (jsquyres)
发送时间: 2014年3月25日 22:09
收件人: Open MPI Developers
主题: Re: [OMPI devel] example/Hello_c.c : mpirun run failed on two physical 
nodes.

Try this FAQ entry:

http://www.open-mpi.org/faq/?category=running#diagnose-multi-host-problems

On Mar 25, 2014, at 6:54 AM, "Wang,Yanfei(SYS)" 
<wangyanfe...@baidu.com<mailto:wangyanfe...@baidu.com>> wrote:

> Hi,
>
> I am a fresh learner of OpenMPI programmer, and have some troubles on 
> building mpi programming, hope some helps..
>
> The example/helloc_c can works successfully with 2 process on local machine, 
> however, do not work on two separate physical node.
>
> Physical two nodes:
> Eg:
> [root@bb-nsi-ib04 examples]# mpirun -hostfile hosts -np 2 hello_c The
> command just return instantly without nothing printed.
> Local machine:
> [root@bb-nsi-ib04 examples]# mpirun -np 2 hello_c Hello, world, I am 0
> of 2, (Open MPI v1.7.5, package: Open MPI 
> root@bb-nsi-ib04.bb01.*.com<mailto:root@bb-nsi-ib04.bb01.*.com>
> Distribution, ident: 1.7.5, Mar 20, 2014, 108) Hello, world, I am 1 of
> 2, (Open MPI v1.7.5, package: Open MPI 
> root@bb-nsi-ib04.bb01.*.com<mailto:root@bb-nsi-ib04.bb01.*.com>
> Distribution, ident: 1.7.5, Mar 20, 2014, 108)
> [root@bb-nsi-ib04 examples]#
> -----peer machine is ok--------
> [root@bb-nsi-ib03 examples]# mpirun -np 2 hello_c Hello, world, I am 0
> of 2, (Open MPI v1.7.5, package: Open MPI 
> root@bb-nsi-ib03.bb01.*.com<mailto:root@bb-nsi-ib03.bb01.*.com>
> Distribution, ident: 1.7.5, Mar 20, 2014, 108) Hello, world, I am 1 of
> 2, (Open MPI v1.7.5, package: Open MPI 
> root@bb-nsi-ib03.bb01.*.com<mailto:root@bb-nsi-ib03.bb01.*.com>
> Distribution, ident: 1.7.5, Mar 20, 2014, 108)
> [root@bb-nsi-ib03 examples]#
> the command run successfully, and print two message!!
>
> Configuration details:
> OpenMPI version: 1.7.5
> Hostfile:
> [root@bb-nsi-ib04 examples]# cat hosts
> ib03 slots=1
> ib04 slots=1
> [root@bb-nsi-ib04 examples]#
> /etc/hosts:
> [root@bb-nsi-ib04 examples]# cat /etc/hosts
> 192.168.71.3 ib03
> 192.168.71.4 ib04
> SSH:
> Public rsa key is redistributed two machine, ib03 and ib04, and to do ssh 
> login in without password is ok, I am sure.
>
> I am confused about this trouble, and anyone can help us?  It have nothing 
> log and error tip, could anyone tell me how to do diagnose it.
>
> BR
>
> Yanfei Wang
>
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org<mailto:de...@open-mpi.org>
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/03/14385.php


--
Jeff Squyres
jsquy...@cisco.com<mailto:jsquy...@cisco.com>
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/

_______________________________________________
devel mailing list
de...@open-mpi.org<mailto:de...@open-mpi.org>
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post: 
http://www.open-mpi.org/community/lists/devel/2014/03/14386.php
_______________________________________________
devel mailing list
de...@open-mpi.org<mailto:de...@open-mpi.org>
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post: 
http://www.open-mpi.org/community/lists/devel/2014/03/14396.php

Reply via email to