Re: [MTT users] starting mtt
On Aug 29, 2007, at 4:54 PM, Mohamad Chaarawi wrote: Thanks Jeff, that does it.. I actually didn't realize there were sample inis and sh in ompi-test.. those help a lot :) I'd actually call them "what works for us", not necessarily rugged samples. :-) Cisco and IU keep their files in ompi-tests. -- Jeff Squyres Cisco Systems
Re: [MTT users] starting mtt
Thanks Jeff, that does it.. I actually didn't realize there were sample inis and sh in ompi-test.. those help a lot :) Jeff Squyres wrote: > I see the problem -- this is pretty obscure. :-( > > The specify_module field is listed twice in the [Test run: onesided] > section. > > I can explain what exactly is happening, but it's neither important > nor interesting. :-) Remove one of them, and you should be ok. > > FWIW, I've been using a funclet to get my DB password recently, like > this: > > mttdatabase_username = cisco > mttdatabase_password = ("/home/mpiteam/mtt-db-password.txt") > mttdatabase_platform = svbu-mpi > > That allows me to SVN commit my INI file to the ompi-tests > repository, etc. > > > > On Aug 29, 2007, at 3:35 PM, Mohamad Chaarawi wrote: > >> i removed username/password since im sending this publicly.. >> >> Jeff Squyres wrote: >>> On Aug 29, 2007, at 2:19 PM, Mohamad Chaarawi wrote: >>> Im trying to execute mtt on our cluster at uh.. This is the first time we do mtt here so bare with me here.. I edited the ompi-core-template.ini file to test only the trunk and added the username/password/platform. I allocated 4 nodes and ran: client/mtt -d --force -f samples/ompi-core-template.ini --trial -s scratch >>> Can you send your edited ini file? >>> OMPI configures and compiles fine, and then IBM tests start. After the IBM tests complete mtt just exits with an error. I copied some of the output that i saw was not right at the end.. any help is appreciated. Thanks, -Mohamad Making dir: /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi- nightly-trunk/ 1.3a1r15990/gcc_warnings/ibm/ibm/final (cwd: /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm) /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi-nightly- trunk/1.3 a1r15990/gcc_warnings/ibm/ibm/final does not exist -- creating chdir /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi- nightly-trunk/ 1.3a1r15990/gcc_warnings/ibm/ibm/final/ chdir /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm ### Test progress: 181 of 181 section tests complete (100%) Evaluating: require MTT::Reporter::MTTDatabase Evaluating: $ret = ::Reporter::MTTDatabase::Submit(@args) MTTDatabase reporter Got hostname: shark Submitting to MTTDatabase... Running command: gzip --force /tmp/3omP2I6lK7.inc MTTDatabase trying proxy: / Default (none) MTTDatabase got response: MTTDatabase server notice: mpi_install_section_name is not in mtt database. MTTDatabase server notice: fields is not in mtt database. MTTDatabase server notice: phase is not in mtt database. MTTDatabase server notice: test_build_section_name is not in mtt database. MTTDatabase server notice: variant is not in mtt database. MTTDatabase server notice: number_of_results is not in mtt database. MTTDatabase server notice: command is not in mtt database. MTT submission for test run MTTDatabase server notice: The submitting client did not submit valid IDs for one or more of the following 'launcher', 'resource_manager', 'paramters' or 'network' some output. Evaluating: require MTT::Test::Specify::Simple Simple *** ERROR: Module aborted during require: MTT::Test::Specify::Simple Simple: syntax error at (eval 12728) line 4, near "require MTT::Test::Specify::Simple Simple " -- Mohamad Chaarawi Instructional Assistant http://www.cs.uh.edu/~mschaara Department of Computer Science University of Houston 4800 Calhoun, PGH Room 526Houston, TX 77204, USA ___ mtt-users mailing list mtt-us...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/mtt-users >>> >> >> -- >> Mohamad Chaarawi >> Instructional Assistanthttp://www.cs.uh.edu/~mschaara >> Department of Computer Science University of Houston >> 4800 Calhoun, PGH Room 526Houston, TX 77204, USA >> # >> # Copyright (c) 2006-2007 Cisco Systems, Inc. All rights reserved. >> # Copyright (c) 2006-2007 Sun Microystems, Inc. All rights reserved. >> # >> >> # Template MTT configuration file for Open MPI core testers. The >> # intent for this template file is to establish at least some loose >> # guidelines for what Open MPI core testers should be running on a >> # nightly basis. This file is not intended to be an exhaustive sample >> # of all possible fields and values that MTT offers. Each site will >> # undoubtedly have to edit this template for their local needs (e.g., >> # pick
Re: [MTT users] starting mtt
I see the problem -- this is pretty obscure. :-( The specify_module field is listed twice in the [Test run: onesided] section. I can explain what exactly is happening, but it's neither important nor interesting. :-) Remove one of them, and you should be ok. FWIW, I've been using a funclet to get my DB password recently, like this: mttdatabase_username = cisco mttdatabase_password = ("/home/mpiteam/mtt-db-password.txt") mttdatabase_platform = svbu-mpi That allows me to SVN commit my INI file to the ompi-tests repository, etc. On Aug 29, 2007, at 3:35 PM, Mohamad Chaarawi wrote: i removed username/password since im sending this publicly.. Jeff Squyres wrote: On Aug 29, 2007, at 2:19 PM, Mohamad Chaarawi wrote: Im trying to execute mtt on our cluster at uh.. This is the first time we do mtt here so bare with me here.. I edited the ompi-core-template.ini file to test only the trunk and added the username/password/platform. I allocated 4 nodes and ran: client/mtt -d --force -f samples/ompi-core-template.ini --trial -s scratch Can you send your edited ini file? OMPI configures and compiles fine, and then IBM tests start. After the IBM tests complete mtt just exits with an error. I copied some of the output that i saw was not right at the end.. any help is appreciated. Thanks, -Mohamad Making dir: /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi- nightly-trunk/ 1.3a1r15990/gcc_warnings/ibm/ibm/final (cwd: /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm) /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi-nightly- trunk/1.3 a1r15990/gcc_warnings/ibm/ibm/final does not exist -- creating chdir /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi- nightly-trunk/ 1.3a1r15990/gcc_warnings/ibm/ibm/final/ chdir /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm ### Test progress: 181 of 181 section tests complete (100%) Evaluating: require MTT::Reporter::MTTDatabase Evaluating: $ret = ::Reporter::MTTDatabase::Submit(@args) MTTDatabase reporter Got hostname: shark Submitting to MTTDatabase... Running command: gzip --force /tmp/3omP2I6lK7.inc MTTDatabase trying proxy: / Default (none) MTTDatabase got response: MTTDatabase server notice: mpi_install_section_name is not in mtt database. MTTDatabase server notice: fields is not in mtt database. MTTDatabase server notice: phase is not in mtt database. MTTDatabase server notice: test_build_section_name is not in mtt database. MTTDatabase server notice: variant is not in mtt database. MTTDatabase server notice: number_of_results is not in mtt database. MTTDatabase server notice: command is not in mtt database. MTT submission for test run MTTDatabase server notice: The submitting client did not submit valid IDs for one or more of the following 'launcher', 'resource_manager', 'paramters' or 'network' some output. Evaluating: require MTT::Test::Specify::Simple Simple *** ERROR: Module aborted during require: MTT::Test::Specify::Simple Simple: syntax error at (eval 12728) line 4, near "require MTT::Test::Specify::Simple Simple " -- Mohamad Chaarawi Instructional Assistant http://www.cs.uh.edu/~mschaara Department of Computer ScienceUniversity of Houston 4800 Calhoun, PGH Room 526Houston, TX 77204, USA ___ mtt-users mailing list mtt-us...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/mtt-users -- Mohamad Chaarawi Instructional Assistant http://www.cs.uh.edu/~mschaara Department of Computer ScienceUniversity of Houston 4800 Calhoun, PGH Room 526Houston, TX 77204, USA # # Copyright (c) 2006-2007 Cisco Systems, Inc. All rights reserved. # Copyright (c) 2006-2007 Sun Microystems, Inc. All rights reserved. # # Template MTT configuration file for Open MPI core testers. The # intent for this template file is to establish at least some loose # guidelines for what Open MPI core testers should be running on a # nightly basis. This file is not intended to be an exhaustive sample # of all possible fields and values that MTT offers. Each site will # undoubtedly have to edit this template for their local needs (e.g., # pick compilers to use, etc.), but this file provides a baseline set # of configurations that we intend you to run. # OMPI core members will need to edit some values in this file based # on your local testing environment. Look for comments with "OMPI # Core:" for instructions on what to change. # Note that this file is artificially longer than it really needs to # be -- a bunch of values are explicitly set here that are exactly # equivalent to their defaults. This is mainly because there is no # reliable form of documentation for this ini file yet, so the values # here comprise a good set of what options are settable (although it # is
Re: [MTT users] starting mtt
i removed username/password since im sending this publicly.. Jeff Squyres wrote: > On Aug 29, 2007, at 2:19 PM, Mohamad Chaarawi wrote: > >> Im trying to execute mtt on our cluster at uh.. >> This is the first time we do mtt here so bare with me here.. >> >> I edited the ompi-core-template.ini file to test only the trunk and >> added the username/password/platform. >> I allocated 4 nodes and ran: >> >> client/mtt -d --force -f samples/ompi-core-template.ini --trial -s >> scratch > > Can you send your edited ini file? > >> OMPI configures and compiles fine, and then IBM tests start. >> >> After the IBM tests complete mtt just exits with an error. >> I copied some of the output that i saw was not right at the end.. >> any help is appreciated. >> >> Thanks, >> -Mohamad >> >> >> Making dir: >>/home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi- >> nightly-trunk/ >>1.3a1r15990/gcc_warnings/ibm/ibm/final (cwd: >>/home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm) >> /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi-nightly- >> trunk/1.3 >>a1r15990/gcc_warnings/ibm/ibm/final does not exist -- creating >> chdir >>/home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi- >> nightly-trunk/ >>1.3a1r15990/gcc_warnings/ibm/ibm/final/ >> chdir /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm >>### Test progress: 181 of 181 section tests complete (100%) >> Evaluating: require MTT::Reporter::MTTDatabase >> Evaluating: $ret = ::Reporter::MTTDatabase::Submit(@args) >> MTTDatabase reporter >> Got hostname: shark >> Submitting to MTTDatabase... >> Running command: gzip --force /tmp/3omP2I6lK7.inc >> MTTDatabase trying proxy: / Default (none) >> MTTDatabase got response: MTTDatabase server notice: >>mpi_install_section_name is not in mtt database. >>MTTDatabase server notice: fields is not in mtt database. >>MTTDatabase server notice: phase is not in mtt database. >>MTTDatabase server notice: test_build_section_name is not in mtt >>database. >>MTTDatabase server notice: variant is not in mtt database. >>MTTDatabase server notice: number_of_results is not in mtt >> database. >>MTTDatabase server notice: command is not in mtt database. >> >>MTT submission for test run >>MTTDatabase server notice: The submitting client did not submit >> valid >>IDs for one or more of the following >>'launcher', 'resource_manager', 'paramters' or 'network' >> >> some output. >> >> >> Evaluating: require MTT::Test::Specify::Simple >>Simple >> *** ERROR: Module aborted during require: MTT::Test::Specify::Simple >> Simple: syntax error at (eval 12728) line 4, near "require >> MTT::Test::Specify::Simple >> Simple >> " >> >> -- >> Mohamad Chaarawi >> Instructional Assistanthttp://www.cs.uh.edu/~mschaara >> Department of Computer Science University of Houston >> 4800 Calhoun, PGH Room 526Houston, TX 77204, USA >> ___ >> mtt-users mailing list >> mtt-us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/mtt-users > > -- Mohamad Chaarawi Instructional Assistant http://www.cs.uh.edu/~mschaara Department of Computer ScienceUniversity of Houston 4800 Calhoun, PGH Room 526Houston, TX 77204, USA # # Copyright (c) 2006-2007 Cisco Systems, Inc. All rights reserved. # Copyright (c) 2006-2007 Sun Microystems, Inc. All rights reserved. # # Template MTT configuration file for Open MPI core testers. The # intent for this template file is to establish at least some loose # guidelines for what Open MPI core testers should be running on a # nightly basis. This file is not intended to be an exhaustive sample # of all possible fields and values that MTT offers. Each site will # undoubtedly have to edit this template for their local needs (e.g., # pick compilers to use, etc.), but this file provides a baseline set # of configurations that we intend you to run. # OMPI core members will need to edit some values in this file based # on your local testing environment. Look for comments with "OMPI # Core:" for instructions on what to change. # Note that this file is artificially longer than it really needs to # be -- a bunch of values are explicitly set here that are exactly # equivalent to their defaults. This is mainly because there is no # reliable form of documentation for this ini file yet, so the values # here comprise a good set of what options are settable (although it # is not a comprehensive set). # Also keep in mind that at the time of this writing, MTT is still # under active development and therefore the baselines established in # this file may change on a relatively frequent basis. # The guidelines are as follows: # # 1. Download and test nightly snapshot tarballs of at least one of #the following: #- the trunk
Re: [MTT users] starting mtt
On Aug 29, 2007, at 2:19 PM, Mohamad Chaarawi wrote: Im trying to execute mtt on our cluster at uh.. This is the first time we do mtt here so bare with me here.. I edited the ompi-core-template.ini file to test only the trunk and added the username/password/platform. I allocated 4 nodes and ran: client/mtt -d --force -f samples/ompi-core-template.ini --trial -s scratch Can you send your edited ini file? OMPI configures and compiles fine, and then IBM tests start. After the IBM tests complete mtt just exits with an error. I copied some of the output that i saw was not right at the end.. any help is appreciated. Thanks, -Mohamad Making dir: /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi- nightly-trunk/ 1.3a1r15990/gcc_warnings/ibm/ibm/final (cwd: /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm) /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi-nightly- trunk/1.3 a1r15990/gcc_warnings/ibm/ibm/final does not exist -- creating chdir /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi- nightly-trunk/ 1.3a1r15990/gcc_warnings/ibm/ibm/final/ chdir /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm ### Test progress: 181 of 181 section tests complete (100%) Evaluating: require MTT::Reporter::MTTDatabase Evaluating: $ret = ::Reporter::MTTDatabase::Submit(@args) MTTDatabase reporter Got hostname: shark Submitting to MTTDatabase... Running command: gzip --force /tmp/3omP2I6lK7.inc MTTDatabase trying proxy: / Default (none) MTTDatabase got response: MTTDatabase server notice: mpi_install_section_name is not in mtt database. MTTDatabase server notice: fields is not in mtt database. MTTDatabase server notice: phase is not in mtt database. MTTDatabase server notice: test_build_section_name is not in mtt database. MTTDatabase server notice: variant is not in mtt database. MTTDatabase server notice: number_of_results is not in mtt database. MTTDatabase server notice: command is not in mtt database. MTT submission for test run MTTDatabase server notice: The submitting client did not submit valid IDs for one or more of the following 'launcher', 'resource_manager', 'paramters' or 'network' some output. Evaluating: require MTT::Test::Specify::Simple Simple *** ERROR: Module aborted during require: MTT::Test::Specify::Simple Simple: syntax error at (eval 12728) line 4, near "require MTT::Test::Specify::Simple Simple " -- Mohamad Chaarawi Instructional Assistant http://www.cs.uh.edu/~mschaara Department of Computer ScienceUniversity of Houston 4800 Calhoun, PGH Room 526Houston, TX 77204, USA ___ mtt-users mailing list mtt-us...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/mtt-users -- Jeff Squyres Cisco Systems
[MTT users] starting mtt
Im trying to execute mtt on our cluster at uh.. This is the first time we do mtt here so bare with me here.. I edited the ompi-core-template.ini file to test only the trunk and added the username/password/platform. I allocated 4 nodes and ran: client/mtt -d --force -f samples/ompi-core-template.ini --trial -s scratch OMPI configures and compiles fine, and then IBM tests start. After the IBM tests complete mtt just exits with an error. I copied some of the output that i saw was not right at the end.. any help is appreciated. Thanks, -Mohamad Making dir: /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi-nightly-trunk/ 1.3a1r15990/gcc_warnings/ibm/ibm/final (cwd: /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm) /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi-nightly-trunk/1.3 a1r15990/gcc_warnings/ibm/ibm/final does not exist -- creating chdir /home/mschaara/mtt/trunk/scratch/test_runs/test_runs/ompi-nightly-trunk/ 1.3a1r15990/gcc_warnings/ibm/ibm/final/ chdir /home/mschaara/mtt/trunk/scratch/installs/iCPw/tests/ibm/ibm ### Test progress: 181 of 181 section tests complete (100%) Evaluating: require MTT::Reporter::MTTDatabase Evaluating: $ret = ::Reporter::MTTDatabase::Submit(@args) MTTDatabase reporter Got hostname: shark Submitting to MTTDatabase... Running command: gzip --force /tmp/3omP2I6lK7.inc MTTDatabase trying proxy: / Default (none) MTTDatabase got response: MTTDatabase server notice: mpi_install_section_name is not in mtt database. MTTDatabase server notice: fields is not in mtt database. MTTDatabase server notice: phase is not in mtt database. MTTDatabase server notice: test_build_section_name is not in mtt database. MTTDatabase server notice: variant is not in mtt database. MTTDatabase server notice: number_of_results is not in mtt database. MTTDatabase server notice: command is not in mtt database. MTT submission for test run MTTDatabase server notice: The submitting client did not submit valid IDs for one or more of the following 'launcher', 'resource_manager', 'paramters' or 'network' some output. Evaluating: require MTT::Test::Specify::Simple Simple *** ERROR: Module aborted during require: MTT::Test::Specify::Simple Simple: syntax error at (eval 12728) line 4, near "require MTT::Test::Specify::Simple Simple " -- Mohamad Chaarawi Instructional Assistant http://www.cs.uh.edu/~mschaara Department of Computer ScienceUniversity of Houston 4800 Calhoun, PGH Room 526Houston, TX 77204, USA
[OMPI users] Out of office message
I will be out of office until Sept, 10 and can't read incoming messages in time. Please contact my colleague Michael Rauh r...@par-tec.com (fon +49 89 99809 500). Thank you Dieter Schuett ParTec GmbH Sales & Marketing
Re: [OMPI users] Basic problems with OpenMPI
Amit -- I think you want to have a look at the "setup" FAQ -- many of the questions you have asked are answered there: http://www.open-mpi.org/faq/?category=running On Aug 29, 2007, at 6:07 AM, Amit Kumar Saha wrote: Hi Gleb, The above output shows that you have a problem on host ubuntu- desktop-2. Have you setup login without a password from ubuntu-desktop-1 to ubuntu-desktop-2? Thank you very much. It works! Regards -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com ___ users mailing list us...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres Cisco Systems
Re: [OMPI users] Basic problems with OpenMPI
Hi Gleb, > The above output shows that you have a problem on host ubuntu-desktop-2. > Have you setup login without a password from ubuntu-desktop-1 to > ubuntu-desktop-2? Thank you very much. It works! Regards -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com
Re: [OMPI users] Basic problems with OpenMPI
On Wed, Aug 29, 2007 at 03:22:54PM +0530, Amit Kumar Saha wrote: > Hi Glib, > > i am sending a sample trace of my program: > > amit@ubuntu-desktop-1:~/mpi-exec$ mpirun --np 3 --hostfile > mpi-host-file HellMPI > > amit@debian-desktop-1's password: [ubuntu-desktop-1:28575] [0,0,0] > ORTE_ERROR_LOG: Timeout in file base/pls_base_orted_cmds.c at line 275 > [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file > pls_rsh_module.c at line 1164 > [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file > errmgr_hnp.c at line 90 > [ubuntu-desktop-1:28575] ERROR: A daemon on node ubuntu-desktop-2 > failed to start as expected. > [ubuntu-desktop-1:28575] ERROR: There may be more information available from > [ubuntu-desktop-1:28575] ERROR: the remote shell (see above). > [ubuntu-desktop-1:28575] ERROR: The daemon exited unexpectedly with status > 255. > [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file > base/pls_base_orted_cmds.c at line 188 > [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file > pls_rsh_module.c at line 1196 > -- > mpirun was unable to cleanly terminate the daemons for this job. > Returned value Timeout instead of ORTE_SUCCESS. > > -- > > this is what I get when i run the program. > > However when i use "--np 2 " it works perfectly which of course means > that it is not a problem with "debian-desktop-1" as the above output > may show. > The above output shows that you have a problem on host ubuntu-desktop-2. Have you setup login without a password from ubuntu-desktop-1 to ubuntu-desktop-2? -- Gleb.
Re: [OMPI users] Basic problems with OpenMPI
Hi Glib, i am sending a sample trace of my program: amit@ubuntu-desktop-1:~/mpi-exec$ mpirun --np 3 --hostfile mpi-host-file HellMPI amit@debian-desktop-1's password: [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file base/pls_base_orted_cmds.c at line 275 [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file pls_rsh_module.c at line 1164 [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file errmgr_hnp.c at line 90 [ubuntu-desktop-1:28575] ERROR: A daemon on node ubuntu-desktop-2 failed to start as expected. [ubuntu-desktop-1:28575] ERROR: There may be more information available from [ubuntu-desktop-1:28575] ERROR: the remote shell (see above). [ubuntu-desktop-1:28575] ERROR: The daemon exited unexpectedly with status 255. [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file base/pls_base_orted_cmds.c at line 188 [ubuntu-desktop-1:28575] [0,0,0] ORTE_ERROR_LOG: Timeout in file pls_rsh_module.c at line 1196 -- mpirun was unable to cleanly terminate the daemons for this job. Returned value Timeout instead of ORTE_SUCCESS. -- this is what I get when i run the program. However when i use "--np 2 " it works perfectly which of course means that it is not a problem with "debian-desktop-1" as the above output may show. Please refer to the host file as well. (attached). I am using the same openMPI version 1.2.3 and compiled all the executables using that. Waiting for your suggestions. Thanks -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com mpi-host-file Description: Binary data
Re: [OMPI users] Basic problems with OpenMPI
On Wed, Aug 29, 2007 at 02:49:35PM +0530, Amit Kumar Saha wrote: > Hi gleb, > > > > Have you installed Open MPI at the same place on all nodes? What command > > line are you using to run app on more then one host? > > this is a sample run > > amit@ubuntu-desktop-1:~/mpi-exec$ mpirun --np 2 --hostfile > mpi-host-file HellMPI > amit@ubuntu-desktop-2's password: > HellMPI: error while loading shared libraries: liborte.so.0: cannot > open shared object file: No such file or directory > HellMPI compiled with Open MPI 1.1 mpicc. Version 1.2 has libopen-rte.so and not liborte.so. -- Gleb.
Re: [OMPI users] Basic problems with OpenMPI
Hi gleb, > Have you installed Open MPI at the same place on all nodes? What command > line are you using to run app on more then one host? this is a sample run amit@ubuntu-desktop-1:~/mpi-exec$ mpirun --np 2 --hostfile mpi-host-file HellMPI amit@ubuntu-desktop-2's password: HellMPI: error while loading shared libraries: liborte.so.0: cannot open shared object file: No such file or directory I have them installed at the same place, (I have used the 'configure' switch that you told me earlier) Hope that helps. Thanks -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com
Re: [OMPI users] Basic problems with OpenMPI
On Wed, Aug 29, 2007 at 02:32:58PM +0530, Amit Kumar Saha wrote: > Hi all, > > I have installed OpenMPI 1.2.3 on all my hosts (3). > > Now when I try to start a simple demo program ("hello world") using > ./a.out I get the error. When I run my program using "mpirun" on more > than one host it gives me similar error: > > error while loading shared libraries: libopen-rte.so.0: cannot open > shared object file: No such file or directory > > However when I do a mpirun a.out , it gives me no error. > > Please suggest > Have you installed Open MPI at the same place on all nodes? What command line are you using to run app on more then one host? -- Gleb.
Re: [OMPI users] Basic problems with OpenMPI
Hi all, I have installed OpenMPI 1.2.3 on all my hosts (3). Now when I try to start a simple demo program ("hello world") using ./a.out I get the error. When I run my program using "mpirun" on more than one host it gives me similar error: error while loading shared libraries: libopen-rte.so.0: cannot open shared object file: No such file or directory However when I do a mpirun a.out , it gives me no error. Please suggest Thanks Amit -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com
Re: [OMPI users] Basic problems with OpenMPI
On Wed, Aug 29, 2007 at 01:03:30PM +0530, Amit Kumar Saha wrote: > Also, is open MPI 1.1 compatible with MPI 1.2.3, I mean to ask is > whether a MPI executable generated using 1.1 is executable by 1.2.3? No. They are not compatible. -- Gleb.
Re: [OMPI users] Basic problems with OpenMPI
Hi Gleb, On 8/29/07, Gleb Natapovwrote: > Where have you installed it? If in /usr/local/ then try to run > mpirun --prefix /usr/local/ --np 1 --hostfile hostfile ./a.out Thanks again. It solves the problem. > > If this helps then you may want to re-run configure script with flag > --enable-orterun-prefix-by-default and recompile. Also, is open MPI 1.1 compatible with MPI 1.2.3, I mean to ask is whether a MPI executable generated using 1.1 is executable by 1.2.3? i am trying to run a 1.1 generated executable on a remote 1.2.3 host when i get the following: amit@ubuntu-desktop-1:~/mpi-exec$ mpirun -np 3 --hostfile /home/amit/junk/mpi-codes/mpi-host-file --mca btl ^openib ./HellMPI amit@debian-desktop-1's password: amit@ubuntu-desktop-2's password: [ubuntu-desktop-1:13202] [0,0,0] ORTE_ERROR_LOG: Data unpack failed in file dss/dss_peek.c at line 59 [ubuntu-desktop-1:13202] [0,0,0] ORTE_ERROR_LOG: Data unpack failed in file dss/dss_peek.c at line 59 [ubuntu-desktop-1:13202] [0,0,0] ORTE_ERROR_LOG: Data unpack failed in file dss/dss_peek.c at line 59 Note that the host "debian-desktop-1" is 1.2.3 and the other 2 is 1.1 Regards -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com
Re: [OMPI users] Basic problems with OpenMPI
On Wed, Aug 29, 2007 at 12:26:54PM +0530, Amit Kumar Saha wrote: > Hello all, > > I have installed Open MPI 1.2.3 from source on Debian 4.0. I did the > "make all install" using root privileges. > > Now when I try to execute a simple program , I get the following: > > debian-desktop-1:/home/amit/junk/mpi-codes# mpirun --np 1 --hostfile > hostfile ./a.out > ./a.out: error while loading shared libraries: libmpi.so.0: cannot > open shared object file: No such file or directory > > I get the error whether I do it as "normal user" or "root user" > > Please suggest. > Where have you installed it? If in /usr/local/ then try to run mpirun --prefix /usr/local/ --np 1 --hostfile hostfile ./a.out If this helps then you may want to re-run configure script with flag --enable-orterun-prefix-by-default and recompile. -- Gleb.
Re: [OMPI users] Basic problems with OpenMPI
Hello all, I have installed Open MPI 1.2.3 from source on Debian 4.0. I did the "make all install" using root privileges. Now when I try to execute a simple program , I get the following: debian-desktop-1:/home/amit/junk/mpi-codes# mpirun --np 1 --hostfile hostfile ./a.out ./a.out: error while loading shared libraries: libmpi.so.0: cannot open shared object file: No such file or directory I get the error whether I do it as "normal user" or "root user" Please suggest. Thanks -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com
Re: [OMPI users] Basic problems with OpenMPI
On 8/29/07, Gleb Natapovwrote: > On Wed, Aug 29, 2007 at 11:42:29AM +0530, Amit Kumar Saha wrote: > > hello all, > > > > I am just trying to get started with OpenMPI (version 1.1) on Linux. > Vesrion 1.1 is old an no longer supported. > > > > > When I try to run a simple MPI - "Hello World" program, here is what i get: > > > > amit@ubuntu-desktop-1:~/junk/mpi-codes$ mpirun -np 1 --hostfile > > mpi-host-file ./a.out > > libibverbs: Fatal: couldn't read uverbs ABI version. > > -- > > [0,1,0]: OpenIB on host ubuntu-desktop-1 was unable to find any HCAs. > > Another transport will be used instead, although this may result in > > lower performance. > > -- > > Processor 0 of 1: Hello World! > > > > Please explain the statements above. > Open MPI has Infiniband module compiled but there is no IB device found > on your host. Try to add "--mca btl ^openib" string to your command > line. > > > > > Also, when I am trying to launch the above process on 2 processors, > > instead of one, it gives me: > > > > Failed to find or execute the following executable: > > > > Host: ubuntu-desktop-2 > > Executable: ./a.out > > > > Cannot continue. > > > > Does that mean I have to place a copy of the executable on the other > > node as well? Where should I place the executable? > > > Yes. At the same location on each host. Thank you very much Gleb. It works! Regards -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com
Re: [OMPI users] Basic problems with OpenMPI
On Wed, Aug 29, 2007 at 11:42:29AM +0530, Amit Kumar Saha wrote: > hello all, > > I am just trying to get started with OpenMPI (version 1.1) on Linux. Vesrion 1.1 is old an no longer supported. > > When I try to run a simple MPI - "Hello World" program, here is what i get: > > amit@ubuntu-desktop-1:~/junk/mpi-codes$ mpirun -np 1 --hostfile > mpi-host-file ./a.out > libibverbs: Fatal: couldn't read uverbs ABI version. > -- > [0,1,0]: OpenIB on host ubuntu-desktop-1 was unable to find any HCAs. > Another transport will be used instead, although this may result in > lower performance. > -- > Processor 0 of 1: Hello World! > > Please explain the statements above. Open MPI has Infiniband module compiled but there is no IB device found on your host. Try to add "--mca btl ^openib" string to your command line. > > Also, when I am trying to launch the above process on 2 processors, > instead of one, it gives me: > > Failed to find or execute the following executable: > > Host: ubuntu-desktop-2 > Executable: ./a.out > > Cannot continue. > > Does that mean I have to place a copy of the executable on the other > node as well? Where should I place the executable? > Yes. At the same location on each host. -- Gleb.
[OMPI users] Basic problems with OpenMPI
hello all, I am just trying to get started with OpenMPI (version 1.1) on Linux. When I try to run a simple MPI - "Hello World" program, here is what i get: amit@ubuntu-desktop-1:~/junk/mpi-codes$ mpirun -np 1 --hostfile mpi-host-file ./a.out libibverbs: Fatal: couldn't read uverbs ABI version. -- [0,1,0]: OpenIB on host ubuntu-desktop-1 was unable to find any HCAs. Another transport will be used instead, although this may result in lower performance. -- Processor 0 of 1: Hello World! Please explain the statements above. Also, when I am trying to launch the above process on 2 processors, instead of one, it gives me: Failed to find or execute the following executable: Host: ubuntu-desktop-2 Executable: ./a.out Cannot continue. Does that mean I have to place a copy of the executable on the other node as well? Where should I place the executable? Thanks in advance. Regards -- Amit Kumar Saha [URL]:http://amitsaha.in.googlepages.com