I tried to echo but it returns nothing. [pmdtest@pmd02 d00_dayfiles]$ echo ${MPIRUN} -np ${NPROC} -hostfile $i{ABSDIR}/hostlist -mca btl sm,openib,self --mca btl_openib_use_srq 1 ./hrm >> ${OUTFILE}_hrm 2>&1 [pmdtest@pmd02 d00_dayfiles]$
On Wed, Feb 29, 2012 at 12:01 PM, Jingcha Joba <pukkimon...@gmail.com>wrote: > Just to be sure, can u try > echo "${MPIRUN} -np ${NPROC} -hostfile ${ABSDIR}/hostlist -mca btl > sm,openib,self --mca btl_openib_use_srq 1 ./hrm >> ${OUTFILE}_hrm 2>&1" > and check if you are indeed getting the correct argument. > > If that looks fine, can u add --mca btl_openib_verbose 1 to the mpirun > argument list, and see what it says? > > > > On Tue, Feb 28, 2012 at 10:15 PM, Syed Ahsan Ali <ahsansha...@gmail.com>wrote: > >> After creating new hostlist and making the scripts again it is working >> now and picking up the hostlist as u can see : >> >> * >> ${MPIRUN} -np ${NPROC} -hostfile ${ABSDIR}/hostlist -mca btl >> sm,openib,self --mca btl_openib_use_srq 1 ./hrm >> ${OUTFILE}_hrm 2>&1 >> (The above command is used to submit job)* >> >> * >> [pmdtest@pmd02 d00_dayfiles]$ tail -f *_hrm >> mpirun -np 32 /home/MET/hrm/bin/hrm >> * >> but it just stays on this command and the model simulation don't start >> further. I can't understand this behavior because the simulation works >> fine when hostlist is not given as follows: >> >> * ${MPIRUN} -np ${NPROC} ./hrm >> ${OUTFILE}_hrm 2>&1* >> >> ** >> ** >> * * >> >> On Tue, Feb 28, 2012 at 3:49 PM, Jeffrey Squyres <jsquy...@cisco.com>wrote: >> >>> Yes, this is known behavior for our CLI parser. We could probably >>> improve that a bit... >>> >>> On Feb 28, 2012, at 4:55 AM, Ralph Castain wrote: >>> >>> > >>> > On Feb 28, 2012, at 2:52 AM, Reuti wrote: >>> > >>> >> Am 28.02.2012 um 10:21 schrieb Ralph Castain: >>> >> >>> >>> Afraid I have to agree with the prior reply - sounds like NPROC >>> isn't getting defined, which causes your cmd line to look like your >>> original posting. >>> >> >>> >> Maybe the best to investigate this is to `echo` $MPIRUN and $NPROC. >>> >> >>> >> But: is this the intended behavior of mpirun? It looks like -np is >>> eating -hostlist as a numeric argument? Shouldn't it complain about: >>> argument for -np missing or argument not being numeric? >>> > >>> > Probably - I'm sure that the atol is returning zero, which should >>> cause an error output. I'll check. >>> > >>> > >>> >> >>> >> -- Reuti >>> >> >>> >> >>> >>> >>> >>> On Feb 27, 2012, at 10:29 PM, Syed Ahsan Ali wrote: >>> >>> >>> >>>> The following command in used in script for job submission >>> >>>> >>> >>>> ${MPIRUN} -np ${NPROC} -hostfile ${ABSDIR}/hostlist -mca btl >>> sm,openib,self --mca btl_openib_use_srq 1 ./hrm >> ${OUTFILE}_hrm 2>&1 >>> >>>> where NPROC in defined in someother file. The same application is >>> running on the other system with same configuration. >>> >>>> >>> >>>> On Tue, Feb 28, 2012 at 10:12 AM, PukkiMonkey < >>> pukkimon...@gmail.com> wrote: >>> >>>> No of processes missing after -np >>> >>>> Should be something like: >>> >>>> mpirun -np 256 ./exec >>> >>>> >>> >>>> >>> >>>> >>> >>>> Sent from my iPhone >>> >>>> >>> >>>> On Feb 27, 2012, at 8:47 PM, Syed Ahsan Ali <ahsansha...@gmail.com> >>> wrote: >>> >>>> >>> >>>>> Dear All, >>> >>>>> >>> >>>>> I am running an application with mpirun but it gives following >>> error, it is not picking up hostlist, there are other applications which >>> run well with hostlist but it just gives following error with >>> >>>>> >>> >>>>> >>> >>>>> [pmdtest@pmd02 d00_dayfiles]$ tail -f *_hrm >>> >>>>> mpirun -np /home/MET/hrm/bin/hrm >>> >>>>> >>> -------------------------------------------------------------------------- >>> >>>>> Could not execute the executable "/home/MET/hrm/bin/hostlist": >>> Exec format error >>> >>>>> >>> >>>>> This could mean that your PATH or executable name is wrong, or >>> that you do not >>> >>>>> have the necessary permissions. Please ensure that the executable >>> is able to be >>> >>>>> found and executed. >>> >>>>> >>> >>>>> >>> -------------------------------------------------------------------------- >>> >>>>> >>> >>>>> Following the permission of the hostlist directory. Please help me >>> to remove this error. >>> >>>>> >>> >>>>> [pmdtest@pmd02 bin]$ ll >>> >>>>> total 7570 >>> >>>>> -rwxrwxrwx 1 pmdtest pmdtest 2517815 Feb 16 2012 gme2hrm >>> >>>>> -rwxrwxrwx 1 pmdtest pmdtest 0 Feb 16 2012 gme2hrm.map >>> >>>>> -rwxrwxrwx 1 pmdtest pmdtest 473 Jan 30 2012 hostlist >>> >>>>> -rwxrwxrwx 1 pmdtest pmdtest 5197698 Feb 16 2012 hrm >>> >>>>> -rwxrwxrwx 1 pmdtest pmdtest 0 Dec 31 2010 hrm.map >>> >>>>> -rwxrwxrwx 1 pmdtest pmdtest 1680 Dec 31 2010 mpd.hosts >>> >>>>> >>> >>>>> >>> >>>>> Thank you and Regards >>> >>>>> Ahsan >>> >>>>> >>> >>>>> >>> >>>>> >>> >>>>> >>> >>>>> >>> >>>>> _______________________________________________ >>> >>>>> users mailing list >>> >>>>> us...@open-mpi.org >>> >>>>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >>>> >>> >>>> _______________________________________________ >>> >>>> users mailing list >>> >>>> us...@open-mpi.org >>> >>>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >>>> >>> >>>> >>> >>>> >>> >>>> -- >>> >>>> Syed Ahsan Ali Bokhari >>> >>>> Electronic Engineer (EE) >>> >>>> >>> >>>> Research & Development Division >>> >>>> Pakistan Meteorological Department H-8/4, Islamabad. >>> >>>> Phone # off +92518358714 >>> >>>> Cell # +923155145014 >>> >>>> >>> >>>> _______________________________________________ >>> >>>> users mailing list >>> >>>> us...@open-mpi.org >>> >>>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >>> >>> >>> _______________________________________________ >>> >>> users mailing list >>> >>> us...@open-mpi.org >>> >>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >> >>> >> >>> >> _______________________________________________ >>> >> users mailing list >>> >> us...@open-mpi.org >>> >> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> > >>> > >>> > _______________________________________________ >>> > users mailing list >>> > us...@open-mpi.org >>> > http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >>> >>> -- >>> Jeff Squyres >>> jsquy...@cisco.com >>> For corporate legal information go to: >>> http://www.cisco.com/web/about/doing_business/legal/cri/ >>> >>> >>> _______________________________________________ >>> users mailing list >>> us...@open-mpi.org >>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >> >> >> >> -- >> Syed Ahsan Ali Bokhari >> Electronic Engineer (EE) >> >> Research & Development Division >> Pakistan Meteorological Department H-8/4, Islamabad. >> Phone # off +92518358714 >> Cell # +923155145014 >> >> > -- Syed Ahsan Ali Bokhari Electronic Engineer (EE) Research & Development Division Pakistan Meteorological Department H-8/4, Islamabad. Phone # off +92518358714 Cell # +923155145014