Ralph Castain wrote:
The two files have a slightly different formatAgreed. and completely different meaning.Somewhat agreed. They're both related to mapping processes onto a cluster. The hostfile specifies how many slots are on a node. The rankfile specifies a rank and what node/slot it is to be mapped onto.Agreed. This is the main part I'm concerned about. E.g., % cat rankfile rank 0=node0 slot=0 rank 1=node1 slot=0 % mpirun -np 2 -rf rankfile ./a.out -------------------------------------------------------------------------- Rankfile claimed host node1 that was not allocated or oversubscribed it's slots: -------------------------------------------------------------------------- [node0:14611] [[61560,0],0] ORTE_ERROR_LOG: Bad parameter in file rmaps_rank_file.c at line 107 [node0:14611] [[61560,0],0] ORTE_ERROR_LOG: Bad parameter in file base/rmaps_base_map_job.c at line 86 [node0:14611] [[61560,0],0] ORTE_ERROR_LOG: Bad parameter in file base/plm_base_launch_support.c at line 86 [node0:14611] [[61560,0],0] ORTE_ERROR_LOG: Bad parameter in file plm_rsh_module.c at line 1016 % mpirun -np 2 -host node0,node1 -rf rankfile ./a.out 0 on node0 1 on node1 done It seems to me that the rankfile has sufficient information to express what I want it to do. But mpirun won't accept this. To fix this, I have to, e.g., supply/maintain/specify redundant information in a hostfile or host list. Right. And my issue is that I'm forced to use both when I only want rankfile functionality.
|
- [OMPI devel] why does --rankfile need hostlist? Eugene Loh
- Re: [OMPI devel] why does --rankfile need hostlist? Ralph Castain
- Re: [OMPI devel] why does --rankfile need hostli... Eugene Loh
- Re: [OMPI devel] why does --rankfile need ho... Ralph Castain
- Re: [OMPI devel] why does --rankfile nee... Terry Dontje
- Re: [OMPI devel] why does --rankfil... Ralph Castain
- Re: [OMPI devel] why does --ran... Lenny Verkhovsky
- Re: [OMPI devel] why does -... Ralph Castain
- Re: [OMPI devel] why does -... Lenny Verkhovsky
- Re: [OMPI devel] why does -... Terry Dontje
- Re: [OMPI devel] why does -... Mike Dubman
- Re: [OMPI devel] why does -... Terry Dontje
- Re: [OMPI devel] why does -... Eugene Loh