devel-boun...@open-mpi.org wrote on 27/02/2012 15:53:06:

> From: Ralph Castain <r...@open-mpi.org>
> To: Open MPI Developers <de...@open-mpi.org>
> Date: 27/02/2012 16:17
> Subject: Re: [OMPI devel] Problem with the openmpi-default-hostfile
> (on the trunk)
> Sent by: devel-boun...@open-mpi.org
> 
> That's strange - I run on slurm frequently and never have this 
> problem, and my default hostfile is present and empty. Do you have 
> anything in your default mca param file that might be telling us to 
> use the hostfile?
> 
> The only way I can find to get that behavior is if your default mca 
> param file includes the orte_default_hostfile value. In that case, 
> you are telling us to use the default hostfile, and so we will enforce it.

Hi Ralph,

On my side, the default value of orte_default_hostfile points to
etc/openmpi-default-hostfile.
The command ompi_info -a gives:

MCA orte: parameter "orte_default_hostfile" (current value: 
<..../etc/openmpi-default-hostfile>, data source: default value)
Name of the default hostfile (relative or absolute path, "none" to ignore 
environmental or default MCA param setting)

The following files are empty:
 - .../etc/openmpi-mca-params.conf
 - $HOME/.openmpi/mca-params.conf
Another solution for me is to put "orte_default_hostfile=none" in one of 
these files.
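
As a sketch of that last workaround, the entry in the per-user file could
look like this (the comment lines are only illustrative):

```
# $HOME/.openmpi/mca-params.conf
# Ignore any default hostfile, including an empty one
orte_default_hostfile=none
```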

Pascal

> 
> On Feb 27, 2012, at 5:57 AM, pascal.dev...@bull.net wrote:
> 
> Hi all, 
> 
> I have problems with the openmpi-default-hostfile since the 
> following patch on the trunk 
> 
> changeset:   19874:088fc6c84a9f 
> user:        rhc 
> date:        Wed Feb 01 17:40:44 2012 +0000 
> summary:     In accordance with prior releases, we are supposed to 
> default to looking at the openmpi-default-hostfile as a default 
> hostfile. Restore that behavior, but ignore the file if it is empty.
> Allow the user to ignore any MCA param setting pointing to a default
> hostfile by setting the param to "none" (via cmd line or whatever) -
> this allows them to override a setting in the system default MCA param file.
> 
> According to the summary of this patch, the openmpi-default-hostfile
> is ignored if it is empty. 
> But when I run my jobs with slurm + mpirun, I get the following message:
> 
> --------------------------------------------------------------------------
> No nodes are available for this job, either due to a failure to 
> allocate nodes to the job, or allocated nodes being marked 
> as unavailable (e.g., down, rebooting, or a process attempting 
> to be relocated to another node when none are available). 
> 
> --------------------------------------------------------------------------
> 
> I am able to run my job if: 
>  - either I put my node(s) in the file etc/openmpi-default-hostfile 
>  - or use "-mca orte_default_hostfile none" in the mpirun command line 
>  - or "export OMPI_MCA_orte_default_hostfile=none" in my environment 
> 
> It appears that an empty openmpi-default-hostfile is not ignored. 
> This patch seems to be incomplete. 
> 
> Or am I misunderstanding something? 
> 
> Pascal Devèze
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel