I noticed today a drastic change in how ORTE deal with the hostfile between 
trunk and 1.5.

1. 1.5 and prior used the hostile as a suggestion, a placeholder where to pick 
the requested number of daemons during the launch. The current trunk spawn 
daemons on all the nodes provided on the host file, and then spawn the apps 
only on some of them.

2. If a default hostfile is provided and --host was specified 1.5 and prior use 
the nodes to limit the number of nodes in the environment to the requested 
nodes. The current trunk seems to ignore the --host option if a default 
hostfile is available.

In my configuration the hostfile is system wide, specified in the /etc via 
orte_default_hostfile. It contains all the nodes in the cluster, the users are 
supposed to use --host to limit their mpirun to a specified subset.

This seems a quite significant change. I would have expected an RFC.

  george.


Reply via email to