On Fri, 15 Nov 2002, Daniel Alberto Burbano wrote:
> As root, where the server is malambo:
>
> ****************
> [root@malambo root]# ssh malambo
> Warning: Permanently added 'malambo,192.168.1.200' (RSA) to the list of
> known hosts.
> Last login: Fri Nov 15 00:31:34 2002
> ****************
This is ok -- this always happens the first time you ssh somewhere.
> as root, where the first client is node2:
>
> ************
> [dburbano@malambo dburbano]$ ssh node2
> Last login: Fri Nov 15 01:38:46 2002 from malambo.ece.uprm.edu
> switcher:mpi: Cannot find modulefile for mpic-1.2.4 -- skipping
> [dburbano@node2 dburbano]$
> ****************
As you discuss below, switcher complains because you don't have an MPI
implementation installed by that name.
> I get the switcher message because I don't have an MPI
> implementation with that name. I changed it to an available MPI
> implementation and manually ran the command that pushes the
> configuration information to the nodes, but it did not update. The
> command was:
>
> /opt/opium/bin/sync_users
Yes, this is an unfortunate catch-22. If you set an incorrect MPI and
then push it out, you're hosed. I see two ways to fix this:
1. Don't push an incorrectly-set MPI out. ;-)
2. Use "cexec" instead of "cpush" to set the right MPI. cexec runs a
command on every node instead of pushing files out, so it won't run into
the same rsync issues:

    cexec switcher mpi = mpich-1.2.4 --system
That should set it correctly on all machines (such that sync_users isn't
necessary), and then cpush will work properly in the future.
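
If you want to double-check the name before changing anything, here is a rough sketch of the fix as a small shell function. The function names (run, fix_mpi_default) and the DRY_RUN variable are made up for illustration; the "--list" and "--show" options are what I'd expect switcher to accept, so check "man switcher" on your installation first. By default it only prints the commands it would run.

```shell
#!/bin/sh
# Sketch only: run, fix_mpi_default and DRY_RUN are hypothetical names.
# With DRY_RUN=1 (the default here) commands are printed, not executed.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

fix_mpi_default() {
    mpi_name="$1"
    # List the MPI names switcher knows about, to confirm the spelling
    # (assumes switcher supports --list).
    run switcher mpi --list
    # Set the system-wide default on every node, as suggested above.
    run cexec switcher mpi = "$mpi_name" --system
    # Show the resulting setting on every node (assumes --show exists).
    run cexec switcher mpi --show
}

fix_mpi_default mpich-1.2.4
```

Run it once as-is to see the three commands, then rerun as root with DRY_RUN=0 to actually apply them.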
--
{+} Jeff Squyres
{+} [EMAIL PROTECTED]
{+} http://www.lam-mpi.org/
_______________________________________________
Oscar-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/oscar-users