Ick!

This is a long-known problem [apparently] with Sun's NFS, unfortunately. :-(

I'd be ok with this if there is an eventual bail out of the loop -- the prospect of an infinite loop is a bit scary for me.


On Oct 16, 2007, at 11:23 AM, Ethan Mallove wrote:

On certain NFS servers, I run into the error message
"Interrupted system call" when executing long running
commands such as "make all". One solution I've been able to
use is to setup an NFS mount point solely for the cluster
I'm using, but this is not always an option. The below link
advises to restart the build on "Interrupted system call":

  http://developers.sun.com/solaris/articles/parallel_make.html

I wrapped the GNU_Install.pm make commands in a do-while to
effect the build restarts. E.g.,

  do {
      $x = MTT::DoCommand::Cmd("make install")
} while (!MTT::DoCommand::wsuccess($x->{exit_status}) and ($x-> {result_stderr} =~ /interrupted system call/i));

As long as make emits "interrupted system call" and fails,
MTT will keep restarting make.

I realize this is ugly, but is it acceptable?

-Ethan
_______________________________________________
mtt-devel mailing list
mtt-de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/mtt-devel


--
Jeff Squyres
Cisco Systems

Reply via email to