On Oct 16, 2007, at 6:36 PM, Ethan Mallove wrote:
The bail is that "make" will eventually succeed or fail
with something other than "interrupted system call". Do
we need another condition?
I'm just worried that Sun's NFS will somehow get in a
perpetual "interrupted system call" loop such that you'll
never actually break out of it.
How about a counter? E.g., after "x" number of "interrupted
system call" messages, MTT moves on. Or should the "command
restart" go in DoCommand.pm so we can have a timeout?
Either or both of those would be fine (don't we have a timeout in
DoCommand.pm already?).
I also noticed that our build script (which prints hundreds
of "interrupted system call" messages per build, but does
not seem to die because of them) uses "make -j 24" while MTT
has been using "make -j 4". I'll experiment with -j.
I know that Terry/Sun and co. spent a good amount of time trying to
solve the "interrupted system call" error -- they may have some more
information for you, such as how/why it happens...?
--
Jeff Squyres
Cisco Systems