Re: Testsuite failures on Solaris 2.10 on SPARC (was: testsuite results from master)

2010-11-16 Thread Dave Hart
On Tue, Nov 16, 2010 at 20:45 UTC, Ralf Wildenhues
ralf.wildenh...@gmx.de wrote:
 * Stefano Lattarini wrote on Sun, Nov 14, 2010 at 11:05:55PM CET:
  http://autobuild.josefsson.org/automake/log-201011141903417895000.txt

  checking build system type... sparc-sun-solaris2.10

  All the testsuite failures seem spurious, and due to the following
  rm error (in the distclean target, if I'm not mistaken):
    rm: Unable to remove directory ...: File exists

  What's going on here?

  Maybe NFS issues (wild guess)?

 Yes, these are NFS-related.  E.g., acloca13.test has leftover files
 .nfs6E44 and .nfs7E44 in tests/acloca13.dir/acloca13-1.0
 which correspond to install-sh and configure (or configure.lineno).

 This test fails reproducibly.  I cannot remove the files after the
 tests have ended.  So I suspect that these are file system issues
 that I don't have under control.

The NTP reference implementation make distcheck reliably fails
similarly on OpenSolaris in a NFS-mounted directory:

SunOS psp-os1 5.11 snv_111b i86pc i386 i86pc Solaris

I dug into it, because that machine is over twice as fast building NTP
as the other options, so I really wanted to make our distcheck work.
I gave up because the code was out of my control without complaining
further, but I'd still like to get around it.

The Sun NFS client intentionally defers deleting files, renaming them
to .nfs for some period of time that is not brief enough for make
distcheck to succeed.  If you delete a .nfs1234, don't be suprised to
see it silently renamed to .nfs5678 :)

It would be lovely from my perspective if Automake-generated distcheck
would ignore .nfs specifically and trust they will indeed be rm'd
eventually.

Cheers,
Dave Hart



Testsuite failures on Solaris 2.10 on SPARC (was: testsuite results from master)

2010-11-14 Thread Stefano Lattarini
[Reposting with a better subject. Sorry for the noise.]

On Sunday 14 November 2010, Ralf Wildenhues wrote:
 Hello Stefano,
 
 I think it is time to reevaluate some of the work that has been done in
 the last few weeks.  I'd like to ask you to postpone pushes of pending
 stuff that has a clock ticking, and look at the lots of new failures
 that a testsuite run shows on a few hosts; you can find them in some
 minutes on http://autobuild.josefsson.org/automake/.

Log:
 http://autobuild.josefsson.org/automake/log-201011141903417895000.txt

Minimal system info:
 configure: autobuild project... GNU Automake
 configure: autobuild revision... v1.11-225-gcdd3cf3
 configure: autobuild hostname... hikaru
 configure: autobuild mode... default
 configure: autobuild timestamp... 20111412T185841Z
 checking build system type... sparc-sun-solaris2.10

-*-*-*-

 All the testsuite failures seem spurious, and due to the following
 rm error (in the distclean target, if I'm not mistaken):
   rm: Unable to remove directory ...: File exists
 
 What's going on here?

 Maybe NFS issues (wild guess)?
  
http://www.unix.com/unix-dummies-questions-answers/72080-rm-unable-remove-directory-mnt-users-test-logs-file-exists.html

 Or something relate to FS logging or FS corruption (wilder guess)?
  http://www.webservertalk.com/archive101-2004-1-65261.html

-*-*-*-

I can say I test automake on a Solaris 10 system (i86pc) quite often,
and the testsuite is behaving quite well there (apart from some
pre-existing, mostly spurious failures).

Regards,
  Stefano