Looks like someone is creating our test files outside of jenkins. I modified the job to output our id and look at the perms on those files:
---- [ZooKeeper-trunk] $ /bin/bash /tmp/jenkins291402182647699851.sh uid=910(jenkins) gid=910(jenkins) groups=910(jenkins),999(docker) drwxr-xr-x 3 10025 12036 4096 Nov 10 01:39 /tmp/zkdata -rw-r--r-- 1 10025 12036 2 Nov 10 01:39 /tmp/zkdata/myid /tmp/zkdata/version-2: total 20 drwxr-xr-x 2 10025 12036 4096 Oct 22 23:35 . drwxr-xr-x 3 10025 12036 4096 Nov 10 01:39 .. -rw-r--r-- 1 10025 12036 1 Oct 22 23:35 acceptedEpoch -rw-r--r-- 1 10025 12036 1 Oct 22 23:35 currentEpoch -rw-r--r-- 1 10025 12036 562 Oct 22 23:35 snapshot.0 ---- Notice that it's not jenkins. Can you (Abe?) submit a jira/patch (ASAP as it's breaking the build) to create a new directory in /tmp and then host all the tmp files there? Thanks, Patrick On Tue, Nov 21, 2017 at 10:37 AM, Patrick Hunt <[email protected]> wrote: > With the same issue? Does it ever pass? > > Patrick > > On Tue, Nov 21, 2017 at 10:32 AM, Andor Molnar <[email protected]> wrote: > >> I checked back a few failing builds and see different hosts failing: H4, >> H9, H12, ... >> >> >> >> >> >> On Tue, Nov 21, 2017 at 6:26 PM, Patrick Hunt <[email protected]> wrote: >> >> > Could it be an environment issue? I see the following just before the >> > failure: >> > >> > [exec] rm: cannot remove '/tmp/zkdata/myid': Permission denied >> > >> > check whether it's happening on just one host (jenkins). >> > >> > Patrick >> > >> > On Tue, Nov 21, 2017 at 6:25 AM, Andor Molnar <[email protected]> >> wrote: >> > >> > > Looks like only https://builds.apache.org/job/ZooKeeper-trunk is >> > affected. >> > > >> > > >> > > On Tue, Nov 21, 2017 at 3:22 PM, Andor Molnar <[email protected]> >> > wrote: >> > > >> > > > Hi, >> > > > >> > > > Zookeeper build has been failing for a while with some weird error >> in >> > > > test-core-cppunit task. In most cases the error is the following: >> > > > >> > > > ... >> > > > [exec] Zookeeper_simpleSystem::testGetChildren2 : elapsed >> 1052 : >> > OK >> > > > [exec] Zookeeper_simpleSystem::testLastZxid : elapsed 4520 : >> OK >> > > > [exec] Zookeeper_simpleSystem::testRemoveWatchers ZooKeeper >> > server >> > > > started : elapsed 5390 : OK >> > > > [exec] rm: cannot remove '/tmp/zkdata/myid': Permission denied >> > > > [exec] Zookeeper_readOnly::testReadOnly : assertion : elapsed >> > 4018 >> > > > [exec] /home/jenkins/jenkins-slave/wo >> rkspace/ZooKeeper-trunk/src/ >> > > > c/tests/TestReadOnlyClient.cc:99: Assertion: equality assertion >> failed >> > > > [Expected: 0, Actual : -4] >> > > > [exec] Failures !!! >> > > > [exec] Run: 74 Failure total: 1 Failures: 1 Errors: 0 >> > > > [exec] FAIL: zktest-mt >> > > > [exec] ========================================== >> > > > [exec] 1 of 2 tests failed >> > > > [exec] Please report to [email protected] >> > > > [exec] ========================================== >> > > > [exec] Makefile:1744: recipe for target 'check-TESTS' failed >> > > > [exec] make[1]: Leaving directory '/home/jenkins/jenkins-slave/ >> > > > workspace/ZooKeeper-trunk/build/test/test-cppunit' >> > > > [exec] Makefile:2000: recipe for target 'check-am' failed >> > > > [exec] /home/jenkins/jenkins-slave/wo >> rkspace/ZooKeeper-trunk/src/ >> > > c/tests/zkServer.sh: >> > > > line 62: kill: (10156) - No such process >> > > > [exec] make[1]: *** [check-TESTS] Error 1 >> > > > [exec] make: *** [check-am] Error 2 >> > > > >> > > > ---------------------- >> > > > >> > > > Test at line TestReadOnlyClient.cc:99 got ConnectionLoss event. >> > > > Does anyone has a clue what could be the root cause of this? >> > > > >> > > > Regards, >> > > > Andor >> > > > >> > > > >> > > >> > >> > >
