On Tue, 2017-09-05 at 05:41 -0400, William Allen Simpson wrote: > On 9/4/17 6:59 AM, Swen Schillig wrote: > > On Sat, 2017-09-02 at 00:15 -0400, William Allen Simpson wrote: > > > On 9/1/17 6:09 PM, Frank Filz wrote: > > > > Lately, we have been plagued by a lot of intermittent test > > > > failures. > > > > > > > > I have seen intermittent failures in pynfs WRT14, WRT15, and > > > > WRT16. > > > > These > > > > have not been resolved by the latest ntirpc pullup. > > > > > > > > > > Details? What's WRT16? > > > > > > My pynfs results say: > > > > > > WRT13 st_write.testDoubleWrite > > > : > > > PASS > > > WRT14 st_write.testLargeWrite > > > : > > > PASS > > > WRT15 st_write.testSizes > > > : > > > PASS > > > WRT18 st_write.testChangeGranularityWrite > > > : > > > FAILURE > > > consecutive SETATTR(mode)'s don't all change change > > > attribute > > > WRT19 st_write.testStolenStateid > > > : > > > PASS > > > ************************************************** > > > Command line asked for 584 of 668 tests > > > Of those: 10 Skipped, 2 Failed, 2 Warned, 570 Passed > > > > My latest (dev.7) say > > > > WRT5 st_write.testLargeData : > > FAILURE > > RPCAcceptError: RPCError: MSG_ACCEPTED: GARBAGE_ARGS > > > > 100% reproducible all others succeed. > > > > Of course, my WRT5 passes. > > But this is wonderful. Please tell us how you get this 100% > reproducible > result, so that we can reproduce it.... I'm afraid I'm not doing anything special.
Anyway, here are the details of the box running ganesha - 3.10.0-327.el7.ppc64le #1 SMP Thu Oct 29 17:31:13 EDT 2015 ppc64le ppc64le ppc64le GNU/Linux - ganesha-2.6-dev.7 ganesha.conf EXPORT { Export_Id = 12; Path = /gpfs0/V4; Pseudo = /V4; FSAL { Name = GPFS; } CLIENT { Access_Type = RW; Squash = No_root_squash; Clients=9.244.58.107,9.155.107.180,9.155.107.185,9.244.58.169; } } Client 1 Linux fscc-p8-16-o 3.10.0-327.el7.ppc64 #1 SMP Thu Oct 29 17:33:57 EDT 2015 ppc64 ppc64 ppc64 GNU/Linux executing ./nfs4.0/testserver.py -v --outfile ~/pynfs40.out --maketree 9.155.107.180:/V4 --showomit --secure --rundep WRT5 gives INIT st_setclientid.testValid : RUNNING INIT st_setclientid.testValid : PASS MKFILE st_open.testOpen : RUNNING MKFILE st_open.testOpen : PASS WRT5 st_write.testLargeData : RUNNING Got error: [Errno 104] Connection reset by peer Traceback (most recent call last): File "/root/pynfs-HEAD-6618874/nfs4.0/lib/testmod.py", line 222, in run self.runtest(self, environment) File "/root/pynfs-HEAD-6618874/nfs4.0/servertests/st_write.py", line 137, in testLargeData res = c.write_file(fh, data[pos:], pos, stateid) File "/root/pynfs-HEAD-6618874/nfs4.0/nfs4lib.py", line 768, in write_file res = self.compound(ops) File "/root/pynfs-HEAD-6618874/nfs4.0/nfs4lib.py", line 333, in compound res = self.call(NFSPROC4_COMPOUND, p.get_buffer()) File "/root/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 410, in call xid = self.send(procedure, data, program, version) File "/root/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 337, in send self.reconnect().send_record(header + data) File "/root/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 176, in _send_record self.sendall(mark + chunk) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 104] Connection reset by peer WRT5 st_write.testLargeData : FAILURE error: [Errno 104] Connection reset by peer ************************************************** INIT st_setclientid.testValid : PASS MKFILE st_open.testOpen : PASS WRT5 st_write.testLargeData : FAILURE error: [Errno 104] Connection reset by peer ************************************************** Command line asked for 3 of 668 tests Of those: 0 Skipped, 1 Failed, 0 Warned, 2 Passed Client 2 4.12.9-200.fc25.x86_64 #1 SMP Fri Aug 25 13:23:30 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux running ./nfs4.0/testserver.py -v --outfile ~/pynfs40.out --maketree 9.155.107.180:/V4 --showomit --secure --rundep WRT5 gives INIT st_setclientid.testValid : RUNNING INIT st_setclientid.testValid : PASS MKFILE st_open.testOpen : RUNNING MKFILE st_open.testOpen : PASS WRT5 st_write.testLargeData : RUNNING Got error: [Errno 104] Connection reset by peer Traceback (most recent call last): File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/lib/testmod.py", line 222, in run self.runtest(self, environment) File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/servertests/st_write.py", line 137, in testLargeData res = c.write_file(fh, data[pos:], pos, stateid) File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/nfs4lib.py", line 768, in write_file res = self.compound(ops) File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/nfs4lib.py", line 333, in compound res = self.call(NFSPROC4_COMPOUND, p.get_buffer()) File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 410, in call xid = self.send(procedure, data, program, version) File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 337, in send self.reconnect().send_record(header + data) File "/home/swen/GANESHA/pynfs-HEAD-6618874/nfs4.0/lib/rpc/rpc.py", line 176, in _send_record self.sendall(mark + chunk) File "/usr/lib64/python2.7/socket.py", line 228, in meth return getattr(self._sock,name)(*args) timeout: timed out WRT5 st_write.testLargeData : FAILURE timeout: timed out ************************************************** INIT st_setclientid.testValid : PASS MKFILE st_open.testOpen : PASS WRT5 st_write.testLargeData : FAILURE timeout: timed out ************************************************** Command line asked for 3 of 668 tests Of those: 0 Skipped, 1 Failed, 0 Warned, 2 Passed However, running the exact same test from the server itself succeeds. ./nfs4.0/testserver.py -v --outfile ~/pynfs40.out --maketree 9.155.107.180:/V4 --showomit --secure --rundep WRT5 INIT st_setclientid.testValid : RUNNING INIT st_setclientid.testValid : PASS MKFILE st_open.testOpen : RUNNING MKFILE st_open.testOpen : PASS WRT5 st_write.testLargeData : RUNNING WRT5 st_write.testLargeData : PASS ************************************************** INIT st_setclientid.testValid : PASS MKFILE st_open.testOpen : PASS WRT5 st_write.testLargeData : PASS ************************************************** Command line asked for 3 of 668 tests Of those: 0 Skipped, 0 Failed, 0 Warned, 3 Passed > ------------------------------------------------------------------------------ Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org! http://sdm.link/slashdot _______________________________________________ Nfs-ganesha-devel mailing list Nfs-ganesha-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nfs-ganesha-devel