Re: Should truncated READDIR replies return -EIO?

Chuck Lever Fri, 08 Feb 2008 11:30:44 -0800

On Feb 8, 2008, at 1:16 PM, Peter Staubach wrote:

Chuck Lever wrote:
On Feb 8, 2008, at 10:39 AM, Peter Staubach wrote:
Trond Myklebust wrote:
On Fri, 2008-02-08 at 10:04 -0500, Jeff Layton wrote:
Recently, I ran across a server-side bug that caused the serverto sendtruncated READDIR replies. The server would send a valid RPCresponse to
a READDIR call, but the contents of it were basically missing
(everything after the status).
The server problem had long been patched in mainline kernels,but the
interesting bit was that clients didn't return an error in this
situation. The XDR decoders for readdir calls are supposed tocheck the
validity of the response, but in this situation it just fudges the
contents of the pagecache to make it look like a completely empty
directory.
Shouldn't the client return an error in this situation? Theresponseobviously isn't valid so it seems like it shouldn't pretendthat it is.
If so, would something like the following patch make sense?
It is quite valid (though silly!) for a server to return aREADDIR replywith no entries. AFAICR there were servers that actually didthis at one
point (though I shall refrain from naming and shaming).
So whereas I agree that it might be correct to flag a READDIRreply that
contains no entries due to XDR encoding bugs, I'm not sure that we
should be flagging errors in the case where the XDR is correct.
In this case, I believe that the response was malformed.  Pretty
much everything after the status was missing, including the EOF
indicator.  I would agree that it would be silly to return a
response with no error indicated, no entries, and the eof
indication set to false.

This really boils down to how do we handle malformed responses?
Is there a general policy to retransmit the request?  This would
seem to be the right thing because a malformed response would
result from many things including the TCP connection getting
dropped in the middle of receiving the response from a timeout
and other things.  However, in this situation, retransmitting
the request would just have resulted in the same, broken response
from the server.  This was due to a server bug, which has since
been fixed, but exists still out in nature.
Replies that are malformed network or RPC level packets aredropped by the RPC client, and the matching requests areretransmitted by the RPC client after a timeout. Network events(like your TCP connection example) result in a malformed RPC levelpacket that the RPC client never delivers to the XDR layer, andare thus retransmitted by the RPC client.
Replies that have malformed XDR are treated by the NFS client aserrors. The problem is the decoders (on Linux) are not terriblycareful about checking the correctness of the server's XDRencoding, especially in cases like READDIR (Not to mentioncompound RPCs!) where the decoding can be complex. Olaf hasmentioned the Linux XDR layer was hand-coded rather thanconstructed with rpcgen to keep the decoders simple and efficient.
Network-related corruption is likely to be caught by the lowerlayers. I tend to think that malformed XDR is nearly always agenuine software defect on the server, and thus not worthretransmitting (especially if it's an idempotent request!).
What happens if a response is interrupted in the middle by the
TCP connection being broken?  Is this caught at the RPC layer
and then rejected?

As I understand it, xs_tcp_read_request() checks for a truncated TCPread, and discards the reply by not invoking xprt_complete_rqst().If the TCP layer stops calling the RPC client back with more bytes onthe socket, then xprt_complete_rqst() is never invoked to mark theRPC request as complete.

So, ostensibly, the RPC client will discard a partially received RPCreply and at some later point, time out the pending request andretransmit it.


--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com
-
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: Should truncated READDIR replies return -EIO?

Reply via email to