[Haskell-cafe] Re: sendfile leaking descriptors on Linux?

Bardur Arantsson Thu, 11 Feb 2010 16:49:47 -0800

Jeremy Shaw wrote:


On Feb 11, 2010, at 1:57 PM, Bardur Arantsson wrote:

[--snip lots of technical info--]

Thanks for digging so much into this.

Just a couple of comments:

The whole point of the sendfile library is to use sendfile(), so notusing sendfile() seems like the wrong solution.

Heh, well, presumably it could still use sendfile() only platforms whereit can actually guarantee correctness :).

There is some evidence that when you are doing select() on a readfds,and the connection is closed, select() will indicate that the fds isready to be read, but when you read it, you get 0-bytes. That indicatesthat a disconnect has happened. However, if you are only doingread()/recv(), I expect that only happens in the event of a properdisconnect, because if you are just listening for packets, there is noway to tell the difference between the sender just not saying anything,and the sender dying:

True, but the point here is that the OS has a built-in timeout mechanism(via keepalives) and *can* tell the program when that timeout has elapsed.

That's the timeout we're trying to "get at" instead of having toimplement a new one.

Good point about the the readfds triggering when the client disconnects.I think that's what I've been seeing in all my other network-relatedcode and I just misremembered the details. All my code is extremelylikely to have been both reading and writing from (roughly) the same setof FDs at the same time.

If this method of detection is correct, then what we need is athreadWaitReadWrite, that will notify us if the socket can be read orwritten. The IO manager does not currently provide a function likethat.. but we could fake it like this: (untested):
import Control.Concurrent
import Control.Concurrent.MVar
import System.Posix.Types

data RW = Read | Write

threadWaitReadWrite :: Fd -> IO RW
threadWaitReadWrite fd =
  do m <- newEmptyMVar
     rid <- forkIO $ threadWaitRead fd  >> putMVar m Read
     wid <- forkIO $ threadWaitWrite fd >> putMVar m Write
     r <- takeMVar m
     killThread rid
     killThread wid
     return r

I'll try to get the sendfile code to use this instead. AFAICT itshouldn't actually be necessary to "peek" on the read end of the socketto detect that something has gone wrong. We're guaranteed thatsendfile() to a connection that's died (according to the OS, either dueto proper disconnect or a timeout) will fail.

I might get a bit tricky to use this if the client is actually expectingto send proper data while the sendfile() is in progress -- if there'sactual data to be read from the socket() then the naive "replacethreadWaitR by threadWaitRW" will end up busy-waiting on EAGAIN sincethe socket() will be readable every time

threadWaitReadWrite gets called.

HOWEVER, that's not an issue in my particular scenario, so a simplerelacement of threadWaitWrite by threadWaitReadWrite should do fine fortesting purposes.

Of course, in the case where the client disconnects because someoneturns off the power or pulls the ethernet cable, we have no way ofknowing what is going on -- so there is still the possibility that deadconnections will be left open for a long time.

True, but then it's (properly) left to the OS to decide and timeouts canbe controlled via setsockopt -- as they should IMO.


I'll test tomorrow.

What I'll expect is that I'll still see a few "dead" threads lingeringaround for ~60 seconds (the OS-based timeout), but that I'll not see anythreads lingering indefinitely -- something which usually happens aftera few hours of persistent use of my media server.


_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

[Haskell-cafe] Re: sendfile leaking descriptors on Linux?

Reply via email to