"이희승 (Trustin Lee) <[EMAIL PROTECTED]>" wrote:
Last night, we focused purely on sequential scanning of incoming data
when handling a composite buffer.  I kept thinking about our
decision not to introduce any new type while I slept, and I found a few
critical issues with it:

* As Rich pointed out, random access into the composite buffer has too
much overhead in finding the right ByteBuffer when only an index and an
Iterable<ByteBuffer> are provided.  More information (e.g. a cache or
an index tree) is needed for faster access.
* Even if all the user is doing is sequential scanning of incoming
data, it is quite painful when the user happens to access data which spans
more than one ByteBuffer.  For example, imagine the first buffer
contains 3 bytes and you want to read a long integer.
I don't see this as a critical issue. It all depends on what you are willing to deal with in your first filter. If we consider that everything which has been discussed applies to the very first filter (generally speaking, the codec), here is how I see the full problem:

1) You have some incoming bytes, received packet by packet; those packets are stored in BBs, accumulating until we find a free thread to deal with them.
2) The processing thread absorbs those buffers and starts to process them. Suppose this is a codec: we will have to decode the incoming "stream", assuming that this so-called "stream" is not really a stream, but a list of BBs (list != List here; we may just have more than one BB waiting).
3) The decoder's role is to transform this "stream" of bytes into a "Message".
4) To add some complexity to this whole picture, a BB may contain more than one "Message" (or, to be very clear, the start of a new "Message" can be found in the middle of an incoming BB).
5) Some "Messages" can't be created until we have received all the bytes which compose the message (for instance, a newline-terminated message has to wait for the '\n' char, which may come after having received a lot of bytes in a lot of BBs).
6) You may have to keep a really big piece of data in memory if your message is big.
7) When your decoder is done with the "Message" it is processing, it can forward it to the next filter and discard the BBs, if not already done, freeing some memory.
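To make the buffer-boundary problem in the steps above concrete, here is a minimal sketch using plain NIO only (no MINA types): an 8-byte long arrives split across two buffers, so a direct getLong() fails and the decoder has to gather the bytes by hand.

```java
import java.nio.ByteBuffer;
import java.nio.BufferUnderflowException;

public class BoundaryDemo {
    public static void main(String[] args) {
        // An 8-byte long split across two incoming buffers: 3 bytes + 5 bytes.
        byte[] longBytes = ByteBuffer.allocate(8).putLong(0x1122334455667788L).array();
        ByteBuffer first = ByteBuffer.wrap(longBytes, 0, 3).slice();
        ByteBuffer second = ByteBuffer.wrap(longBytes, 3, 5).slice();

        try {
            first.getLong(); // only 3 bytes remaining -> fails
        } catch (BufferUnderflowException e) {
            System.out.println("underflow: the long spans two buffers");
        }

        // The decoder has to gather the bytes from both buffers by hand.
        ByteBuffer assembled = ByteBuffer.allocate(8);
        assembled.put(first).put(second).flip();
        System.out.println(Long.toHexString(assembled.getLong())); // 1122334455667788
    }
}
```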

I'm not describing the opposite operation (i.e. encoding and sending data back to the client), but it's a bit simpler, as you handle the Message to encode and the BBs to send. There is just one little problem: how to handle a slow client, and how to protect the server against OOM in this case, as the messages can accumulate before they are sent to the client.

In many respects, the codec will be no fun to write if it has to deal with BBs, as you will have to check the remaining number of bytes in the BB _before_ getting some bytes out of it. Typically, your decoder will look like:

for each ByteBuffer buffer
    while buffer.hasRemaining()
        decoder.process( buffer.get() )  // over-simplified example: you may read more than one byte

This means that your codec should be able to stop decoding in the middle of a construction (Trustin's example is very clear: you have only 3 bytes in the current BB, and you want to construct a long).

This is possible: you just have to keep some state and deal with it. You also have to kick the codec when you receive new bytes.
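A minimal sketch of what that state-keeping could look like, assuming a hypothetical decoder that extracts 8-byte longs from whatever BBs arrive (the class and method names are illustrative, not an existing MINA API):

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stateful decoder: it reads 8-byte longs from a sequence of
// ByteBuffers, keeping the partial bytes between calls so a long may span
// several incoming buffers.
class LongDecoder {
    private final ByteBuffer pending = ByteBuffer.allocate(8); // partial long

    // Kicked every time new bytes arrive; returns the longs completed so far.
    List<Long> process(ByteBuffer in) {
        List<Long> out = new ArrayList<>();
        while (in.hasRemaining()) {
            pending.put(in.get());
            if (!pending.hasRemaining()) { // we have 8 bytes: emit a long
                pending.flip();
                out.add(pending.getLong());
                pending.clear();           // reset the state for the next one
            }
        }
        return out;
    }
}
```

With Trustin's example, process() on the first 3-byte BB returns nothing; the long is only emitted once the remaining 5 bytes arrive in the next BB.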

Another possibility is to mask all this complexity and offer a higher-level abstraction that allows the codec to poll the bytes from a "stream" (as suggested by some of our users), something like:

codec :
 ...
 byte[] data = stream.get( 8 )
 long myLong = createLong( data )
 ...

In this case, your codec does not need to be stateful, and you don't need to know that, behind the curtain, the stream is stored in BBs. We can also see that, in this case, you ask specifically for 8 bytes, which means that if the underlying BBs do not contain those 8 bytes, the get( 8 ) call will block until more bytes are received.

This leads to another kind of complexity, as your codec might consume a thread, which can lead to thread exhaustion in the pool. But from the user's point of view, it is much simpler to implement a codec using such a paradigm.
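For illustration, here is one way such a blocking stream could be sketched with standard java.util.concurrent classes: the I/O thread offers incoming BBs, and the codec thread's get( n ) blocks until n bytes are available. The names (ByteBufferStream, offer, get) are hypothetical, not a proposed API.

```java
import java.nio.ByteBuffer;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical blocking "stream" over a queue of incoming ByteBuffers.
class ByteBufferStream {
    private final BlockingQueue<ByteBuffer> queue = new LinkedBlockingQueue<>();
    private ByteBuffer current;

    // Called by the I/O thread when a packet arrives.
    void offer(ByteBuffer buffer) { queue.add(buffer); }

    // Called by the codec thread; blocks until n bytes have been received.
    byte[] get(int n) throws InterruptedException {
        byte[] data = new byte[n];
        int filled = 0;
        while (filled < n) {
            if (current == null || !current.hasRemaining()) {
                current = queue.take(); // blocks the codec thread here
            }
            int chunk = Math.min(n - filled, current.remaining());
            current.get(data, filled, chunk);
            filled += chunk;
        }
        return data;
    }
}
```

The codec then simply does ByteBuffer.wrap( stream.get( 8 ) ).getLong(), with no state of its own; the blocking in queue.take() is exactly where the thread-exhaustion risk lives.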

Now, I don't see why those two scenarios would be incompatible. We can provide a Filter which will transform the incoming BBs into any other kind of structure, including a byte stream. The idea is that from one Filter to the next, you have transformed the data. Here, we will have:

client data stored into BBs -> Filter 1 transforms the BBs into a stream -> Filter 2 is the decoder, transforming the stream into Messages -> Filter 3 ...

So the question is: why do we have to define a very special new type when a specific filter can do the work? What if we provide this filter, using existing Java classes, to ease the codec implementor's work instead?

(PS: it does not change the consensus we reached yesterday: the first filter will still have to deal with either BB or Iterator<BB>.)

WDYT? Am I missing something?

--
cordialement, regards,
Emmanuel Lécharny
www.iktek.com
directory.apache.org
