David M. Lloyd wrote:
<snip/>
I think that using a byte[] (for instance in the encoder) and
transforming it into a ByteBuffer is another way to deal with the problem.
One important point is that ByteBuffers are just meant to contain a
fixed amount of data. A ByteBuffer is a buffer, not a data structure.
Transforming ByteBuffers to make them able to expand twists their
intrinsic semantics.
Yes, it makes far more sense to accumulate buffers until you can
decode your message from them.
Or decode the stream as it comes, creating the object on the fly. A
stateful decoder...
So I would say that BBs should be used at the very low level (reading
and sending data), while the other layers should use byte[] or a
stream of bytes.
Honestly, I don't see the advantage of using byte[] - using at least
a wrapper object seems preferable.
This is what we are doing in ADS: LDAP messages are built on the fly,
simply by working directly with the ByteBuffers.
Consider that accumulating BBs to create a big byte[] should be
understood as: transform the BBs directly into the targeted wrapper
objects.
Thanks for correcting me :)
And if you're going to use a wrapper object, why not just use ByteBuffer.
Because you may receive more than one BB before you can build the
wrapper object.
This will lead to very interesting performance questions:
- how to handle large streams of data?
One buffer at a time. :-)
Well, I tried to think about other strategies, but, eh, you are just
plain right! It's up to the codec filter to deal with the complexity of
the data it has to decode!
- should we serialize the stream at some point?
What do you mean by "serialize"?
Write to disk if the received data is too big. See my previous point
(it's up to the decoder to deal with this).
- how to write an efficient decoder, when you may receive fractions
of what you are waiting for?
An ideal decoder would be a state machine which can be entered and
exited at any state. This way, even a partial buffer can be fully
consumed before returning to wait for the next buffer.
This is what we have in ADS: a stateful decoder. Not as simple as
having the whole data in memory, especially if you have to deal with
multi-byte markers, but not too complex either.
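For illustration, here is a minimal sketch of such a resumable state
machine. The names and the protocol (a 4-byte big-endian length
followed by a payload) are made up for the example; this is not the
actual ADS/LDAP code:

import java.nio.ByteBuffer;

// A decoder that can be exited and re-entered at any state.
public class StatefulDecoder {
    private enum State { READ_LENGTH, READ_PAYLOAD }

    private State state = State.READ_LENGTH;
    private final ByteBuffer lengthBuf = ByteBuffer.allocate(4);
    private ByteBuffer payload;

    // Returns a decoded payload, or null if more data is needed.
    // Call repeatedly on the same buffer: it may hold several messages.
    public byte[] decode(ByteBuffer in) {
        while (in.hasRemaining()) {
            switch (state) {
            case READ_LENGTH:
                while (in.hasRemaining() && lengthBuf.hasRemaining()) {
                    lengthBuf.put(in.get());
                }
                if (lengthBuf.hasRemaining()) {
                    return null; // partial length field, resume later
                }
                lengthBuf.flip();
                payload = ByteBuffer.allocate(lengthBuf.getInt());
                lengthBuf.clear();
                state = State.READ_PAYLOAD;
                break;
            case READ_PAYLOAD:
                while (in.hasRemaining() && payload.hasRemaining()) {
                    payload.put(in.get());
                }
                if (payload.hasRemaining()) {
                    return null; // partial payload, resume later
                }
                state = State.READ_LENGTH;
                return payload.array();
            }
        }
        return null;
    }
}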
However, many decoders are not ideal due to various constraints. In
the worst case, you could accumulate ByteBuffer instances until you
have a complete message that can be handled. What I do at this point
is to create a DataInputStream that encapsulates all the received
buffers.
Yeah, 100% agree.
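A minimal sketch of that accumulation approach (the class name is
hypothetical; it assumes the decoder has already determined that the
list holds a complete message):

import java.io.InputStream;
import java.nio.ByteBuffer;
import java.util.Iterator;
import java.util.List;

// An InputStream that drains a list of accumulated ByteBuffers in
// order, so a DataInputStream can read across fragment boundaries.
public class ByteBufferInputStream extends InputStream {
    private final Iterator<ByteBuffer> buffers;
    private ByteBuffer current;

    public ByteBufferInputStream(List<ByteBuffer> accumulated) {
        this.buffers = accumulated.iterator();
    }

    @Override
    public int read() {
        while (current == null || !current.hasRemaining()) {
            if (!buffers.hasNext()) {
                return -1; // all fragments drained
            }
            current = buffers.next();
        }
        return current.get() & 0xFF;
    }
}

Then new DataInputStream(new ByteBufferInputStream(received)) gives the
decoder one continuous view over all the fragments.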
Note that a buffer might contain data from more than one message as
well. So it's important to use only a slice of the buffer in this case.
Not a big deal. Again, it's the decoder's task to handle such a case. We
have encountered such a case in LDAP too.
(This makes me think that we should describe the LDAP codec on the MINA
site, just to give some insight to people who want to write a stateful
decoder.)
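For what it's worth, extracting just the first message from such a
shared buffer boils down to something like this (a sketch;
messageLength is assumed to come from the already-decoded header):

import java.nio.ByteBuffer;

public final class Buffers {
    // Returns a view over the next messageLength bytes of 'in' and
    // advances 'in' past them, leaving the rest for the next message.
    public static ByteBuffer sliceMessage(ByteBuffer in, int messageLength) {
        ByteBuffer view = in.slice(); // shares content, own position/limit
        view.limit(messageLength);
        in.position(in.position() + messageLength);
        return view;
    }
}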
- how to write an efficient encoder when you have no idea about the
size of the data you are going to send?
Use a buffer factory, such as IoBufferAllocator, or use an even
simpler interface like this:
import java.nio.ByteBuffer;

public interface BufferFactory {
    ByteBuffer createBuffer();
}
which mass-produces pre-sized buffers. In the case of stream-oriented
systems like TCP or serial, you could probably send buffers as you
fill them. For message-oriented protocols like UDP, you can
accumulate all the buffers to send, and then use a single gathering
write to send them as a single message (yes, this stinks in the
current NIO implementation, as Trustin pointed out in DIRMINA-518, but
it's no worse than the repeated copying that auto-expanding buffers
use; and APR and other possible backends [and, if I have any say at
all in it, future OpenJDK implementations] would hopefully not suffer
from this limitation).
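A sketch of what such a gathering write could look like on a connected
DatagramChannel (the names are illustrative, not MINA code):

import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.DatagramChannel;
import java.util.List;

public final class GatheringSend {
    // Sends all accumulated buffers as one datagram with a single
    // gathering write; assumes 'channel' is already connected, so
    // write() emits exactly one message and no intermediate copy is
    // made on our side.
    public static void send(DatagramChannel channel, List<ByteBuffer> accumulated)
            throws IOException {
        channel.write(accumulated.toArray(new ByteBuffer[0]));
    }
}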
That's an idea. But this does not solve one little problem: if the reader
is slow, you may saturate the server memory with prepared BBs. So you may
need a kind of throttle mechanism, or a blocking queue, to manage this
issue: a new BB should not be created until the previous one has been
completely sent.
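One possible shape for that throttle (a sketch, not an existing MINA
class): a pool with a fixed number of buffers, where acquiring blocks
until a previously sent buffer has been released:

import java.nio.ByteBuffer;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

// A bounded pool: at most 'capacity' buffers can be in flight at once.
// The writer blocks in acquire() until the sender releases a sent
// buffer, which throttles a fast producer against a slow reader.
public class ThrottledBufferPool {
    private final BlockingQueue<ByteBuffer> free;

    public ThrottledBufferPool(int capacity, int bufferSize) {
        free = new ArrayBlockingQueue<>(capacity);
        for (int i = 0; i < capacity; i++) {
            free.add(ByteBuffer.allocate(bufferSize));
        }
    }

    // Blocks until a buffer becomes available.
    public ByteBuffer acquire() throws InterruptedException {
        return free.take();
    }

    // Called once the buffer has been completely written to the socket.
    public void release(ByteBuffer buffer) {
        buffer.clear();
        free.add(buffer);
    }
}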
For all these reasons, the mail I sent a few days ago expresses my
personal opinion that IoBuffer may be a little bit of an overkill
(remember that this class - and the associated tests - represents
around 13% of all mina-common code!)
Yes, that's very heavy. I looked at resolving DIRMINA-489 more than
once, and was overwhelmed by the sheer number of methods that had to
be implemented, and the overly complex class structure.
One option could be to use ByteBuffer with some static support methods,
+1
and streams to act as the "user interface" into collections of
buffers. For example, an InputStream that reads from a collection of
buffers, and an OutputStream that is configurable to auto-allocate
buffers, performing an action every time a buffer is filled:
import java.nio.ByteBuffer;

public interface BufferSink {
    void handleBuffer(ByteBuffer buffer);
}
That's an option.
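A sketch of what such an auto-allocating OutputStream could look like,
built on the BufferSink interface above (the bufferSize parameter and
the flush-on-full policy are assumptions):

import java.io.OutputStream;
import java.nio.ByteBuffer;

// An OutputStream that fills fixed-size buffers and hands each full
// (or explicitly flushed) buffer to a BufferSink.
public class BufferSinkOutputStream extends OutputStream {
    private final BufferSink sink;
    private final int bufferSize;
    private ByteBuffer current;

    public BufferSinkOutputStream(BufferSink sink, int bufferSize) {
        this.sink = sink;
        this.bufferSize = bufferSize;
    }

    @Override
    public void write(int b) {
        if (current == null) {
            current = ByteBuffer.allocate(bufferSize); // allocate on demand
        }
        current.put((byte) b);
        if (!current.hasRemaining()) {
            flush(); // hand off each buffer as soon as it is filled
        }
    }

    @Override
    public void flush() {
        if (current != null) {
            current.flip();
            sink.handleBuffer(current); // perform the configured action
            current = null;
        }
    }
}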
Another option is to skip ByteBuffers and go with raw byte[] objects
(though this closes the door completely to direct buffers).
Well, ByteBuffers are so intimately wired into NIO that I don't think we
can easily use byte[] without losing performance... (not sure, though...)
Yet another option is to have a simplified abstraction for byte arrays
like Trustin proposes, and use the stream classes for the buffer
state implementation.
This is all in addition to Trustin's idea of providing a byte array
abstraction and a buffer state abstraction class.
I'm afraid that offering a byte[] abstraction might lead to more
complexity, with respect to what you wrote about the way a codec should
handle data. At some point, your ideas are just the right ones, IMHO:
use BBs, and let the codec deal with them. No need to add a more
complex data structure on top of it.
Otherwise, the idea may be to define some simple codec which transforms
a BB into a byte[], for those who need it. As we have a cool filter
chain, let's use it... (see the sketch below)
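A sketch of what that conversion boils down to (written as a plain
helper rather than against a specific filter API):

import java.nio.ByteBuffer;

public final class ByteArrayCodec {
    // Copies the readable bytes of a ByteBuffer into a fresh byte[],
    // which a filter could then pass down the chain for handlers
    // that prefer arrays.
    public static byte[] toByteArray(ByteBuffer in) {
        byte[] bytes = new byte[in.remaining()];
        in.get(bytes); // consumes the buffer's readable region
        return bytes;
    }
}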
wdyt?
--
cordialement, regards,
Emmanuel Lécharny
www.iktek.com
directory.apache.org