Re: streaming redux

Michel Fortin Tue, 28 Dec 2010 09:40:37 -0800

On 2010-12-28 02:02:29 -0500, Andrei Alexandrescu<[email protected]> said:

I've put together over the past days an embryonic streaming interface.It separates transport from formatting, input from output, and bufferedfrom unbuffered operation.
http://erdani.com/d/phobos/std_stream2.html
There are a number of questions interspersed. It would be great tostart a discussion using that design as a baseline. Please voice anyrelated thoughts - thanks!

One of my concerns is the number of virtual calls required in actualusage, because virtual calls prevent inlining. I know it's necessary tohave virtual calls in the formatter to serialize objects (whichrequires double dispatch), but in your design the underlying transportlayer too wants to be called virtually. How many virtual calls will benecessary to serialize an array of 10 objects, each having 10 fields?Let's see:


          10 calls to Formatter.put(Object)
        + 10 calls to Object.toString(Formatter)
        + 10 objects * 10 calls per object to Formatter.put(<some field type>)

+ 10 objects * 10 calls per object toUnbufferedOutputTransport.write(in ubyte[])

Total: 220 virtual calls, for 10 objects with 10 fields each. Most ofthe functions called virtually here are pretty trivial and wouldnormally be inlined if the context allowed it. Assuming those fieldsare 4 byte integers and are stored as is in the stream, the result willbe between 400 and 500 byte long once we add the object's class name.We end up having almost 1 virtual call for each two byte of emitteddata; is this overhead really acceptable? How much inlining does itprevent?

My second concern is that your approach to Formatter is too rigid. Forinstance, what if an object needs to write different fields dependingon the output format, or write them in a different order? It'll have tocheck at runtime which kind of formatter it got (through castsprobably). Or what if I have a formatter that wants to expose an XMLtree instead of bytes? It'll need a totally different interface thatdeals with XML elements, attributes, and character data, not bytes.

So because of all this virtual dispatch and all this rigidity, I thinkFormatter needs to be rethought a little. My preference obviously goesto satically-typed formatters. But what I'd like to see is somethinglike this:


        interface Serializable(F) {
                void writeTo(F formatter);
        }

Any object can implement a serialization for a given formatter byimplementing the interface above parametrized with the formatter type.(Struct types could have a similar writeTo function too, they justdon't need to implement an interface.) The formatter type can exposethe interface it wants and use or not use virtual functions, it couldbe an XML writer interface (something with openElement,writeCharacterData, closeElement, etc), it could be a JSON interface;it could even be your Formatter as proposed, we just wouldn't belimited by it.

So basically, I'm not proposing you dump Formatter, just that you makeit part of a reusable pattern forformatting/serializing/unformatting/unserializing things using otherthings that your Formatter interface.

As for the transport layer, I don't mind it much if it's an interface.Unlike Formatter, nothing prevents you from creating a 'final' classand using it directly when you can to avoid virtual dispatch. Thisdoesn't work so well for Formatter however because it requires doubledispatch when it encounters a class, which washes away all staticinformation.



--
Michel Fortin
[email protected]
http://michelf.com/

Re: streaming redux

Reply via email to