Tatu Saloranta wrote:
> ...
> I don't see the need to call 3 accessor methods to get
> the raw char array as a significant performance blocker
> -- it certainly does not even register on profiles I
> have taken for parsing.
I didn't mean to suggest that the 3 method calls caused a performance
problem - it's just somewhat awkward, and exposing the underlying parser
buffer is not very clean from a structural standpoint.
The big potential advantage I see with a CharSequence-type approach is
that it would allow the parser to avoid translating data to a char[] in
the first place, instead returning characters directly from the byte
stream input (or internal byte[]). For UTF-8 and UTF-16 this would be
very easy to implement - it's not so easy for some other character
encodings, but those could be handled by the current approach of
translating everything to chars up front.
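
To sketch what I mean (illustrative only, not real Woodstox code - and
to keep charAt() cheap this version only handles the all-ASCII case
lazily, falling back to an up-front decode as soon as a multi-byte
sequence shows up):

import java.nio.charset.StandardCharsets;

// CharSequence view over the parser's raw UTF-8 input buffer. For the
// common all-ASCII case no char[] is ever allocated; a region containing
// multi-byte sequences gets decoded up front, as parsers do today.
public final class Utf8CharSequence implements CharSequence
{
    private final byte[] buf;
    private final int start, len;
    private char[] decoded; // non-null only for the multi-byte fallback

    public Utf8CharSequence(byte[] buf, int start, int len)
    {
        this.buf = buf;
        this.start = start;
        this.len = len;
        for (int i = start, end = start + len; i < end; ++i) {
            if (buf[i] < 0) { // high bit set: multi-byte sequence
                decoded = new String(buf, start, len,
                        StandardCharsets.UTF_8).toCharArray();
                break;
            }
        }
    }

    public int length() {
        return (decoded != null) ? decoded.length : len;
    }

    public char charAt(int index) {
        return (decoded != null) ? decoded[index] : (char) buf[start + index];
    }

    public CharSequence subSequence(int from, int to) {
        return toString().subSequence(from, to);
    }

    public String toString() {
        return (decoded != null) ? new String(decoded)
                : new String(buf, start, len, StandardCharsets.UTF_8);
    }
}

UTF-16 would be even more mechanical (two bytes per char, with surrogates
passing straight through); the harder encodings would just keep taking the
existing decode-everything-up-front path.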
> The only remaining area where non-shared Strings are
> used is attribute values; and here DTD/schema-based
> handling might allow sharing too (for enumerated
> types). Or, for minor improvements, type-based
> accessors could be used too. If there's interest, I
> could experiment with the Woodstox StAX parser -- adding
> low-level typed accessors would be quite easy to do,
> and would avoid String creation.
It'd be interesting to see how much parsing speeds up if you disable the
creation of Strings for attributes. Maybe that's an easy test you could try?
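
For the timing itself, a dumb loop over the standard javax.xml.stream API
should be enough to see the end-to-end effect - run it once against a
stock build and once against a build with attribute String creation
disabled. (The actual change has to happen inside the parser; nothing
below is Woodstox-specific, and the input file name is just a
placeholder.)

import java.io.FileInputStream;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamReader;

// Minimal timing loop over the standard StAX API; counts attributes so
// the parse work can't be optimized away.
public class ParseTimer
{
    public static void main(String[] args) throws Exception
    {
        XMLInputFactory factory = XMLInputFactory.newInstance();
        long start = System.currentTimeMillis();
        int attrs = 0;
        for (int run = 0; run < 10; ++run) {
            FileInputStream in = new FileInputStream(args[0]);
            XMLStreamReader reader = factory.createXMLStreamReader(in);
            while (reader.hasNext()) {
                if (reader.next() == XMLStreamConstants.START_ELEMENT) {
                    attrs += reader.getAttributeCount();
                }
            }
            reader.close();
            in.close();
        }
        System.out.println("Attributes seen: " + attrs);
        System.out.println("Elapsed ms: "
                + (System.currentTimeMillis() - start));
    }
}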
As to typed accessors, I think they'd be somewhat useful but I expect
they'd also be a lot of trouble. The main benefit I see is that they
would make it simpler to substitute a binary data decoder for the
parser, and I'm not all that thrilled by the idea of pure binary data
streams. The binary formats would be based on schemas, so in theory
different implementations should translate the same schema to compatible
formats - but we can't even get a reasonable level of compatibility in
the use of schemas for web services with *text* documents, so how much
more difficult would it be to do this with binary formats?
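
That said, the low-level accessor core itself would presumably look
something like the following - decoding straight out of the parser's char
buffer with no intermediate String. The names are hypothetical, and
overflow checking is omitted:

// Hypothetical typed-accessor core: parse a signed decimal int directly
// from the parser's internal char buffer, so no String is ever created.
public static int parseInt(char[] buf, int start, int len)
{
    if (len == 0) {
        throw new NumberFormatException("Empty value");
    }
    int i = start;
    int end = start + len;
    boolean neg = (buf[i] == '-');
    if (neg || buf[i] == '+') {
        ++i;
    }
    int value = 0;
    for (; i < end; ++i) {
        char c = buf[i];
        if (c < '0' || c > '9') {
            throw new NumberFormatException("Not a digit: " + c);
        }
        value = value * 10 + (c - '0');
    }
    return neg ? -value : value;
}

A getAttributeAsInt(index) style method (again, a made-up name) could then
run this over the same buffer region the current code turns into a String.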
- Dennis