Re: [cocoon3] Stax Pipelines

Sylvain Wallez Wed, 03 Dec 2008 03:36:10 -0800

Steven Dolg wrote:

Thorsten Scherler wrote:
On Wed, 03-12-2008 at 08:56 +0100, Sylvain Wallez wrote:
Andreas Pieber wrote:
...
One big "problem" with this approach is that the "flow direction of events" is completely inverted. This means that StAX and SAX components would not be able to work "directly" together. But in a push-pull approach a conversion between StAX and SAX events also has to be done, and furthermore this problem could be tackled by writing wrappers or adapters around the SAX components and adding them to a StAX pipe.
Absolutely. Converting Stax to SAX is fairly trivial, but the other way around requires buffering or multithreading. Have you looked at Stax-Utils [1]? It contains many classes to ease the SAX <-> Stax translation.
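To illustrate why the Stax-to-SAX direction is the easy one, here is a minimal sketch using only the standard javax.xml.stream and org.xml.sax APIs. The class name StaxToSaxSketch and the methods pump and trace are made up for this example; it is not the Stax-Utils implementation, and it deliberately ignores prefixes, namespace mappings, comments and processing instructions.

```java
import java.io.StringReader;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;
import org.xml.sax.Attributes;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributesImpl;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of Cocoon or Stax-Utils.
public class StaxToSaxSketch {

    // Pulls each event from the StAX reader and pushes it to the SAX handler.
    // No buffering is needed because the caller owns the loop.
    public static void pump(XMLStreamReader reader, ContentHandler handler)
            throws XMLStreamException, SAXException {
        handler.startDocument();
        while (reader.hasNext()) {
            switch (reader.next()) {
                case XMLStreamConstants.START_ELEMENT: {
                    AttributesImpl atts = new AttributesImpl();
                    for (int i = 0; i < reader.getAttributeCount(); i++) {
                        String name = reader.getAttributeLocalName(i);
                        // Simplification: the local name doubles as the qName.
                        atts.addAttribute(uri(reader.getAttributeNamespace(i)), name, name,
                                "CDATA", reader.getAttributeValue(i));
                    }
                    handler.startElement(uri(reader.getNamespaceURI()),
                            reader.getLocalName(), reader.getLocalName(), atts);
                    break;
                }
                case XMLStreamConstants.END_ELEMENT:
                    handler.endElement(uri(reader.getNamespaceURI()),
                            reader.getLocalName(), reader.getLocalName());
                    break;
                case XMLStreamConstants.CHARACTERS:
                case XMLStreamConstants.CDATA:
                    handler.characters(reader.getTextCharacters(),
                            reader.getTextStart(), reader.getTextLength());
                    break;
                default:
                    break; // comments, PIs, etc. are skipped in this sketch
            }
        }
        handler.endDocument();
    }

    // SAX expects an empty string, not null, for "no namespace".
    private static String uri(String ns) {
        return ns == null ? "" : ns;
    }

    // Small demo: re-serializes element names and text seen by a SAX handler.
    public static String trace(String xml) throws XMLStreamException, SAXException {
        final StringBuilder out = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void startElement(String u, String local, String qName, Attributes a) {
                out.append('<').append(qName).append('>');
            }
            @Override
            public void endElement(String u, String local, String qName) {
                out.append("</").append(qName).append('>');
            }
            @Override
            public void characters(char[] ch, int start, int length) {
                out.append(ch, start, length);
            }
        };
        XMLStreamReader reader =
                XMLInputFactory.newInstance().createXMLStreamReader(new StringReader(xml));
        pump(reader, handler);
        return out.toString();
    }
}
```

The reverse direction has no such loop to hook into: a SAX parser calls the handler until the document ends, so a pull consumer either reads from a buffer filled beforehand or runs in a second thread.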
I lately played around (and still do) with such an approach in the forrest
dispatcher rewrite [3]. I am using Axiom, which is a quite interesting
approach and maybe worth looking into [4]. However, I did some profiling
and for the dispatcher the old SAX approach was way faster.
We started with an evaluation of some StAX implementations including Axiom, Woodstox and the reference implementation. However, quite early we felt that the DOM-like approach of Axiom is not ideally suited for our current phase. I'm quite sure that there are occasions where Axiom can be really charming, but I believe there are too many premises required to use it efficiently (e.g. you will want to be sure that the XML data is not too large). But if you have some complex transformation that appears too difficult to implement in a one-pass approach, Axiom could probably do the trick.
I'm sure we will explore this idea at a later time, though...

Axiom is very interesting as the DOM-like structure it provides is the easiest way to traverse a document while avoiding full parsing of the document. But this comes at a price, since any traversal of a list of elements requires parsing all of these elements. So without very careful use, it can quickly degenerate into a classical DOM with the associated problems, or even worse because of the additional complexity required by deferred parsing.

Not to say that Axiom is bad, rather the contrary: it's a powerful weapon which you can easily shoot yourself in the foot with :-)

However, this is due to the buffering issue pointed out by Sylvain, which
[5] does not solve at all. That brings me back to doing a SAX (+StAX) approach
again (the other class in the package).

I am really excited about this thread. :)

I must admit that it got me all excited by now, too.


Should I say me too? ;-)

Yesterday, I did a very minimalistic POC, just to make sure our current approach is not missing any major point. I have to say I was simply amazed how easy state handling can be when using StAX compared to SAX, and I'm very confident that we came up with a pretty thorough concept.
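The difference in state handling can be sketched with a toy task: collect the text of every title element that sits inside an item. This is only an illustration, not the POC mentioned above; the class StateHandlingSketch, the element names and both helper methods are invented for the example. It assumes titles contain text only and items are not nested.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical comparison; element names "item" and "title" are assumptions.
public class StateHandlingSketch {

    // SAX: the parser drives, so "where am I?" must be kept in fields
    // that survive between callbacks.
    static class TitleHandler extends DefaultHandler {
        final List<String> titles = new ArrayList<>();
        private boolean inItem;
        private StringBuilder current; // non-null only inside item/title

        @Override
        public void startElement(String uri, String local, String qName, Attributes atts) {
            if ("item".equals(qName)) {
                inItem = true;
            } else if (inItem && "title".equals(qName)) {
                current = new StringBuilder();
            }
        }

        @Override
        public void characters(char[] ch, int start, int length) {
            if (current != null) {
                current.append(ch, start, length);
            }
        }

        @Override
        public void endElement(String uri, String local, String qName) {
            if ("title".equals(qName) && current != null) {
                titles.add(current.toString());
                current = null;
            } else if ("item".equals(qName)) {
                inItem = false;
            }
        }
    }

    static List<String> titlesWithSax(String xml) throws Exception {
        TitleHandler handler = new TitleHandler();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), handler);
        return handler.titles;
    }

    // StAX: the code drives, so the state is simply the position
    // in the call stack (inside readItem or not).
    static List<String> titlesWithStax(String xml) throws XMLStreamException {
        XMLStreamReader reader =
                XMLInputFactory.newInstance().createXMLStreamReader(new StringReader(xml));
        List<String> titles = new ArrayList<>();
        while (reader.hasNext()) {
            if (reader.next() == XMLStreamConstants.START_ELEMENT
                    && "item".equals(reader.getLocalName())) {
                readItem(reader, titles);
            }
        }
        return titles;
    }

    private static void readItem(XMLStreamReader reader, List<String> titles)
            throws XMLStreamException {
        while (reader.hasNext()) {
            int event = reader.next();
            if (event == XMLStreamConstants.START_ELEMENT
                    && "title".equals(reader.getLocalName())) {
                titles.add(reader.getElementText()); // consumes up to </title>
            } else if (event == XMLStreamConstants.END_ELEMENT
                    && "item".equals(reader.getLocalName())) {
                return; // leaving the method is leaving the "in item" state
            }
        }
    }
}
```

In the SAX variant every new piece of context means another field and another branch in each callback, while the StAX variant just adds another method.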


I'm eager to see what it looks like!

After all, we tortured our poor students for more than a month with evaluating implementations and writing use cases - outside Cocoon! - using both SAX and StAX, before even "allowing" them to think about how to integrate this into Cocoon. I believe this was necessary to fully understand the differences between StAX and SAX and - even more important - the different usage patterns associated with them.
And I'm sure this allows us now to fully reap the benefits of this API.

Well, I can't wait to see the first components ...

Same here! But don't forget in the torture program the important XSLT transformer, since I don't know of any implementation that would support pull callbacks and thus avoid buffering the output in a Stax pipeline.
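The buffering in question looks roughly like this with the standard JAXP API: the transformer pushes its whole result into a buffer, and only then can the next Stax component start pulling. This is a sketch under assumptions; BufferedXsltSketch and transformThenPull are invented names, and an in-memory string stands in for whatever buffer a real pipeline would use.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamReader;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical helper, not a Cocoon component.
public class BufferedXsltSketch {

    // Runs the stylesheet to completion, then exposes the buffered
    // result as a pull reader for downstream Stax components.
    static XMLStreamReader transformThenPull(String xml, String xslt) throws Exception {
        Transformer transformer = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(xslt)));

        // transform() returns only after the whole output has been pushed,
        // so everything has to sit in this buffer first.
        StringWriter buffer = new StringWriter();
        transformer.transform(new StreamSource(new StringReader(xml)),
                new StreamResult(buffer));

        return XMLInputFactory.newInstance()
                .createXMLStreamReader(new StringReader(buffer.toString()));
    }
}
```

Nothing reaches the consumer until transform() has finished, which is exactly the cost a pull-capable XSLT implementation would avoid.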


Sylvain

--
Sylvain Wallez - http://bluxte.net
