[Axis2] Axiom restructuring proposal

Dennis Sosnoski Wed, 29 Mar 2006 15:36:54 -0800

Right now the Axiom code is explicitly tied to StAX as the source ofdata for constructing the document tree. This creates problems inworking with data binding frameworks which do not support marshallingvia a StAX XMLStreamReader, and limits the usability of Axiom across awider range of applications.

On the output side, Axiom generally uses XMLStreamWriter but alsodefines OMNode methods taking an OutputStream, a Writer, and aOMOutputImpl, as well as variations with OMOutputFormat included, andvariations for serialize vs. serializeAndConsume. This proliferation ofmethods adds a lot of complexity to the interface while requiring allcomponents of the tree to handle each of these forms of output. Forcomponents representing marshalling output from data binding frameworksthese variations also force inefficient handling of output (forinstance, by writing to an XMLStreamWriter when the framework can moreefficiently write directly to an OutputStream).

I'd like to see Axiom instead define more generic interfaces forhandling various data sources and output mechanisms. Really all that'srequired from potential data sources is that (1) the element holdingsome data can be converted to an Axiom tree structure on demand(possibly piecemeal, as in the case of elements backed by anXMLStreamReader), and (2) the data source can write itself to anOutputStream, Writer and/or XMLStreamWriter. We can abstract out theseoperations to the data source for unexpanded elements. This will allowcleaner handling of data binding framework extensions to Axis2, whilealso allowing flexibility for developers who have their own ways ofprocessing XML (see http://issues.apache.org/jira/browse/AXIS2-483 foran example).

Here's a first cut at an interface for the data source of an unexpandedelement:


public interface BackingData {
   void expandElement(OMElement element); // expand the element information

void expandNextContent(OMElement element); // expand next contentitem of elementboolean isReusable(); // check if data source can be used repeatedly(may avoid the need for expansion if so)void serialize(SerializationTarget target); // serialize using anysupported approach

When the expandElement() method of the BackingData is called, it willpopulate at least the element's attributes and namespaces information.When the expandNextContent() method is called, it would be theresponsibility of that BackingData instance to construct at least thenext content node of the element. If that next content node is anelement, the BackingData would be able to leave that element unexpandedand attach itself to the element. The idea here is to be flexible enoughto handle both elements backed by an XMLStreamReader and those backed bydata binding or an alternative form of XML handling. TheexpandElement()/expandNextContent() methods would need to be called inproper document order, so that if the data is coming from anXMLStreamReader it will be read sequentially (no expandNextContent()higher in the tree until all the content before that point in documentorder has been expanded).


Here's a first cut at an interface for the serialization handling:

public inteface SerializationTarget {

OutputStream getOutputStream(); // return output stream if availablefor direct output, otherwise nullWriter getWriter(); // return writer if available for direct output,otherwise nullXMLStreamWriter getXMLWriter(); // return XMLStreamWriter (alwaysavailable)

boolean isAttachable(String contentType, long estimatedSize); //check if "optimizable" data should be sent as attachmentString addAttachment(String contentType, InputStream is); // addattachment in the form of a stream (returns content id)String addAttachment(String contentType, byte[] bytes); // addattachment in the form of a byte array (returns content id)

The first part of this interface is the basic output handling. The rulehere is that every SerializationTarget will supply an XMLStreamWriter ondemand, but will also supply either an OutputStream or a Writer (soeither of the first two methods may return null, but not both). Theprinciple here is that many forms of XML handling can write directly toan output stream or writer but not to an XMLStreamWriter, while thelatter provides a flush() method which should make it safe for theoutput stream or writer to be used independently for XML fragments - souse the XMLStreamWriter for the envelope, if that's what you want, butstill use the stream or writer to output the body of the document.

The second part of this interface deals with attachments. It gives theSerializationTarget (which would be transport-dependent, of course) thecontrol over what actually gets sent as an attachment, and provides thedata to be output as an attachment in the form of either a stream or anarray of bytes. This would allow us to fix the current broken outputbehavior which forces generation of a fully-expanded OM tree for everymessage being sent, just so the transport code can check for anything itwants to send as an attachment.

I'm planning to make the chat later today if anyone wants to discussthese ideas (and also via email exchange, of course).


 - Dennis

--
Dennis M. Sosnoski
SOA, Web Services, and XML
Training and Consulting
http://www.sosnoski.com - http://www.sosnoski.co.nz
Seattle, WA +1-425-296-6194 - Wellington, NZ +64-4-298-6117

[Axis2] Axiom restructuring proposal

Reply via email to