Peter Donald wrote:
>
> <snip>
> > So, my proposol is to add a getMimeType() method to the Source interface
> > and an "XMLizer" component to the xml package. This XMLizer gets
> > a mime-type and an input stream as input and creates sax events
> from this.
> > This will
> > - decouple the source and the xml package totally
> > - the XMLizer is not only usable for source objects
> > - the XMLizer is extensible...
>
> fantastic. This brings up another idea I have been playing with
> recently. I
> have a couple of binary file formats that I convert into SAX events for
> further processing...
>
Yo, that sort of things.

> Maybe it would be an idea to instead create a new package for XMLizer
> interface and related implementations. ie I assume that there is a POI
> deserializer (or whatever they are called in cocoon) that reads the
> excel/word/whatever documents and produces an XML stream (either
> as SAX or as
> DOM). I have similar readers that deal with other binary
> protocols. I also
> used to always use a directory to XML producer (I stole idea from
> stylebook
> IIRC). I am sure other people have other similar demands. (PDF to XML?)
>
> So maybe a generic framework for converting streams to XML may be
> in order ?
> Something separate from XML parsing stuff already in place?
>
The XMLizer itself is intependent from the parser stuff but of course
some implementations like the conversion from "text/xml" into SAX
events requires a parser.
I thought of the package "org.apache.excalibur.xmlizer" resp.
"org.apache.excalibur.xmlizer.impl".

Carsten


--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to