Hi all,

I found a weird behaviour in XMLSerializer: it apparently ignores start/endElement events that are between start/endEntity events, and only keeps character events, thus removing all the markup.

This occurs when the pipeline starts with a by parsing a file containing entities (using the file generator) and ends with the XML serializer with no XSL transformations inbetween. This problem doesn't appear when an XSL is present as if buffers the SAX stream to build its own internal DOM and then processes that DOM.

I've setup a test example to show this: update the 2.1.x branch and point your browser to http://localhost:8888/samples/test/serialized-entities/explain-test.html

Is this a known problem? Should we filter out start/endEntity events in XMLSerializer?

Thanks for any advice,
Sylvain

--
Sylvain Wallez                        Anyware Technologies
http://people.apache.org/~sylvain     http://www.anyware-tech.com
Apache Software Foundation Member     Research & Technology Director

Reply via email to