No, parsers are free to report any contiguous run of text through multiple
calls to characters() in the ContentHandler.  This is often a side effect of
the parser's own internal buffer sizing: when the buffer fills up, the parser
fires a call to characters().  This can result in an Element node containing
multiple adjacent Text nodes as children.  Normalization simply merges the
characters of those adjacent Text nodes into a single Text node.  No changes
are made to the actual content/characters (newlines included), only to how
they are divided and organized within the DOM.
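
To make that concrete, here is a minimal sketch.  It uses the W3C DOM that
ships with the JDK rather than dom4j (an assumption, so that it runs without
extra jars), and it fakes the parser's split callbacks by appending three
Text nodes by hand.  The class and method names are made up for illustration.

```java
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

public class NormalizeDemo {
    // Hypothetical helper: builds a <p> element with one Text child per
    // chunk, mimicking a parser that delivered the text in several
    // characters() calls.
    public static Element buildFragmented(String... chunks) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder().newDocument();
        Element p = doc.createElement("p");
        doc.appendChild(p);
        for (String chunk : chunks) {
            p.appendChild(doc.createTextNode(chunk));
        }
        return p;
    }

    public static void main(String[] args) throws Exception {
        Element p = buildFragmented("line one\n", "line ", "two");
        System.out.println(p.getChildNodes().getLength()); // 3 Text nodes

        p.normalize(); // merge adjacent Text nodes

        System.out.println(p.getChildNodes().getLength()); // 1 Text node
        // The characters themselves, newline included, are untouched.
        System.out.println("line one\nline two".equals(p.getTextContent())); // true
    }
}
```

The child count drops from three to one, but the text you read back is the
same before and after, so this is not a whitespace-stripping operation.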

        -bob


On Sat, 22 May 2004, Ben Munat wrote:

> I think I've solved my problem with SaxReader's setStripWhitespaceText.
> However, I wouldn't mind understanding the point of this normalize
> method... by "concatenate" do you mean remove newlines?


_______________________________________________
dom4j-user mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/dom4j-user
