No, it does not remove newlines. A SAX parser is free to report any contiguous run of text as multiple calls to characters() on the ContentHandler. This is often a result of the parser's own internal buffer size: when the buffer fills, it fires a call to characters(). The result can be an Element node with several adjacent Text nodes as children. Normalization simply merges the characters of those adjacent Text nodes into a single Text node. The content itself is not changed, only how it is divided and organized within the DOM.
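To make the merging concrete, here is a minimal sketch. It assumes the JDK's built-in org.w3c.dom API rather than dom4j, so that it runs without extra jars; the class name NormalizeDemo and the helper buildFragmented() are made up for illustration. The three Text nodes are appended by hand to imitate what a parser might produce from three characters() calls.

```java
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

public class NormalizeDemo {

    // Hypothetical helper: builds <greeting> with three adjacent Text children,
    // imitating a parser that split one run of text across several callbacks.
    static Element buildFragmented() {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .newDocument();
            Element greeting = doc.createElement("greeting");
            doc.appendChild(greeting);
            greeting.appendChild(doc.createTextNode("Hello,\n"));
            greeting.appendChild(doc.createTextNode("  wor"));
            greeting.appendChild(doc.createTextNode("ld"));
            return greeting;
        } catch (ParserConfigurationException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        Element greeting = buildFragmented();
        String before = greeting.getTextContent();

        System.out.println("Text nodes before: " + greeting.getChildNodes().getLength());
        greeting.normalize(); // merge adjacent Text nodes into one
        System.out.println("Text nodes after:  " + greeting.getChildNodes().getLength());

        // The characters, including the newline and the spaces, are untouched.
        System.out.println("Same content: " + before.equals(greeting.getTextContent()));
    }
}
```

The newline and the leading spaces survive normalize() unchanged; only the number of Text nodes differs. Stripping whitespace is a separate concern, which is what setStripWhitespaceText addresses on the dom4j side.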
-bob

On Sat, 22 May 2004, Ben Munat wrote:
> I think I've solved my problem with SaxReader's setStripWhitespaceText.
> However, I wouldn't mind understanding the point of this normalize
> method... by "concatenate" do you mean remove newlines?

_______________________________________________
dom4j-user mailing list
https://lists.sourceforge.net/lists/listinfo/dom4j-user