Alexandro, In the 'character ' function, I would just use a 'sub' using the regexp 're' module if you are in Python.
Laurent Alexandro Colorado <[EMAIL PROTECTED]> 16/04/2007 19:09 Veuillez répondre à [email protected] A [email protected] cc Objet [xml-dev] Understanding SAX and IgnoreWhiteSpaces I wonder if there is any SAX expert here. I want to parse an XML file with line breaks and tabs. It seems that the 'character' handler under ContentHandler will include linebreaks spaces and other invisible characters as content of the tags. So for example I will get something like: [ ] [ ] [Content] [ ] [ ] [OtherContent] [ ] <tag> <value>Conent</value> <value>OtherContent</value> </tag> [ ] = Line Breaks (\n) [ ] = Tabs (\t) How can I ignore this non-visible character and just get the visual one. Thanks. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
