Alexandro,

In the 'character ' function, I would just use a 'sub' using the regexp 
're' module if you are in Python.

Laurent




Alexandro Colorado <[EMAIL PROTECTED]> 
16/04/2007 19:09
Veuillez répondre à
[email protected]


A
[email protected]
cc

Objet
[xml-dev] Understanding SAX and IgnoreWhiteSpaces





I wonder if there is any SAX expert here. I want to parse an XML file
with line breaks and tabs. It seems that the 'character' handler under
ContentHandler will include linebreaks spaces and other invisible
characters as content of the tags.

So for example I will get something like:
[  ] [     ] [Content] [ ] [     ] [OtherContent] [ ]

<tag>
<value>Conent</value>
<value>OtherContent</value>
</tag>

[ ] = Line Breaks (\n)
[    ] = Tabs (\t)

How can I ignore this non-visible character and just get the visual
one. Thanks.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to