Re: [Tutor] module to parse XMLish text?

Karim Sat, 15 Jan 2011 12:52:46 -0800


Hello,

I did not see the XML code in details before I gave the code withElementTree.In fact with unclosing tags you will get errors at parse time and itwill give you

the location of errors.

You could use the module from Stefan which is way way superior thanElementTreewhich can validate against DTD or XSD and many many other features(speed, etc...)


Regards
Karim

On 01/15/2011 07:53 AM, Stefan Behnel wrote:

Wayne Werner, 15.01.2011 03:25:
On Fri, Jan 14, 2011 at 4:42 PM, Terry Carroll wrote:
On Fri, 14 Jan 2011, Karim wrote:

  from xml.etree.ElementTree import ElementTree

I don't think straight XML parsing will work on this, as it's not valid
XML; it just looks XML-like enough to cause confusion.
It's worth trying out - most (good) parsers "do the right thing" evenwhenthey don't have strictly valid code. I don't know if xml.etree isone, but
I'm fairly sure both lxml and BeautifulSoup would probably parse it
correctly.
They wouldn't. For the first tags, the text values would either notcome out at all or they would be read as attributes and thus loosetheir order and potentially their whitespace as well. The other tagswould likely get parsed properly, but the parser may end up nestingthem as it hasn't found a closing tag for the previous tags yet.
So, in any case, you'd end up with data loss and/or a structure thatwould be much harder to handle than the (relatively) simple filestructure.
Stefan

_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor


_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor

Re: [Tutor] module to parse XMLish text?

Reply via email to