Re: [BangPypers] parsing xml

Gora Mohanty Thu, 28 Jul 2011 13:01:48 -0700

On Thu, Jul 28, 2011 at 10:37 PM, Venkatraman S <venka...@gmail.com> wrote:
> parsing using minidom is one of the slowest. if you just want to extract the
> distance and assuming that it(the tag) will always be consistent, then i
> would always suggest regexp. xml parsing is a pain.
[...]


Strongly disagree. IMHO, regexps are the wrong solution
for parsing XML (or any kind of well-structured text):
they end up becoming intolerably complex, and they do
not degrade gracefully on broken XML.
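
To make the point concrete, here is a minimal sketch. The sample
document, the pattern, and the function names are all made up for
illustration, since the actual XML from the thread is not shown. It
uses the standard library's xml.etree.ElementTree rather than
minidom, only to keep the example dependency-free.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample; the real document from the thread is not shown.
SAMPLE = """<route>
  <leg>
    <distance unit="km">12.5</distance>
  </leg>
</route>"""

# A naive pattern that assumes the tag never carries attributes.
NAIVE = re.compile(r"<distance>(.*?)</distance>")


def distance_by_regex(text):
    """Return the captured text, or None if the pattern does not match."""
    match = NAIVE.search(text)
    return match.group(1) if match else None


def distance_by_parser(text):
    """Return the text of the first <distance> element, or None."""
    root = ET.fromstring(text)
    node = root.find(".//distance")
    return node.text if node is not None else None


def is_well_formed(text):
    """Report whether the parser accepts the text as XML."""
    try:
        ET.fromstring(text)
    except ET.ParseError:
        return False
    return True
```

The regexp misses the element as soon as an attribute appears, and
the pattern has to grow for every such variation. The parser handles
the attribute without changes, and on broken input it raises an
error instead of quietly returning a wrong or empty result.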

I have not compared speeds myself, but there are blogs
that go into that. In my experience, the cleanest, most
efficient, and most feature-rich Python XML library is
lxml. For people used to BeautifulSoup, lxml also has a
BeautifulSoup parser, and lxml is significantly more
efficient than BeautifulSoup.
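
As a rough sketch of what this looks like with lxml: the sample
document and the function name below are hypothetical, and lxml is
a third-party package that is assumed to be installed. The fallback
import is only there so that the snippet also runs with the standard
library, whose ElementTree API is similar for these calls.

```python
try:
    from lxml import etree  # third-party; assumed to be installed
except ImportError:
    # Standard-library fallback with a compatible API for this sketch.
    import xml.etree.ElementTree as etree

# Hypothetical sample; the real document from the thread is not shown.
SAMPLE = b"""<route>
  <leg><distance unit="km">12.5</distance></leg>
  <leg><distance unit="km">3.0</distance></leg>
</route>"""


def leg_distances(xml_bytes):
    """Collect the numeric value of every <distance> element."""
    root = etree.fromstring(xml_bytes)
    return [float(node.text) for node in root.iter("distance")]
```

Calling leg_distances(SAMPLE) walks the whole tree, so nested or
repeated elements need no extra handling.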

Regards,
Gora
_______________________________________________
BangPypers mailing list
BangPypers@python.org
http://mail.python.org/mailman/listinfo/bangpypers
