[EMAIL PROTECTED] wrote:
> I understand that the web is full of ill-formed XHTML web pages, but this is
> Microsoft:
>
> http://moneycentral.msn.com/companyreport?Symbol=BBBY
>
> I can't validate it and my standard Python XML parsing tools don't work on it.

Use BeautifulSoup:
http://www.crummy.com/software/BeautifulSoup/

"You didn't write that awful page. You're just trying to get some data out of it. Right now, you don't really care what HTML is supposed to look like.

Neither does this parser."
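
A rough sketch of what that looks like in practice. Everything here is an assumption for illustration: the markup is a made-up stand-in for a sloppy page (not the real MSN report), the "value" class and the function names are hypothetical, and it assumes the current bs4 package (pip install beautifulsoup4). If bs4 isn't installed, it falls back to the standard library's html.parser, which is similarly forgiving but more work to use.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical stand-in for an ill-formed page (NOT the real MSN markup):
# unquoted attributes, an unclosed <b>, an unclosed <p>, no </body></html>.
BROKEN_HTML = """<html><body>
<table class=quote>
<tr><td class=label>Symbol</td><td class=value><b>BBBY</td></tr>
<tr><td class=label>Exchange</td><td class=value>NASDAQ</td></tr>
</table>
<p>unclosed paragraph
"""


def strict_parse_ok(markup):
    """Return True if a strict XML parser accepts the markup."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


class _ValueCells(HTMLParser):
    """Stdlib fallback: collect the text inside <td class=value> cells."""

    def __init__(self):
        super().__init__()
        self.in_cell = False
        self.values = []

    def handle_starttag(self, tag, attrs):
        if tag == "td" and dict(attrs).get("class") == "value":
            self.in_cell = True
            self.values.append("")

    def handle_endtag(self, tag):
        if tag == "td":
            self.in_cell = False

    def handle_data(self, data):
        if self.in_cell:
            self.values[-1] += data.strip()


def extract_values(markup):
    """Pull the 'value' cells out of sloppy HTML, preferring BeautifulSoup."""
    try:
        from bs4 import BeautifulSoup  # third-party package
    except ImportError:
        parser = _ValueCells()
        parser.feed(markup)
        return parser.values
    soup = BeautifulSoup(markup, "html.parser")
    return [td.get_text(strip=True) for td in soup.find_all("td", class_="value")]


if __name__ == "__main__":
    print(strict_parse_ok(BROKEN_HTML))   # strict XML parsing rejects the page
    print(extract_values(BROKEN_HTML))    # the lenient parser still gets the data
```

The strict parser gives up at the first unquoted attribute, which is the same kind of failure described in the question. The lenient route never validates the page; it just walks whatever tags it can find.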

-a


--
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-list
