I understand that the web is full of ill-formed XHTML web pages but this is
Microsoft:

http://moneycentral.msn.com/companyreport?Symbol=BBBY

I can't validate it and my standard Python XML parsing tools don't work on it.

If this was just some teenager's web site I'd move on.  Is there any hope
avoiding regular expression hacks to extract the data from this page?

Chris


-- 
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-list

Reply via email to