Kenneth McDonald wrote: > Ruby has a package called 'hpricot' which can perform limited xpath > queries, and CSS selector queries. However, what makes it really useful > is that it does a good job of handling the "broken" html that is so > commonly found on the web. Does Python have anything similar, i.e. > something that will not only do XPath queries, but will do so on > imperfect HTML?
lxml.html is your friend. http://codespeak.net/lxml/lxmlhtml.html Stefan -- http://mail.python.org/mailman/listinfo/python-list