On 13/08/2014 7:28 AM, Roy Smith wrote:
> Second, if you're going to be parsing web pages, trying to use regexes
> is a losing game.  You need something that knows how to parse HTML.  The
> canonical answer is lxml (http://lxml.de/), but Beautiful Soup
> (http://www.crummy.com/software/BeautifulSoup/) is less intimidating to
> use.

lxml also has a BeautifulSoup parser (lxml.html.soupparser), so you can easily mix and match approaches:

http://lxml.de/elementsoup.html
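
To illustrate Roy's point about regexes, here is a rough sketch rather than a recommended implementation. The sample markup, the naive pattern, and the names links_regex, links_parser and links_lxml_soup are all made up for this example. The runnable comparison uses the standard library's html.parser as a stand-in so that it works without installing anything; the last function shows what I'd expect the lxml plus BeautifulSoup route to look like, assuming both lxml and beautifulsoup4 are installed.

```python
import re
from html.parser import HTMLParser

# Made-up sample: single-quoted attribute, a commented-out link,
# upper-case tag names, and an unclosed <p>.
HTML = """<p>Docs: <a class="ext" href='http://lxml.de/'>lxml</a>
<!-- <a href="http://example.invalid/old">commented out</a> -->
<A HREF="http://www.crummy.com/software/BeautifulSoup/">Beautiful Soup</A>"""

# A typical first attempt at "find the links" with a regex.
NAIVE_LINK = re.compile(r'<a href="([^"]*)"')


def links_regex(html):
    """Return whatever the naive pattern happens to match."""
    return NAIVE_LINK.findall(html)


class LinkCollector(HTMLParser):
    """Collect href values from <a> start tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # html.parser lower-cases tag and attribute names for us.
        if tag == "a":
            self.links.extend(value for name, value in attrs if name == "href")


def links_parser(html):
    """Return hrefs found by a real (if basic) HTML parser."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links


def links_lxml_soup(html):
    """Same idea via lxml's BeautifulSoup-backed parser.

    Imported lazily because it needs the third-party packages
    lxml and beautifulsoup4; not exercised below.
    """
    from lxml.html import soupparser

    root = soupparser.fromstring(html)
    return [a.get("href") for a in root.iter("a")]


if __name__ == "__main__":
    print("regex: ", links_regex(HTML))
    print("parser:", links_parser(HTML))
```

On this sample the regex misses both real links (different attribute order and quoting, upper-case tag) and picks up the one inside the comment, while the parser returns the two real links. If lxml alone copes with your pages, lxml.html.fromstring(html) can be used in place of soupparser.fromstring(html) in the last function.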

--
https://mail.python.org/mailman/listinfo/python-list
