Re: html parser?

Thorsten Kampe Tue, 18 Oct 2005 09:10:53 -0700

* Christoph Söllner (2005-10-18 12:20 +0100)
> right, that's what I was looking for. Thanks very much.


For simple things like that "BeautifulSoup" might be overkill.

import formatter, \ 
       htmllib,   \ 
       urllib 

url = 'http://python.org' 

htmlp = htmllib.HTMLParser(formatter.NullFormatter()) 
htmlp.feed(urllib.urlopen(url).read()) 
htmlp.close() 

print htmlp.anchorlist

and then use urlparse to parse the links/urls...
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: html parser?

Reply via email to