sacha rook wrote:
> Hi I wonder if anyone can help with the following
>  
> I am trying to read a html page extract only fully qualified hostnames 
> from the page and output these hostnames to a file on disk to be used 
> later as input to another program.

I would use BeautifulSoup to parse out the hrefs and urlparse.urlparse() 
to split the hostname out of the href.

http://www.crummy.com/software/BeautifulSoup/documentation.html

Kent
_______________________________________________
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Reply via email to