sacha rook wrote:
> Hi, I wonder if anyone can help with the following.
>
> I am trying to read an HTML page, extract only the fully qualified hostnames
> from the page, and output these hostnames to a file on disk to be used
> later as input to another program.

I would use BeautifulSoup to parse out the hrefs and urlparse.urlparse() to split the hostname out of each href.

http://www.crummy.com/software/BeautifulSoup/documentation.html

Kent
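
A rough sketch of the approach suggested above, in case it helps. To keep it runnable without installing anything, it uses the standard library's html.parser as a stand-in for BeautifulSoup, and urllib.parse.urlparse, which is where urlparse.urlparse() lives in Python 3. The names HrefCollector, extract_hostnames and write_hostnames are made up for illustration, and treating "fully qualified" as "contains a dot" is only an assumption about what the original poster needs.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class HrefCollector(HTMLParser):
    """Hypothetical helper: collect the href of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_hostnames(html):
    """Return sorted, de-duplicated hostnames from the links in html."""
    collector = HrefCollector()
    collector.feed(html)
    hosts = set()
    for href in collector.hrefs:
        host = urlparse(href).hostname  # None for relative links
        # Assumption: a "fully qualified" hostname has at least one dot.
        if host and "." in host:
            hosts.add(host)
    return sorted(hosts)


def write_hostnames(hosts, path):
    """Write one hostname per line so another program can read the file."""
    with open(path, "w") as f:
        for host in hosts:
            f.write(host + "\n")
```

With BeautifulSoup installed, the HrefCollector class could be replaced by whatever link-finding call its documentation describes; the urlparse step and the file output would stay the same.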