On 25 Oct 2009, at 07:45, elca wrote:
> i want to make web scraper.
> if possible i really want to make it work together with
> beautifulsoup or lxml with PAMIE.
Scraping information from webpages breaks down into two tasks:
1. Getting the HTML data
2. Extracting information from the HTML data
It looks like you want to use Internet Explorer for getting the HTML
data; is there any reason you can't use a simpler approach like using
urllib2.urlopen()?
Once you have the HTML data, you could feed it into BeautifulSoup or
lxml.
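
To make the split concrete, here is a rough sketch that keeps the two
tasks in separate functions. It is only an illustration under a few
assumptions: it targets Python 3, where urllib2.urlopen() lives at
urllib.request.urlopen(); the names fetch_html, LinkCollector and
extract_links are made up for this example; and the extraction step uses
the standard library's html.parser as a stand-in so it runs without
extra packages. BeautifulSoup or lxml would take the same HTML string in
its place.

```python
from html.parser import HTMLParser
from urllib.request import urlopen  # Python 3 home of urllib2.urlopen()


def fetch_html(url):
    """Task 1: get the HTML data as text (hypothetical helper)."""
    with urlopen(url) as response:
        # Use the charset from the HTTP headers, falling back to UTF-8.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Task 2: extract information from the HTML data (hypothetical helper).

    A stand-in for BeautifulSoup or lxml; it only needs the HTML string
    and does not care how that string was obtained.
    """
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links
```

Usage would be along the lines of extract_links(fetch_html(url)) for
whatever URL you are after. Because task 2 only needs a string, it does
not matter whether task 1 is done by urlopen(), by Internet Explorer
through PAMIE, or by reading a saved file.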
I think combining tasks 1 and 2 into a single statement is what caused
the confusion.
Greetings,
--
http://mail.python.org/mailman/listinfo/python-list