On Thu, Mar 18, 2010 at 6:51 PM, Dave Watts wrote: > >> We'd like to pull meta tags from the home page of various websites. >> >> Here's how we'd like for this to work: >> 1. SQl table listing over 3,000 urls is queried. >> 2. Pull the meta tags and description from each of the home pages of these >> websites. >> 3. Insert these meta tags into a database. >> >> What's the best way to accomplish this? In particular, how do we scrape the >> meta tags using CF8? > > The best way to accomplish this would probably be to use something > other than CF, which is not intended for this kind of thing. There are > all sorts of products, free and other, that can do individual parts of > this, without being tied to the request/response model that CF is > designed to work within.
I'm really fond of web-harvest: http://web-harvest.sourceforge.net/ It's the cat's meow. Even wrote a custom tag (Railo only ATM) for using it with CFML. :den -- True time is four-dimensional. Martin Heidegger ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| Want to reach the ColdFusion community with something they want? Let them know on the House of Fusion mailing lists Archive: http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:331882 Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm Unsubscribe: http://www.houseoffusion.com/groups/cf-talk/unsubscribe.cfm

