On Thu, Mar 18, 2010 at 6:51 PM, Dave Watts wrote:
>
>> We'd like to pull meta tags from the home page of various websites.
>>
>> Here's how we'd like for this to work:
>> 1. SQl table listing over 3,000 urls is queried.
>> 2. Pull the meta tags and description from each of the home pages of these 
>> websites.
>> 3. Insert these meta tags into a database.
>>
>> What's the best way to accomplish this? In particular, how do we scrape the 
>> meta tags using CF8?
>
> The best way to accomplish this would probably be to use something
> other than CF, which is not intended for this kind of thing. There are
> all sorts of products, free and other, that can do individual parts of
> this, without being tied to the request/response model that CF is
> designed to work within.

I'm really fond of web-harvest:

http://web-harvest.sourceforge.net/

It's the cat's meow.  Even wrote a custom tag (Railo only ATM) for
using it with CFML.

:den

-- 
True time is four-dimensional.
Martin Heidegger

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Want to reach the ColdFusion community with something they want? Let them know 
on the House of Fusion mailing lists
Archive: 
http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:331882
Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/groups/cf-talk/unsubscribe.cfm

Reply via email to