Michael Dinowitz has a great presentation/tutorial on how to do exactly that. 
http://www.houseoffusion.com/httpagent.ppt

hth,
larry

> I need to index web page contents for doing verity (or similar) 
> searching.  I'd like to insert just the text that a web page returns 
> and not any of the other stuff (like html, JS, CSS, images, etc)  
> 
> I noticed that cfhttp.filecontent returns the entire contents of the 
> page, anyone have a good way to get at just the text?
> 
> Also, I am storing the results in a mysql database and was 
> anticipating using the "text" data type, I assume that is the best way 
> to go? 


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Adobe® ColdFusion® 8 software 8 is the most important and dramatic release to 
date
Get the Free Trial
http://ad.doubleclick.net/clk;203748912;27390454;j

Archive: 
http://www.houseoffusion.com/groups/CF-Talk/message.cfm/messageid:308822
Subscription: http://www.houseoffusion.com/groups/CF-Talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4

Reply via email to