On Fri, 1 Mar 2002 11:18:32 -0500, Sean McCarthy wrote:
>look into the verity spider
>
>-----Original Message-----
>From: phil e hebenstreit [mailto:[EMAIL PROTECTED]]
>Sent: Friday, March 01, 2002 11:14 AM
>To: CF-Talk
>Subject: RE: Verity Issue - Returning Incorrect Results?
>
>Hi Mark,
>
>I've determined this now. It's returning the correct results, because
>some of the articles are uploaded files (html) that were created in
>Word and contain a bunch of excess html code at the top (one being
>style sheets that contain the word "hybrid", which is what the search
>word was). So now I'm trying to find out the best way to strip out all
>this "extra" and just index the actual text of the uploaded file.
>
>Thanks again,
>Phil
>
>-----Original Message-----
>From: Mark Leder [mailto:[EMAIL PROTECTED]]
>Sent: Friday, March 01, 2002 11:05 AM
>To: CF-Talk
>Subject: Re: Verity Issue - Returning Incorrect Results?
>
>On Fri, 1 Mar 2002 10:23:57 -0500, phil e hebenstreit wrote:
>>Running CF 5.0 and when we run our search against the collection, it
>>returns what on the surface appear to be inconsistent results.
>>
>>I will search on "hybrid" and the first 7 articles will be "foreign
>>text", and when I search the body that word does not appear.
>>The next article will be in English and does contain the keyword in
>>the body.
>>But then the next article in English does not contain the keyword in
>>the body.
>>
>>Wondering if anyone could point me to where the mistake might be
>>taking place. During the index/update of the collection? During the
>>actual search call?
>>
>>Thanks in advance
>>Phil E Hebenstreit
>>
>>______________________________________________________________________
>>Get Your Own Dedicated Windows 2000 Server
>>PIII 800 / 256 MB RAM / 40 GB HD / 20 GB MO/XFER
>>Instant Activation · $99/Month · Free Setup
>>http://www.pennyhost.com/redirect.cfm?adcode=coldfusionb
>>FAQ: http://www.thenetprofits.co.uk/coldfusion/faq
>>Archives: http://www.mail-archive.com/[email protected]/
>>Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists
>
>Hi Phil,
>Is this a document search or a db search? Did you set a specific
>language in the CFIndex tag?
>
>Mark
>--
>Mark Leder, [EMAIL PROTECTED] on 03/01/2002

I thought I saw a custom tag at the DevCenter that addresses the junk
html strip-out. You may also want to check CFLib.org.

--
Mark Leder, [EMAIL PROTECTED] on 03/01/2002
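
For anyone wondering what the "strip out the extra and index only the
text" step might look like, here is a minimal sketch. It is written in
Python rather than CFML purely for illustration, and it is not the
DevCenter tag or anything from CFLib.org. The names TextOnly and
visible_text are made up for this example. The idea is to drop
everything inside <style> and <script> blocks, ignore comments and tags,
and keep only the text a reader would see, so a word like "hybrid" that
appears only in a Word style sheet never reaches the index.

```python
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Hypothetical helper: collects visible text and skips
    <style>/<script> contents. Comments are ignored because
    handle_comment is not overridden."""

    SKIP = {"style", "script"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []       # text chunks that will be kept
        self.skip_depth = 0   # > 0 while inside a skipped element

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element.
        if not self.skip_depth:
            self.parts.append(data)


def visible_text(html):
    """Return the visible text of an HTML string, whitespace-normalized."""
    parser = TextOnly()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


if __name__ == "__main__":
    sample = (
        "<html><head><style>p.hybrid { color: red }</style></head>\n"
        '<body><p class="MsoNormal">Electric cars &amp; trucks</p>'
        "<!-- hybrid --></body></html>"
    )
    print(visible_text(sample))
```

The cleaned string, rather than the raw uploaded file, would then be
what gets handed to the indexer. How that hand-off is done in CF 5 is
not covered here.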

