OK, I think the problem is you left off the -i option on htdig, to make
it reindex from scratch.  Without -i, htdig will update the existing
database, and won't delete documents that are already in the database.

According to Shoba J:
> NO, These are two different problems, There is no connection between these
> two.
> ----- Original Message -----
> From: Gilles Detillieux <[EMAIL PROTECTED]>
> To: Shoba J <[EMAIL PROTECTED]>
> Cc: ht://Dig mailing list <[EMAIL PROTECTED]>
> Sent: Thursday, March 01, 2001 4:28 AM
> Subject: Re: [htdig] problem with exclude urls and meta tags
> 
> 
> > According to Shoba J:
> > > I recently installed htdig for searching our intranet. I am very new to
> > > the list. I have given a list of words for exclusion in the htdig.conf
> > > file and i have added meta tags to all the html files. I ran htdig and
> > > htmerge again to rebuild the database. But the results are showing files
> > > from urls which i had excluded and not very different even after adding
> > > meta tags.
> > >
> > > Am I missing anything?
> >
> > You don't say exactly what connection you seem to think there should be
> > between meta tags in the HTML files and the exclude_urls attribute setting
> > in your htdig.conf file.  There is no connection.  The exclude_urls
> attribute
> > is a list of substrings which, if any is found in a URL, that URL will be
> > excluded from the index.  It has nothing to do with the file's contents,
> > meta tags or otherwise, because htdig won't even fetch the file if its
> > URL matches one of the exclude_urls patterns.
> >
> > Unless I'm misunderstanding what you want, it seems you want a "stop list"
> > capability, where a file is excluded from the index if it contains certain
> > "stop words" in your list.  htdig doesn't have a stop list capability.


-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to