According to Jeff Johnson:
> I just got htdig set up and working.  We are currently saving the documents
> as the case number (ex: 400cv9999.pdf).  I have added the line
> extra_word_characters:: and modified search_algorithm: to include
> substring:1 in the htdig.conf file.  If you search for 4:00cv9999 (we don't
> use the colon in the file name), it will give you a listing of all
> documents that have that case number in it.  The problem is, if they enter a
> partial search, cv9999, it will list the documents, but it also shows
> listings for other documents.  These just point back to the directory the
> documents are saved in.  It appears that when it indexes, it is indexing the
> file name also.  Is there a way to exclude those extra listings?  Thanks.

According to David Adams:
> I'm not certain I understand you.  Is your problem that you have indexed
> index.html pages, and don't want them returned on a search?
> 
> If so then there are two solutions.  The simpler is to use the "exclude"
> option on the search form:
> 
> <input type=hidden name=exclude value="index.html">
> 
> The other is to add
> <META name="robots" content="noindex, follow">
> to the head of each file you wish to exclude and then rerun htdig & htmerge.
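
For context, here is a minimal sketch of where that hidden "exclude" field
would sit in a search form.  The action path /cgi-bin/htsearch and the
config name "htdig" are assumptions about a typical installation, not
details taken from the messages above; adjust both to match your setup.

```html
<!-- Sketch of an htsearch form; action path and config name are assumed. -->
<form method="post" action="/cgi-bin/htsearch">
  <input type="hidden" name="config" value="htdig">
  <!-- Results whose URL contains this pattern are left out of the listing. -->
  <input type="hidden" name="exclude" value="index.html">
  <input type="text" size="30" name="words" value="">
  <input type="submit" value="Search">
</form>
```

As far as I know, the exclude value is matched against the URL of each
result, so it only hides pages whose URL actually contains that string.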

On the other hand, if you're talking about excluding directory indexes
automatically generated by Apache, then follow the suggestions in these
messages:

    http://www.mail-archive.com/[email protected]/msg05984.html
    http://www.mail-archive.com/[email protected]/msg05987.html

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
