According to Jeff Johnson:
> I just got htdig setup and working. We are currently saving the documents
> as the case number (ex: 400cv9999.pdf). I have added the line
> extra_word_characters:: and modified search_algorithm: to include
> substring:1 in the htdig.conf file. If you search for 4:00cv9999 (we don't
> use the semi colon in the file name), it will give you a listing of all
> documents that have that case number in it. The problem is, if they enter a
> partial search, cv9999, it will list the documents, but it also shows
> listings for other documents. These just point back to the directory the
> documents are saved in. It appears that when it indexes, it is indexing the
> file name also. Is there a way to exclude those extra listings? Thanks.
According to David Adams:
> I'm not certain I understand you. Is your problem that you have indexed
> index.html pages, and don't want them returned on a search?
>
> If so then there are two solutions. The simpler is to use the "exclude"
> option on the search form:
>
> <input type=hidden name=exclude value="index.html">
>
> The other is to add
> <META name="robots" content="noindex, follow">
> to the head of each file you wish to exclude and then rerun htdig & htmerge.
On the other hand, if you're talking about excluding directory indexes
automatically generated by Apache, then follow the suggestions in these
messages:
http://www.mail-archive.com/[email protected]/msg05984.html
http://www.mail-archive.com/[email protected]/msg05987.html
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html