I had a look at the bad_word file that came with htdig. It's very small, so
many very common words would still be indexed.
I've created a much larger list - partly based on the standard "stop words"
from SWISH-E but edited and extended. This takes into account how htdig
treats apostrophes by default.
I'm using this basic list to create site-specific lists with extra words
that occur on practically every page in a site (such as my name ;-)).
If anyone is interested in the basic list, which now contains 348 "words",
I can zip it up and post it on the web somewhere. No private emails,
please, just post to the list and I'll post the URL to the list.
Marjolein Katsma [EMAIL PROTECTED]
Java Woman - http://javawoman.com/
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.