Hi.

You will probably want to run htdig with in verbose mode and redirect the
output to a file that you can examine at your leisure. Then you will want to
look for cases where a given file is being indexed multiple times. This can
happen if htdig encounters symbolic links or syntax errors in URL
specifications. Such occurrences result in a number of non-unique URLs
pointing to the same file, each of which is indexed repeatedly. The one time
I encountered this problem, it was because a user had specified something
like ../index.html/ in one file. The trailing / was enough to break things.
However, I think symbolic links are the more common cause for the type of
problem you are encountering.

Jim Cole

> However rundig takes 10-12 hours to run, but doesn't complete because
> db.docdb db.wordlist completely fill the 4Gb partition so rundig can't do
> the merge (I'm not using the -a option)


------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.

Reply via email to