My htdig indexing process is missing many documents because it appears that htdig strips off "index.html" from HREFs that it encounters. I can't find anything in the configuration options that would cause that.
I have files that contain references to other files that look like:

<A HREF="/en/mybook/index.html">my book</A>

When htdig scans the file and finds this HREF, instead of trying to process the full URL, it only tries "/en/mybook/". On this server, I do not have DirectoryIndex set to "index.html", so such URLs fail. Therefore, htdig does not index those documents. When I use the -vvv option to htdig, this is what it tells me:

8:8:1:http://myserver.mydomain.com/en/mybook/: not found

What am I doing wrong? Why does it strip off the "index.html" when it follows such URLs? Does the server have to use DirectoryIndex set to index.html?

bobs
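To make the symptom concrete, here is a minimal Python sketch of the rewriting that appears to be happening to these URLs. It is not htdig's code, only an illustration of the observed behaviour; the function name `strip_default_doc` and the `DEFAULT_DOCS` list are hypothetical, and the only file name taken from the case above is "index.html".

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical list of "default document" names; "index.html" is the one
# observed being stripped in the case described above.
DEFAULT_DOCS = ("index.html",)


def strip_default_doc(url, default_docs=DEFAULT_DOCS):
    """Return url with a trailing default document name removed from its path."""
    parts = urlsplit(url)
    # Split the path into its directory part and its last component.
    directory, _, filename = parts.path.rpartition("/")
    if filename in default_docs:
        # Keep the trailing slash so the result names the directory itself.
        parts = parts._replace(path=directory + "/")
    return urlunsplit(parts)


print(strip_default_doc("http://myserver.mydomain.com/en/mybook/index.html"))
```

With the URL from the HREF above, the sketch yields the same directory URL that shows up in the -vvv output, which the server then rejects because it has no directory index configured for it. Any other file name is passed through unchanged.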

