My htdig indexing process is missing many documents because
it appears that htdig strips off "index.html" from HREFs
that it encounters.  I can't find anything in the 
configuration options that would cause that.

I have files that contain references to other files
that look like:

<A HREF="/en/mybook/index.html">my book</A>

When htdig scans the file and finds this HREF, 
instead of trying to process the full URL, it
only tries "/en/mybook/".  On this server, I do
not have DirectoryIndex set to "index.html", so
such URLs fail.  Therefore, htdig does not
index those documents.  When I use the -vvv option
to htdig, this is what it tells me:

8:8:1:http://myserver.mydomain.com/en/mybook/:  not found

What am I doing wrong?  Why does it strip
off the "index.html" when it follows such URLs?
Does the server have to use DirectoryIndex
set to index.html?

bobs

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to