On Thu, 11 Oct 2001, Gilles Detillieux wrote:

+ According to Malcolm Austen:
+ >     remove_default_doc: index.htm
+ > will also strip out index.html ... hey it's a feature?
+ Are you sure about this?

Ah, well, I was, but I don't seem to be able to repeat it in isolation.
Maybe I fooled myself into misreading the -vvv output I generated while
hunting the missing .xml files, I guess it just fitted too neatly in with
index.html.ID=... being stripped out!

+ suggest a bug in CompareWord(), which is an unsettling prospect indeed.

It looks like CompareWord() can breathe a sigh of relief 8-)

+ >     remove_default_doc: index.html index.htm index.xml
+ > and to my horror htdig is no longer indexing pages that are not the
+ > default document like 'index.xml.ID=something'
+
+ Ah, that would be a bug!  removeIndex() makes an exception for query
+ strings beginning with "?", but doesn't test for other things that may
+ be appended to the file name.  It should.

It was me who persuaded our AxKit man not to use '?' because I have a
blanket exclusion on indexing '?' URLs , oh well 8-)

+ Please try out the following patch, but only after confirming that
+ the unpatched code does indeed allow a pattern of index.htm to match
+ index.html.

Please take the .htm/.html part as a red herring unless I stumble upon
some reproducible evidence to the contrary!

+ Apply this in your 3.1.5 or 3.1.6 snapshot main source directory, using
+ "patch -p0 < this-message".

I don't have a test setup currently but maybe I'll ask our sysadmin to
apply the patch after next Monday night's database update, then I have the
rest of the week to play and ask for the old version to be reinstated!

Thanks,
        Malcolm.

 [EMAIL PROTECTED]     http://users.ox.ac.uk/~malcolm/


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to