> From: Geoff Hutchison <[EMAIL PROTECTED]>
> Date: Sat, 13 Mar 1999 14:30:54 -0500

> New server: www.htdig.org, 80
> 0:2:0:http://www.htdig.org/: ++ size = 373
> 1:3:1:http://www.htdig.org/contents.html: *++++++++++++++++++++++- size = 3396
> 2:4:1:http://www.htdig.org/main.html: *-----*-*+ size = 7062
> 3:5:2:http://www.htdig.org/require.html: *--* size = 4227
> ..
> 28:5:2:http://www.htdig.org/require.html:  not changed
> 
> This is a big bug--it's trying to reindex the same URLs.

Sorry, when I try to repeat by indexing htdig.org, I don't see this (the
"not changed" line, which I assume is the problem).
 If you can point to the .conf-file, I can probably track it down (unless
it's related to local_urls, of course).

> Shouldn't it be
> realizing that it's indexed those URLs and toss them before it makes a new
> DocID, sends the HTTP request to find out that it's "not changed?"

There's no new DocID, at least not for require.html; the "5" is the DocID,
not the "28" - that's just a running counter for this "dig".

The leads point to Retriever.cc:Need2Get() returning nonzero in this case,
which indicates problems with the Dictionary class.  I didn't touch that
one (but the bugs seldom care about what parts of the code you actually
change ;-)

brgds, H-P

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to