If I may branch off onto a related subject...
If I have an existing database, leave off the -i, and give htdig
a completely different start URL, why does it seem to go back
through the previously indexed pages and tell me "retrieved but not
changed" (I'm not sure how it determines this, as they are all CGI
scripts) before going off to index my new starting URL? This can be
very time-consuming on a large site.

Also, though I doubt it, is there an easy way to remove or
"invalidate" a URL that no longer exists? (Perhaps that's what the
"retrieved but not changed" check is for, but I really don't want it
to check _every_ page, and besides, this URL would still exist; I just
don't want it to come up.)
Thanks for any info.
Dave
(I'm running htdig 3.1.5.)
----- Original Message -----
From: "Gilles Detillieux" <[EMAIL PROTECTED]>
> OK, I think the problem is you left off the -i option on htdig, to make
> it reindex from scratch. Without -i, htdig will update the existing
> database, and won't delete documents that are already in the database.
>