If I may off shoot on a related subject....

If I have an existing database, leave off the -i, and give htdig
a completely different start url, why then does it seem to go
back through the previously indexed pages and tell me "retrieved but not
changed",
( I'm not sure how it determines this, as they all are cgi scripts ) before
going off to index my new starting url??  This can be very time consuming
on a large site.  Also... though I doubt it, but is there an easy way to
remove
or "invalidate" a url that no longer exists?  ( perhaps that's what the
"retrieved but not changed" check is for, but I really don't want it to
check _every_
page, and besides, this url would exist, I just don't want it to come up.

Thanks for any info.
Dave

( I'm running 3.1.5 )


----- Original Message -----
From: "Gilles Detillieux" <[EMAIL PROTECTED]>

> OK, I think the problem is you left off the -i option on htdig, to make
> it reindex from scratch.  Without -i, htdig will update the existing
> database, and won't delete documents that are already in the database.
>



_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to