Re: [htdig] Recrusiv Digging

Michael Reutlinger Tue, 22 Jun 1999 10:45:19 -0700


Hi,

 Thanks for your answer.

> It does realize it saw a page. However, its criterion is based on the URL.
> So if you have several URLs pointing to the same document, you're going to
> get duplicates. More powerful duplicate elimination code is in the works.

 On our web server we have some MHonArc archives, all with
 the same URL!
  
 The whole server holds about 3,500 documents, and the last
 run told me that htDig has about 55,000 documents in its
 database. I think this is way too many ;)

 We had one "cross link" between different directories, done only
 with filesystem symlinks, but we eliminated that one (so there
 should be 100 fewer files ;))
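
 To illustrate the URL-based check described in the quoted reply, here
 is a minimal Python sketch. It is not htDig's actual code: the function
 names and the example.com URLs are made up for illustration. It only
 shows why a "seen" set keyed on the URL counts a symlinked file twice,
 while a set keyed on a hash of the content would count it once.

```python
import hashlib


def count_by_url(pages):
    """Count documents using a 'seen' set keyed on the URL string."""
    seen = set()
    for url, _body in pages:
        seen.add(url)
    return len(seen)


def count_by_content(pages):
    """Count documents using a 'seen' set keyed on a hash of the body."""
    seen = set()
    for _url, body in pages:
        digest = hashlib.sha256(body.encode("utf-8")).hexdigest()
        seen.add(digest)
    return len(seen)


# Hypothetical crawl result: the second URL reaches the same file as the
# first one through a symlinked directory.
pages = [
    ("http://example.com/archive/msg00001.html", "Message one"),
    ("http://example.com/mirror/msg00001.html", "Message one"),
    ("http://example.com/archive/msg00002.html", "Message two"),
]

print(count_by_url(pages), count_by_content(pages))
```

 With the sample data, the URL-keyed count includes the symlinked copy
 as a separate document and the content-keyed count does not.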

 Actually, I don't see a message like "already have this document,
 skipping everything inside ..."

 Do you have any idea about this?

Thanks

 Michael

