"
bin/nutch updatedb db ...
"

to update the db's URL list, starting from the previous day's db.

I think it works. You need to create a new segment to
hold the newly fetched content, and merge the segments afterwards.

But one problem I ran into is how to control the
fetch depth. Without any control, the fetch list will
grow very fast day after day (at least exponentially, I
guess).
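
To illustrate the growth, here is a rough sketch in Python. It is a toy
model, not anything from Nutch itself: the function name frontier_sizes and
the parameters branching, rounds and max_depth are made up for this example,
and it assumes every fetched page contributes the same number of previously
unseen URLs, which a real intranet will not do.

```python
def frontier_sizes(branching, rounds, max_depth=None):
    """Return how many URLs would be fetched in each daily round.

    branching: assumed number of new, unseen URLs found per fetched page.
    rounds:    number of update/fetch rounds to simulate.
    max_depth: optional cap on link depth; None means no cap.
    """
    sizes = []
    current = 1  # a single seed URL at depth 0
    for depth in range(rounds):
        if max_depth is not None and depth > max_depth:
            # Past the depth cap nothing new is scheduled for fetching.
            sizes.append(0)
            continue
        sizes.append(current)
        current *= branching  # every fetched page adds `branching` new URLs
    return sizes


if __name__ == "__main__":
    print(frontier_sizes(10, 5))               # no depth control
    print(frontier_sizes(10, 5, max_depth=2))  # stop following links past depth 2
```

Under these assumptions the uncapped list grows by a factor of `branching`
every round, while a depth cap makes it drop to zero once the cap is passed.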

Regards,

Michael Ji


--- Robert Goene <[EMAIL PROTECTED]> wrote:

> Hi List,
> 
> I am doing my first experiments with Nutch and am
> wondering how I can 
> update a previous intranet crawl? It seems like I
> have to run a complete 
> new crawl and replace the existing index with the
> new index.
> Is there a better way of updating the index, that
> is: recrawling 
> pages and reindexing these pages?
> 
> Regards, Robert
> 



                