"
bin/nutch updatedb db ...
"
to update the db's URL list from the previous day's db.
I think it works: you need to create a new segment to
hold the newly fetched content and merge it afterwards.
The problem I ran into is how to control the fetch
depth. Without any control, the fetch list will grow
very fast day after day (at least exponentially, I
guess).
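
To make the growth worry concrete, here is a rough toy model in plain
Python (not Nutch code). It assumes every fetched page contributes a
fixed number of links to pages never seen before, which overstates
real growth on an intranet where most links point back to known
pages. The names simulate, branching and max_depth are made up for
this illustration, and how such a depth limit would be enforced in
Nutch itself is not shown.

```python
# Toy model of how the URL db grows when it is updated and refetched daily.
# Assumption: each fetched page yields `branching` links to unseen pages.

def simulate(days, branching, max_depth=None):
    """Return the number of known URLs after each daily update."""
    known = 1      # start from a single seed URL
    frontier = 1   # unfetched URLs, all at the same link depth in this model
    depth = 0      # link depth of the URLs currently in the frontier
    history = []
    for _ in range(days):
        discovered = frontier * branching
        depth += 1  # the new links sit one hop further from the seed
        if max_depth is not None and depth > max_depth:
            discovered = 0  # links beyond the depth limit are dropped
        known += discovered
        frontier = discovered
        history.append(known)
    return history

uncapped = simulate(days=5, branching=10)
capped = simulate(days=5, branching=10, max_depth=3)
print("no depth limit:", uncapped)
print("max_depth=3   :", capped)
```

Under these assumptions the uncapped db grows by a factor of about
`branching` per day, while the depth-limited run stops growing once
the limit is reached.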
Regards,
Michael Ji
--- Robert Goene <[EMAIL PROTECTED]> wrote:
> Hi List,
>
> I am doing my first experiments with Nutch and am
> wondering how I can
> update a previous intranet crawl. It seems like I
> have to run a complete
> new crawl and replace the existing index with the
> new index.
> Is there a better way of updating the index, that
> is, recrawling the
> pages and reindexing them?
>
> Regards, Robert
>