How about crawling to a new segment directory and
merge to the original one---where tomcat lives in?

But, I guess, in order to reflect your new crawled
data, you need at least stop and restart tomcat. ( ? I
did such testing, not sure if it is 100% correct, I
didn't see a formal document talking about data
updating in lucene indexing for nutch segment merging)

Michael Ji

--- blackwater dev <[EMAIL PROTECTED]> wrote:

> I have completed a crawl and have my crawl
> directory.  I now want to
> set up a cron job to run nightly to keep updating
> this directory.  How
> do I do this so I don't have to create a new
> directory each time (It
> dies if the directory exists) and so I can keep
> tomcat running without
> restarts into the new directory?
> 
> Thanks!
> 


__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 

Reply via email to