How about crawling to a new segment directory and merge to the original one---where tomcat lives in?
But, I guess, in order to reflect your new crawled data, you need at least stop and restart tomcat. ( ? I did such testing, not sure if it is 100% correct, I didn't see a formal document talking about data updating in lucene indexing for nutch segment merging) Michael Ji --- blackwater dev <[EMAIL PROTECTED]> wrote: > I have completed a crawl and have my crawl > directory. I now want to > set up a cron job to run nightly to keep updating > this directory. How > do I do this so I don't have to create a new > directory each time (It > dies if the directory exists) and so I can keep > tomcat running without > restarts into the new directory? > > Thanks! > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com
