The recrawl script for 0.9 I found in
http://wiki.apache.org/nutch/IntranetRecrawl is not working. It works
first time successfully. Second time, it fails with this error.
merging indexes to: crawl/index
IndexMerger: org.apache.hadoop.mapred.FileAlreadyExistsException:
Output directory crawl/index already exists!
at org.apache.nutch.indexer.IndexMerger.merge(IndexMerger.java:74)
at org.apache.nutch.indexer.IndexMerger.run(IndexMerger.java:148)
at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
at org.apache.nutch.indexer.IndexMerger.main(IndexMerger.java:111)
I am trying this with the latest version available in trunk. Please
help me to rectify this.