The recrawl script for 0.9 I found in
http://wiki.apache.org/nutch/IntranetRecrawl is not working. It works
first time successfully. Second time, it fails with this error.

merging indexes to: crawl/index
IndexMerger: org.apache.hadoop.mapred.FileAlreadyExistsException:
Output directory crawl/index already exists!
        at org.apache.nutch.indexer.IndexMerger.merge(IndexMerger.java:74)
        at org.apache.nutch.indexer.IndexMerger.run(IndexMerger.java:148)
        at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
        at org.apache.nutch.indexer.IndexMerger.main(IndexMerger.java:111)

I am trying this with the latest version available in trunk. Please
help me to rectify this.

Reply via email to