Hi Everyone,
Having recently performed a crawl, and attempting to search the index via my
browser, I was left looking at a white screen instead of viewing search results.
Upon viewing the text within my cygwin window I immediately noticed the
following output:
Dedup: done
merging indexes to: results/index
Exception in thread "main" org.apache.hadoop.mapred.FileAlreadyExistsException:
Output directory results/index already exists!
at org.apache.nutch.indexer.IndexMerger.merge(IndexMerger.java:77)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:174)
My question... is it necessary for me to delete my existing index folder within
the results directory before I can view any search results?
Thank you in advance
Lewis Mc
Glasgow Caledonian University is a registered Scottish charity, number SC021474
Winner: Times Higher Education's Widening Participation Initiative of the Year
2009 and Herald Society's Education Initiative of the Year 2009
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html