Problem in IndexSorter after dedup
----------------------------------
Key: NUTCH-259
URL: http://issues.apache.org/jira/browse/NUTCH-259
Project: Nutch
Type: Bug
Components: indexer
Reporter: Michael
Priority: Minor
When trying to run IndexSorter i'm getting an error:
Exception in thread "main" java.lang.IllegalArgumentException: attempt to
access a deleted document
at
org.apache.lucene.index.SegmentReader.document(SegmentReader.java:282)
at
org.apache.lucene.index.FilterIndexReader.document(FilterIndexReader.java:104)
at
org.apache.nutch.indexer.IndexSorter$SortingReader.document(IndexSorter.java:170)
at
org.apache.lucene.index.SegmentMerger.mergeFields(SegmentMerger.java:186)
at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:88)
at org.apache.lucene.index.IndexWriter.addIndexes(IndexWriter.java:579)
at org.apache.nutch.indexer.IndexSorter.sort(IndexSorter.java:240)
at org.apache.nutch.indexer.IndexSorter.main(IndexSorter.java:291)
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira