Hello Nutch Users,
I have about 25 segments that I am trying to merge, but every
approach I try ends in a fatal error. If you have any idea what is
wrong, I would appreciate the help. I have included the commands I ran
and the errors they produce.
$nutch mergesegs my-segments/ -o output_segment_dir -m my-segments/20050309012353
...
050502 164206 Segment 20050413210403: 199 entries.
050502 164206 Segment 20050413201837: 0 entries.
050502 164206 TOTAL 858045 input entries in 59 segments.
050502 164206 Looking for master index in segments-tripset/20050309012353
050502 164206 SEVERE No master index, and createMaster == false
$nutch mergesegs my-segments -o output_segment_dir -cm
(This runs at first, but after 4 minutes the following error occurs.)
...
java.io.FileNotFoundException: my-segments/20050118104221/index/_1rq.f5 (Too many open files)
    at java.io.RandomAccessFile.open(Native Method)
    at java.io.RandomAccessFile.<init>(RandomAccessFile.java:204)
    at org.apache.lucene.store.FSInputStream$Descriptor.<init>(FSDirectory.java:376)
    at org.apache.lucene.store.FSInputStream.<init>(FSDirectory.java:405)
    at org.apache.lucene.store.FSDirectory.openFile(FSDirectory.java:268)
    at org.apache.lucene.index.SegmentReader.openNorms(SegmentReader.java:369)
    at org.apache.lucene.index.SegmentReader.initialize(SegmentReader.java:122)
    at org.apache.lucene.index.SegmentReader.<init>(SegmentReader.java:94)
    at org.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:480)
    at org.apache.lucene.index.IndexWriter.maybeMergeSegments(IndexWriter.java:458)
    at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:310)
    at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:294)
    at net.nutch.indexer.IndexSegment.indexPages(IndexSegment.java:121)
    at net.nutch.indexer.IndexSegment.main(IndexSegment.java:223)
    at net.nutch.tools.SegmentMergeTool.run(SegmentMergeTool.java:202)
    at net.nutch.tools.SegmentMergeTool.main(SegmentMergeTool.java:358)
050502 165126 SEVERE my-segments/20050118104221/index/_1rq.f5 (Too many open files)
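
In case it helps with diagnosis, here is a rough sketch of how the
per-process open-file limit could be compared with the number of index
files sitting under the segments directory. The function name
count_index_files and the SEGMENTS_DIR variable are placeholders of my
own, not part of Nutch, and the sketch assumes a POSIX shell with find
available. It only reports numbers and does not change any limit.

```shell
# Hypothetical helper (not part of Nutch): count regular files that live
# inside any "index" directory below the given segments directory.
count_index_files() {
    find "$1" -path '*/index/*' -type f 2>/dev/null | wc -l | tr -d ' '
}

# Assumed location of the segments; override by exporting SEGMENTS_DIR.
SEGMENTS_DIR="${SEGMENTS_DIR:-my-segments}"

# Soft limit on open file descriptors for processes started from this shell.
echo "open-file soft limit: $(ulimit -n)"

# Number of index files the merge could potentially need to open.
echo "index files under $SEGMENTS_DIR: $(count_index_files "$SEGMENTS_DIR")"
```

If the second number is close to or above the first, the limit itself
may be what the merge is hitting, though that is only a guess on my part.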
zennet