Author: dogacan Date: Sun Jul 8 23:44:18 2007 New Revision: 554539 URL: http://svn.apache.org/viewvc?view=rev&rev=554539 Log: NUTCH-503 - Generator exits incorrectly for small fetchlists.
Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/src/java/org/apache/nutch/crawl/Generator.java Modified: lucene/nutch/trunk/CHANGES.txt URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/CHANGES.txt?view=diff&rev=554539&r1=554538&r2=554539 ============================================================================== --- lucene/nutch/trunk/CHANGES.txt (original) +++ lucene/nutch/trunk/CHANGES.txt Sun Jul 8 23:44:18 2007 @@ -78,6 +78,9 @@ 25. NUTCH-507 - lib-lucene-analyzers jar defintion is wrong in plugin.xml. (Emmanuel Joke via dogacan) +26. NUTCH-503 - Generator exits incorrectly for small fetchlists. + (Vishal Shah via dogacan) + Release 0.9 - 2007-04-02 1. Changed log4j confiquration to log to stdout on commandline Modified: lucene/nutch/trunk/src/java/org/apache/nutch/crawl/Generator.java URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/src/java/org/apache/nutch/crawl/Generator.java?view=diff&rev=554539&r1=554538&r2=554539 ============================================================================== --- lucene/nutch/trunk/src/java/org/apache/nutch/crawl/Generator.java (original) +++ lucene/nutch/trunk/src/java/org/apache/nutch/crawl/Generator.java Sun Jul 8 23:44:18 2007 @@ -422,7 +422,16 @@ // check that we selected at least some entries ... SequenceFile.Reader[] readers = SequenceFileOutputFormat.getReaders(job, tempDir); - if (readers == null || readers.length == 0 || !readers[0].next(new FloatWritable())) { + boolean empty = true; + if (readers != null && readers.length > 0) { + for (int num = 0; num < readers.length; num++) { + if (readers[num].next(new FloatWritable())) { + empty = false; + break; + } + } + } + if (empty) { LOG.warn("Generator: 0 records selected for fetching, exiting ..."); LockUtil.removeLockFile(fs, lock); fs.delete(tempDir); ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-cvs mailing list Nutch-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nutch-cvs