topN is ignored with maxNumSegments
-----------------------------------

                 Key: NUTCH-1074
                 URL: https://issues.apache.org/jira/browse/NUTCH-1074
             Project: Nutch
          Issue Type: Bug
          Components: generator
    Affects Versions: 1.3
            Reporter: Markus Jelsma
             Fix For: 1.4

When generating segments with both topN and maxNumSegments set, topN is not respected. It looks like the first generated segment contains roughly topN * maxNumSegments URLs; at least, the number of map input records roughly matches that figure.
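
To make the arithmetic of the report concrete, here is a minimal sketch in Python. It is not Nutch code: the function names are hypothetical, and it assumes that topN is meant as a cap on each generated segment, which is how the report reads but is not confirmed by it.

```python
def expected_segment_sizes(top_n, max_num_segments, available_urls):
    """Segment sizes if topN capped every segment (assumed intended behavior)."""
    sizes = []
    remaining = available_urls
    for _ in range(max_num_segments):
        if remaining <= 0:
            break
        take = min(top_n, remaining)  # never more than topN per segment
        sizes.append(take)
        remaining -= take
    return sizes


def reported_first_segment_size(top_n, max_num_segments, available_urls):
    """Approximate first-segment size as described in the report."""
    # The report observes roughly topN * maxNumSegments URLs in the first segment.
    return min(top_n * max_num_segments, available_urls)
```

With illustrative values topN = 1000, maxNumSegments = 3 and 10000 eligible URLs, the first function models three segments of at most 1000 URLs each, while the second models a single first segment of about 3000 URLs, which is the discrepancy the report describes.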